nextGPT
Project description
nextGPT
📢 Open source implementation for ChatGPT replica to build the end-to-end pipeline from SFT to RLHF.
- 🔥 Step 1) SFT: Surpervised Fine-tuning
- 🔥 Step 2) RM: Reward Model
- 🔥 Step 3) PPO: Proximal Policy Optimization
Installation
$ pip install nextgpt
or install from git repo to the get always the latest version.
$ git clone https://github.com/louiezzang/next-gpt.git
$ cd next-gpt/
$ pip install .
$ cd ../
Examples
See chatGPT example
RLHF
What is RLHF?
Implementation of RLHF (Reinforcement Learning with Human Feedback) was powered by Colossal-AI. More details can be found in the blog.
The RLHF was forked and modified from these git repos.
- https://github.com/airobotlab/KoChatGPT/tree/main/colossalai_ChatGPT_230319
- https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat
References
- https://github.com/airobotlab/KoChatGPT
- https://github.com/airobotlab/KoChatGPT/tree/main/colossalai_ChatGPT_230319
- https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat
- https://github.com/juncongmoo/chatllama
- https://github.com/huggingface/peft
- https://github.com/dredwardhyde/gpt-neo-fine-tuning-example/blob/main/gpt_neo.py
- https://github.com/databrickslabs/dolly
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
nextgpt-0.0.4.tar.gz
(34.6 kB
view details)
File details
Details for the file nextgpt-0.0.4.tar.gz.
File metadata
- Download URL: nextgpt-0.0.4.tar.gz
- Upload date:
- Size: 34.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c534c6f3d98598b6954e9f15add8ab790fcc299746a86c97d1a9e0b8790298b9
|
|
| MD5 |
01914ad34ef72c591210f143c7254011
|
|
| BLAKE2b-256 |
7cfc5ba1a653efa334f8d50994a7f9728fa5dacedaba7ebb3135d0000e622963
|