nextGPT
Project description
nextGPT
📢 Open source implementation for ChatGPT replica to build the end-to-end pipeline from SFT to RLHF.
- 🔥 Step 1) SFT: Surpervised Fine-tuning
- 🔥 Step 2) RM: Reward Model
- 🔥 Step 3) PPO: Proximal Policy Optimization
Installation
$ pip install nextgpt
or
$ git clone https://github.com/louiezzang/next-gpt.git
$ cd next-gpt/
$ pip install .
$ cd ../
Examples
See chatGPT example
RLHF
What is RLHF?
Implementation of RLHF (Reinforcement Learning with Human Feedback) was powered by Colossal-AI. More details can be found in the blog.
The RLHF was forked and modified from these git repos.
- https://github.com/airobotlab/KoChatGPT/tree/main/colossalai_ChatGPT_230319
- https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat
References
- https://github.com/airobotlab/KoChatGPT
- https://github.com/airobotlab/KoChatGPT/tree/main/colossalai_ChatGPT_230319
- https://github.com/hpcaitech/ColossalAI/tree/main/applications/Chat
- https://github.com/huggingface/peft
- https://github.com/dredwardhyde/gpt-neo-fine-tuning-example/blob/main/gpt_neo.py
- https://github.com/databrickslabs/dolly
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
nextgpt-0.0.2.tar.gz
(32.7 kB
view hashes)