nextGPT


📢 An open-source implementation of a ChatGPT replica that builds the end-to-end pipeline from SFT to RLHF.

🔥 Step 1) SFT: Supervised Fine-tuning
🔥 Step 2) RM: Reward Model
🔥 Step 3) PPO: Proximal Policy Optimization

ChatGPT Diagram
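
To make the three steps concrete, here is a minimal sketch of the objective commonly associated with each stage, written in plain Python on scalar inputs. It is not taken from the nextgpt source: the function names (`sft_loss`, `reward_ranking_loss`, `ppo_clipped_objective`) and the default `clip_eps=0.2` are assumptions chosen for illustration, and a real pipeline would compute these over batches of tensors.

```python
import math


def sft_loss(target_token_probs):
    """Step 1 (SFT): mean negative log-likelihood of the reference tokens."""
    return -sum(math.log(p) for p in target_token_probs) / len(target_token_probs)


def reward_ranking_loss(reward_chosen, reward_rejected):
    """Step 2 (RM): pairwise loss -log(sigmoid(r_chosen - r_rejected))."""
    margin = reward_chosen - reward_rejected
    # -log(sigmoid(m)) is equal to log(1 + exp(-m)).
    return math.log1p(math.exp(-margin))


def ppo_clipped_objective(ratio, advantage, clip_eps=0.2):
    """Step 3 (PPO): clipped surrogate for one sample (to be maximized)."""
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    return min(ratio * advantage, clipped_ratio * advantage)
```

The ranking loss shrinks as the reward model scores the preferred answer higher than the rejected one, and the clipping in the PPO objective limits how far a single update can move the policy away from the one that generated the samples.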

Installation

$ pip install nextgpt

Or install from the git repo to always get the latest version.

$ git clone https://github.com/louiezzang/next-gpt.git
$ cd next-gpt/
$ pip install .
$ cd ../
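
After either install route, a quick way to confirm which version ended up in the environment is to query the package metadata from Python. This is a small sketch using the standard library's `importlib.metadata`; the helper name `installed_version` is an assumption for illustration, and the only thing it relies on is the distribution name `nextgpt` used in the `pip install` command above.

```python
from importlib import metadata


def installed_version(dist_name):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version("nextgpt")
    print(f"nextgpt {version}" if version else "nextgpt is not installed")
```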

Examples

See chatGPT example

RLHF

What is RLHF?

The RLHF (Reinforcement Learning from Human Feedback) implementation is powered by Colossal-AI. More details can be found in the blog.

The RLHF code was forked and modified from these git repos.


Release history

0.0.4 (this version): Apr 15, 2023
0.0.3: Apr 10, 2023
0.0.2: Apr 10, 2023

Download files

Source Distribution

nextgpt-0.0.4.tar.gz (34.6 kB), uploaded Apr 15, 2023

File details

Details for the file nextgpt-0.0.4.tar.gz.

File metadata

Download URL: nextgpt-0.0.4.tar.gz
Upload date: Apr 15, 2023
Size: 34.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.7

File hashes

Hashes for nextgpt-0.0.4.tar.gz
SHA256: `c534c6f3d98598b6954e9f15add8ab790fcc299746a86c97d1a9e0b8790298b9`
MD5: `01914ad34ef72c591210f143c7254011`
BLAKE2b-256: `7cfc5ba1a653efa334f8d50994a7f9728fa5dacedaba7ebb3135d0000e622963`
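
The published digests can be used to check a downloaded archive before installing it. The sketch below uses only the standard library's `hashlib`; the helper names and the assumption that the sdist sits in the current directory are illustrative, while the expected value is the SHA256 digest listed above.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for nextgpt-0.0.4.tar.gz.
EXPECTED_SHA256 = "c534c6f3d98598b6954e9f15add8ab790fcc299746a86c97d1a9e0b8790298b9"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 equals the expected digest (case-insensitive)."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Assumed location: the sdist sits in the current directory.
    sdist = Path("nextgpt-0.0.4.tar.gz")
    if sdist.exists():
        print("hash OK" if matches_published_hash(sdist) else "hash MISMATCH")
    else:
        print(f"{sdist} not found; download it first")
```

Reading in chunks keeps memory use flat regardless of archive size, which matters little for a 34.6 kB file but makes the helper reusable for larger downloads.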
