nextGPT

📢 An open-source implementation of a ChatGPT replica that covers the end-to-end pipeline from SFT to RLHF.

  • 🔥 Step 1) SFT: Supervised Fine-tuning
  • 🔥 Step 2) RM: Reward Model
  • 🔥 Step 3) PPO: Proximal Policy Optimization
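
The objectives usually associated with steps 2 and 3 can be sketched in a few lines. This is an illustrative sketch only, not nextGPT's actual code: the function names `pairwise_reward_loss` and `ppo_clipped_objective` are hypothetical, and it assumes the common formulation in which the reward model is trained with a pairwise ranking loss and the policy is updated with PPO's clipped surrogate objective.

```python
import math


def pairwise_reward_loss(chosen_reward: float, rejected_reward: float) -> float:
    """Pairwise ranking loss often used for reward models (assumed here).

    Computes -log(sigmoid(r_chosen - r_rejected)), which is small when the
    human-preferred response receives the higher reward.
    """
    diff = chosen_reward - rejected_reward
    # -log(sigmoid(diff)) == softplus(-diff), written in a numerically stable form.
    return max(-diff, 0.0) + math.log1p(math.exp(-abs(diff)))


def ppo_clipped_objective(ratio: float, advantage: float, clip_eps: float = 0.2) -> float:
    """PPO clipped surrogate objective for a single sample.

    `ratio` is pi_new(a|s) / pi_old(a|s). Taking the minimum of the unclipped
    and clipped terms removes the incentive to move the ratio far from 1.
    """
    clipped_ratio = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
    return min(ratio * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    # The loss shrinks as the preferred response is scored higher.
    print(pairwise_reward_loss(2.0, 0.0), pairwise_reward_loss(0.0, 2.0))
    # With a positive advantage, gains from ratio > 1 + clip_eps are capped.
    print(ppo_clipped_objective(1.5, 1.0))
```

In a real training loop these quantities would be computed over batches of tensors and averaged; scalars are used here only to keep the sketch dependency-free.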

ChatGPT Diagram

Installation

$ pip install nextgpt

Or install from the Git repository to always get the latest version.

$ git clone https://github.com/louiezzang/next-gpt.git
$ cd next-gpt/
$ pip install .
$ cd ../
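
To confirm that either installation route worked, the installed version can be read back from the package metadata. The following is a minimal sketch using only the Python standard library; the helper name `installed_version` is hypothetical, and it assumes the distribution is registered under the name `nextgpt`, as in the `pip install` command above.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str = "nextgpt") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        print("nextgpt is not installed in this environment")
    else:
        print(f"nextgpt {version} is installed")
```

This checks only the package metadata, so it does not import the package or verify that its dependencies are usable.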

Examples

See chatGPT example

RLHF

What is RLHF?

The RLHF (Reinforcement Learning from Human Feedback) implementation is powered by Colossal-AI. More details can be found in the blog.

The RLHF code was forked and modified from these Git repos.
