Skip to main content

Reinforcment Learning from Human Feedback

Project description

LLMs that are Helpful, Honest, Harmless, and Huggy 🤗

Installation

Download and install h4 by running:

python -m pip install h4

If you want the bleeding-edge version, install from source by running:

python -m pip install git+https://github.com/huggingface/h4.git

Developer installation

To contribute code to this project, first create a Python virtual environment using e.g. Conda:

conda create -n h4 python=3.8 && conda activate h4

Then install the base requirements with:

python -m pip install -e '.[dev]'

This will install core packages like black and isort that we use to ensure consistent code formatting.

Formatting your code

We use black and isort to ensure consistent code formatting. After following the installation steps, you can check your code locally by running:

make style && make quality

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

h4-0.1.0.tar.gz (7.4 kB view hashes)

Uploaded Source

Built Distribution

h4-0.1.0-py3-none-any.whl (6.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page