Axolotl Plugin integrating Nous Research's Atropos for RL
Project description
plugin-atropos
A plugin to train LLMs using Axolotl with Atropos, a large-scale RL Gym.
Installation
Install axolotl + vllm if not already installed
pip install axolotl[vllm,flash-attn]
Install Atropos
git clone https://github.com/NousResearch/atropos.git
cd atropos
git checkout add-support-for-custom-api-servers
pip install -e .
cd -
Install this plugin
git clone https://github.com/axolotl-ai-cloud/plugin-atropos.git
cd plugin-atropos
pip install -e .
Usage
Note: requires 3 GPUs: 0,1 runs vllm, 2 runs trainer
# start the vLLM server (can take a few minutes), this will block the session
CUDA_VISIBLE_DEVICES=0,1 axolotl vllm-serve examples/train-fft.yaml --serve-module plugin_atropos.vllm_serve
# CUDA_VISIBLE_DEVICES=0,1 vllm serve Qwen/Qwen3-4B --port 9001 --host 0.0.0.0 --tensor-parallel-size=2 --max-model-len 4096 --kv-cache-dtype fp8
# in a new terminal session
# start the API server in the background and redirect both stdout and stderr
run-api &> logs.txt &
# start the RL environment, this will block the session
python examples/gsm8k_server.py serve --slurm false
Start the trainer
CUDA_VISIBLE_DEVICES=2 axolotl train examples/train-fft.yaml
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file axolotl_plugin_atropos-0.0.2.tar.gz.
File metadata
- Download URL: axolotl_plugin_atropos-0.0.2.tar.gz
- Upload date:
- Size: 24.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
008f83594e1f90ebbc601f23e41fbce3837d193eb063b69bc59f04b7a93cc63f
|
|
| MD5 |
76e85d381fd6cb19cfa9fc1047ca1bc4
|
|
| BLAKE2b-256 |
fbbaf2116a22fe324584729b298c07b6ec6b8bce8ffc8408460b6c4ef40c2ba4
|
Provenance
The following attestation bundles were made for axolotl_plugin_atropos-0.0.2.tar.gz:
Publisher:
pypi.yaml on axolotl-ai-cloud/plugin-atropos
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
axolotl_plugin_atropos-0.0.2.tar.gz -
Subject digest:
008f83594e1f90ebbc601f23e41fbce3837d193eb063b69bc59f04b7a93cc63f - Sigstore transparency entry: 212503443
- Sigstore integration time:
-
Permalink:
axolotl-ai-cloud/plugin-atropos@17c8f7677bf689bc8ee2f5ff07221bf61e2f64fb -
Branch / Tag:
refs/tags/v0.0.2 - Owner: https://github.com/axolotl-ai-cloud
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
pypi.yaml@17c8f7677bf689bc8ee2f5ff07221bf61e2f64fb -
Trigger Event:
push
-
Statement type: