Axolotl Plugin integrating Nous Research's Atropos for RL

plugin-atropos

A plugin to train LLMs using Axolotl with Atropos, a large-scale RL Gym.

Installation

Install axolotl + vllm if not already installed

pip install "axolotl[vllm,flash-attn]"

Install Atropos

git clone https://github.com/NousResearch/atropos.git
cd atropos
git checkout add-support-for-custom-api-servers
pip install -e .
cd -

Install this plugin

git clone https://github.com/axolotl-ai-cloud/plugin-atropos.git
cd plugin-atropos
pip install -e .
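
After the three installs, a quick sanity check can confirm that the packages are visible to the interpreter before any GPUs are involved. The sketch below assumes `python3` is the interpreter the packages were installed into; `check_module` is a hypothetical helper name, and `plugin_atropos` is the module name taken from the `--serve-module` argument in the usage section.

```shell
# check_module NAME
# Succeeds if the top-level module NAME can be located by python3.
# The module is only looked up, not imported, so nothing heavy is loaded.
check_module() {
  python3 -c 'import importlib.util, sys
sys.exit(0 if importlib.util.find_spec(sys.argv[1]) else 1)' "$1" 2>/dev/null
}

# Report any of the expected modules that cannot be found.
report_missing() {
  for mod in "$@"; do
    check_module "$mod" || echo "missing module: $mod" >&2
  done
}
```

Running `report_missing axolotl vllm plugin_atropos` prints one line per module that cannot be found and nothing if all are present.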

Usage

Note: this setup requires 3 GPUs. GPUs 0 and 1 run the vLLM server; GPU 2 runs the trainer.
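
To catch a GPU shortfall before launching anything, the count can be checked up front. This is a minimal sketch, assuming NVIDIA GPUs and that `nvidia-smi` is on the PATH; `have_enough_gpus` is a hypothetical helper name, and the optional second argument exists only so the comparison can be exercised without a GPU.

```shell
# have_enough_gpus REQUIRED [AVAILABLE]
# Succeeds if at least REQUIRED GPUs are available. When AVAILABLE is not
# given, it is counted from `nvidia-smi --list-gpus`, which prints one
# line per device starting with "GPU ".
have_enough_gpus() {
  required="$1"
  if [ -n "${2:-}" ]; then
    available="$2"
  else
    # grep -c prints 0 (and exits non-zero) when nothing matches,
    # including when nvidia-smi is missing; `|| true` keeps that quiet.
    available="$(nvidia-smi --list-gpus 2>/dev/null | grep -c '^GPU ' || true)"
  fi
  [ "${available:-0}" -ge "$required" ]
}
```

For example, `have_enough_gpus 3 || echo "need at least 3 GPUs" >&2` warns before the vLLM server is started.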

# start the vLLM server (can take a few minutes); this blocks the session
CUDA_VISIBLE_DEVICES=0,1 axolotl vllm-serve examples/train-fft.yaml --serve-module plugin_atropos.vllm_serve
#  CUDA_VISIBLE_DEVICES=0,1 vllm serve Qwen/Qwen3-4B --port 9001 --host 0.0.0.0 --tensor-parallel-size=2 --max-model-len 4096 --kv-cache-dtype fp8

# in a new terminal session
# start the API server in the background and redirect both stdout and stderr
run-api &> logs.txt &
# start the RL environment, this will block the session
python examples/gsm8k_server.py serve --slurm false

Start the trainer

CUDA_VISIBLE_DEVICES=2 axolotl train examples/train-fft.yaml
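
Since the vLLM server can take a few minutes to come up, it may help to poll for readiness before starting the environment and the trainer. The helper below is a generic retry sketch, not part of the plugin; `wait_for` is a hypothetical name, and the readiness URL in the commented example is a placeholder, because the host, port, and health route depend on how the server is configured.

```shell
# wait_for TIMEOUT_SECONDS COMMAND [ARGS...]
# Re-runs COMMAND about once per second until it succeeds (returns 0)
# or TIMEOUT_SECONDS retries have elapsed (returns 1).
wait_for() {
  timeout_s="$1"
  shift
  elapsed=0
  until "$@" >/dev/null 2>&1; do
    if [ "$elapsed" -ge "$timeout_s" ]; then
      return 1
    fi
    sleep 1
    elapsed=$((elapsed + 1))
  done
  return 0
}

# Example (placeholder URL; substitute the address the server reports at startup):
# wait_for 600 curl -sf "http://localhost:<port>/health" \
#   && CUDA_VISIBLE_DEVICES=2 axolotl train examples/train-fft.yaml
```

Passing the probe as a command keeps the helper independent of any particular endpoint, so the same function also works with a check other than `curl`.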

Download files

Source Distribution

axolotl_plugin_atropos-0.0.2.tar.gz (24.9 kB)

File details

Details for the file axolotl_plugin_atropos-0.0.2.tar.gz.

File metadata

  • Download URL: axolotl_plugin_atropos-0.0.2.tar.gz
  • Size: 24.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for axolotl_plugin_atropos-0.0.2.tar.gz

Algorithm    Hash digest
SHA256       008f83594e1f90ebbc601f23e41fbce3837d193eb063b69bc59f04b7a93cc63f
MD5          76e85d381fd6cb19cfa9fc1047ca1bc4
BLAKE2b-256  fbbaf2116a22fe324584729b298c07b6ec6b8bce8ffc8408460b6c4ef40c2ba4

Provenance

The following attestation bundles were made for axolotl_plugin_atropos-0.0.2.tar.gz:

Publisher: pypi.yaml on axolotl-ai-cloud/plugin-atropos
