Skip to main content

RLVR training framework for LLMs

Project description

retrain

RLVR (Reinforcement Learning with Verifiable Rewards) training framework for LLMs. Train reasoning models on MATH with composable advantage functions and adaptive scheduling.

Features

  • Composable advantages -- GRPO, MaxRL, GTPO entropy weighting, HICRA planning amplification, SEPA entropy pooling
  • Pluggable inference -- PyTorch, MAX, vLLM, SGLang, or any OpenAI-compatible server
  • Pluggable rewards -- string match, symbolic math, LLM judge, or bring your own
  • Back pressure -- USL+Roofline adaptive batch sizing
  • Campaign orchestrator -- sweep all 5 conditions across seeds with wandb logging
  • Checkpoint resume -- save and restore full trainer state across preemptions

Install

pip install -e .

Quick start

# 1. Drop a config
cp retrain.toml my_run.toml

# 2. Train
retrain my_run.toml

# 3. Override from CLI
retrain my_run.toml --seed 42 --wandb-project my-project

Configuration

All configuration lives in a TOML file. See retrain.toml for the default config, or run retrain help for the full reference.

Documentation

Full documentation: retrain.readthedocs.io

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

retrain-0.2.0.tar.gz (348.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

retrain-0.2.0-py3-none-any.whl (43.6 kB view details)

Uploaded Python 3

File details

Details for the file retrain-0.2.0.tar.gz.

File metadata

  • Download URL: retrain-0.2.0.tar.gz
  • Upload date:
  • Size: 348.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.16

File hashes

Hashes for retrain-0.2.0.tar.gz
Algorithm Hash digest
SHA256 5156651835f29e8c0b6fb9a28e7330119acedeece114573a884f54951d2b8b38
MD5 9e8f382f2400c76119db9d82cba9f59d
BLAKE2b-256 470eb8e47f71599f949496b4bb5c9302dd046cec3c8fb987e5dd7a8ba56025a3

See more details on using hashes here.

File details

Details for the file retrain-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: retrain-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 43.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.16

File hashes

Hashes for retrain-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7174c7299e1b910065b694820c1a8fef08c7c14ee2c465c348d855a3e461200c
MD5 6597d19aeeba85ae1c7f84054de94a3f
BLAKE2b-256 412f63fd5e6a9c769dd6315194a6a3b9bf564a3bc5d4e3a9bf0e5ed7f9dad362

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page