Verifiers: Environments for LLM Reinforcement Learning

Documentation | Environments Hub | PRIME-RL


News & Updates

  • [04/17/26] v0.1.12 is released, featuring a new composable Task/Agent/Environment architecture, upstreamed opencode and RLM harnesses/tasksets, major RLMEnv improvements (context dropping, prompt builder, hardened transport), multi-worker env server support, expanded vf-tui capabilities, and richer eval configuration.
  • [03/12/26] v0.1.11 is released, featuring a unified client stack, major RLMEnv and env server reliability improvements, a substantially refined eval TUI, new pass@k and ablation sweep support, and bundled opencode environments.
  • [02/10/26] v0.1.10 is released, featuring OpenEnv and BrowserEnv integrations, resumed evals, improved rollout and token tracking, safer sandbox lifecycle behavior, refreshed workspace setup, and opencode harbor improvements.
  • [01/08/26] v0.1.9 is released, featuring a number of new experimental environment class types, monitor rubrics for automatic metric collection, improved workspace setup flow, improved error handling, bug fixes, and a documentation overhaul.
  • [11/19/25] v0.1.8 is released, featuring a major refactor of the rollout system to use trajectory-based tracking for token-in token-out training across turns, as well as support for truncated or branching rollouts.
  • [11/07/25] Verifiers v0.1.7 is released! This includes an improved quickstart configuration for training with prime-rl, a new included "nano" trainer (vf.RLTrainer, replacing vf.GRPOTrainer), and a number of bug fixes and improvements to the documentation.
  • [10/27/25] A new iteration of the Prime Intellect Environments Program is live!

Overview

Verifiers is our library for creating environments to train and evaluate LLMs.

Environments contain everything required to run and evaluate a model on a particular task:

  • A dataset of task inputs
  • A harness for the model (tools, sandboxes, context management, etc.)
  • A reward function or rubric to score the model's performance
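
As an illustration only (the class and field names below are invented for this sketch, not the verifiers API), the three components might fit together like this:

```python
from dataclasses import dataclass, field
from typing import Callable

# Illustrative sketch only: these names are hypothetical, not the verifiers API.
@dataclass
class ToyEnvironment:
    dataset: list[dict]                                   # task inputs
    harness: dict = field(default_factory=dict)           # tools, sandboxes, context, etc.
    reward_funcs: list[Callable[[str, str], float]] = field(default_factory=list)

    def score(self, completion: str, answer: str) -> float:
        """Average the reward functions, rubric-style."""
        if not self.reward_funcs:
            return 0.0
        return sum(f(completion, answer) for f in self.reward_funcs) / len(self.reward_funcs)

env = ToyEnvironment(
    dataset=[{"question": "2 + 2 = ?", "answer": "4"}],
    reward_funcs=[lambda c, a: 1.0 if c.strip() == a else 0.0],
)
print(env.score("4", "4"))  # -> 1.0
```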

Environments can be used for training models with reinforcement learning (RL), evaluating capabilities, generating synthetic data, experimenting with agent harnesses, and more.

Verifiers is tightly integrated with the Environments Hub, as well as our training framework prime-rl and our Hosted Training platform.

Getting Started

Ensure you have uv installed, as well as the prime CLI tool:

# install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
# install the prime CLI
uv tool install prime
# log in to the Prime Intellect platform
prime login

To set up a new workspace for developing environments, do:

# run inside your workspace directory, e.g. ~/dev/my-lab
prime lab setup

This sets up a Python project if needed (with uv init), installs verifiers (with uv add verifiers), creates the recommended workspace structure, and downloads useful starter files:

configs/
├── endpoints.toml      # OpenAI-compatible API endpoint configuration
├── rl/                 # Example configs for Hosted Training
├── eval/               # Example multi-environment eval configs
└── gepa/               # Example configs for prompt optimization
.prime/
└── skills/             # Bundled workflow skills for create/browse/review/eval/GEPA/train/brainstorm
environments/
└── AGENTS.md           # Documentation for AI coding agents
AGENTS.md               # Top-level documentation for AI coding agents
CLAUDE.md               # Claude-specific pointer to AGENTS.md

Alternatively, add verifiers to an existing project:

uv add verifiers && prime lab setup --skip-install

Environments built with Verifiers are self-contained Python modules. To initialize a fresh environment template, do:

prime env init my-env # creates a new template in ./environments/my_env

This creates a new module called my_env with a basic environment template:

environments/my_env/
├── my_env.py           # Main implementation
├── pyproject.toml      # Dependencies and metadata
└── README.md           # Documentation

For OpenEnv integration, use:

prime env init my-openenv --openenv

Then copy your OpenEnv project into environments/my_openenv/proj/ and build the image with:

uv run vf-build my-openenv

Environment modules should expose a load_environment function that returns an Environment instance and can accept custom arguments. For example:

# my_env.py
import verifiers as vf

def load_environment(dataset_name: str = 'gsm8k') -> vf.Environment:
    dataset = vf.load_example_dataset(dataset_name)  # rows with a 'question' column
    async def correct_answer(completion, answer) -> float:
        completion_ans = completion[-1]['content']
        return 1.0 if completion_ans == answer else 0.0
    rubric = vf.Rubric(funcs=[correct_answer])
    env = vf.SingleTurnEnv(dataset=dataset, rubric=rubric)
    return env
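
In the example above, `completion` is a list of chat messages (dicts with `role` and `content` keys), with the model's final reply last. A standalone sketch of that reward contract, minus the async wrapper and the verifiers dependency:

```python
# Standalone sketch of the reward contract from the example above.
# `completion` is a list of chat messages; the last one is the model's reply.
def correct_answer(completion: list[dict], answer: str) -> float:
    completion_ans = completion[-1]["content"]
    return 1.0 if completion_ans == answer else 0.0

messages = [
    {"role": "user", "content": "What is 7 * 6?"},
    {"role": "assistant", "content": "42"},
]
print(correct_answer(messages, "42"))  # -> 1.0
```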

To install the environment module into your project, do:

prime env install my-env # installs from ./environments/my_env

To install an environment from the Environments Hub into your project, do:

prime env install primeintellect/math-python

To run a local evaluation with any OpenAI-compatible model, do:

prime eval run my-env -m openai/gpt-5-nano # run and save eval results locally

Evaluations use Prime Inference by default; configure your own API endpoints in ./configs/endpoints.toml.
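
For illustration, an entry in endpoints.toml might look roughly like this (the section name and keys here are guesses; the starter file downloaded by prime lab setup shows the real schema):

```toml
# Hypothetical sketch -- consult the bundled configs/endpoints.toml for the actual schema.
[my-endpoint]
url = "https://api.example.com/v1"   # OpenAI-compatible base URL
key = "MY_API_KEY_ENV_VAR"           # credential, e.g. an environment variable name
model = "gpt-5-nano"                 # default model served at this endpoint
```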

View local evaluation results in the terminal UI:

prime eval tui

To publish the environment to the Environments Hub, do:

prime env push --path ./environments/my_env

To run an evaluation directly from the Environments Hub, do:

prime eval run primeintellect/math-python

Documentation

Environments — Create datasets, rubrics, and custom multi-turn interaction protocols.

Evaluation — Evaluate models using your environments.

Training — Train models in your environments with reinforcement learning.

Development — Contributing to verifiers.

API Reference — Understanding the API and data structures.

FAQs — Other frequently asked questions.

Citation

Originally created by Will Brown (@willccbb).

If you use this code in your research, please cite:

@misc{brown_verifiers_2025,
  author       = {William Brown},
  title        = {{Verifiers}: Environments for LLM Reinforcement Learning},
  howpublished = {\url{https://github.com/PrimeIntellect-ai/verifiers}},
  note         = {Commit abcdefg • accessed DD Mon YYYY},
  year         = {2025}
}
