Skip to main content

The official Python SDK for Eval Protocol (EP.) EP is an open protocol that standardizes how developers author evals for large language model (LLM) applications.

Project description

Eval Protocol

PyPI - Version Ask DeepWiki

Eval Protocol (EP) is an open solution for doing reinforcement learning fine-tuning on existing agents — across any language, container, or framework.

Eval Protocol overview

Most teams already have complex agents running in production — often across remote services with heavy dependencies, Docker containers, or TypeScript backends deployed on Vercel. When they try to train or fine-tune these agents with reinforcement learning, connecting them to a trainer quickly becomes painful.

Eval Protocol makes this possible in two ways:

  1. Expose your agent through a simple API Wrap your existing agent (Python, TypeScript, Docker, etc.) in a simple HTTP service using EP’s rollout interface. EP handles the rollout orchestration, metadata passing, and trace storage automatically.
  2. Connect with any trainer Once your agent speaks the EP standard, it can be fine-tuned or evaluated with any supported trainer — Fireworks RFT, TRL, Unsloth, or your own — with no environment rewrites.

The result: RL that works out-of-the-box for existing production agents.

Who This Is For

  • Applied AI teams adding RL to existing production agents.
  • Research engineers experimenting with fine-tuning complex, multi-turn or tool-using agents.
  • MLOps teams building reproducible, language-agnostic rollout pipelines.

Quickstart

Resources

License

MIT

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

eval_protocol-0.2.98.dev1.tar.gz (2.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

eval_protocol-0.2.98.dev1-py3-none-any.whl (2.1 MB view details)

Uploaded Python 3

File details

Details for the file eval_protocol-0.2.98.dev1.tar.gz.

File metadata

  • Download URL: eval_protocol-0.2.98.dev1.tar.gz
  • Upload date:
  • Size: 2.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for eval_protocol-0.2.98.dev1.tar.gz
Algorithm Hash digest
SHA256 a5079b14474e9892b9aafd7434bd5b67a6aae365de3310dfe1230d9630454cee
MD5 4d18a72244d093813d53f58584b3afec
BLAKE2b-256 0157449fbb7dfc3c9d9289cdcf025b78d0e34ca23892e6819480328b37f49c27

See more details on using hashes here.

Provenance

The following attestation bundles were made for eval_protocol-0.2.98.dev1.tar.gz:

Publisher: release.yml on eval-protocol/python-sdk

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file eval_protocol-0.2.98.dev1-py3-none-any.whl.

File metadata

File hashes

Hashes for eval_protocol-0.2.98.dev1-py3-none-any.whl
Algorithm Hash digest
SHA256 75a5b796e483d7076336b52b311ecc7d85f08ebfc33a02b6dfa396f86e9d4d8f
MD5 d4b1adf16411dd2edf927aba037288e5
BLAKE2b-256 a13afebe4365503e60e47b1c88209c175e259aa94466471ad123c1000461dff5

See more details on using hashes here.

Provenance

The following attestation bundles were made for eval_protocol-0.2.98.dev1-py3-none-any.whl:

Publisher: release.yml on eval-protocol/python-sdk

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page