Agent evaluation framework. Define expectations. Catch regressions. Prove your agent isn't getting worse.
Rigr
Agent evaluation for teams that can't afford to be wrong.
Open-source core. Enterprise platform. Prove your AI agents are safe, consistent, and not quietly degrading — before your customers or your compliance team find out.
pip install rigr
rigr init && rigr test
Who this is for
You have AI agents in production. Every model update, prompt change, or retrieval tweak risks breaking them. Your compliance team wants evidence they're safe. Your CTO wants to know if v2 is actually better than v1. You need more than "looks good to me."
What it does
- Define expectations — Structured schema for what your agent must output
- Write test cases — Inputs with expected outputs. Version-controlled, reviewable
- Freeze baselines — Lock known-good results. Every run compares against them
- Catch regressions — New errors flagged before deployment. Resolved errors tracked
- Generate audit reports — Per-field accuracy, changelog, compliance-ready evidence
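To make the first two items and the per-field accuracy figure concrete, here is a minimal sketch in plain Python. It does not use Rigr's API, and the source does not show Rigr's actual schema or test-case format. `SCHEMA`, `TEST_CASES`, `toy_agent`, `schema_violations`, and `per_field_accuracy` are all hypothetical names chosen for illustration.

```python
from collections import defaultdict

# Hypothetical expectation schema: field name -> required Python type.
SCHEMA = {"intent": str, "priority": int}

# Hypothetical test cases: an input plus the expected structured output.
TEST_CASES = [
    {"id": "refund-1", "input": "I want my money back",
     "expected": {"intent": "refund", "priority": 2}},
    {"id": "greet-1", "input": "hello there",
     "expected": {"intent": "greeting", "priority": 0}},
]


def toy_agent(text: str) -> dict:
    """Stand-in for a real agent; returns a structured output."""
    if "money" in text:
        return {"intent": "refund", "priority": 1}
    return {"intent": "greeting", "priority": 0}


def schema_violations(output: dict) -> list[str]:
    """Return the schema fields that are missing or have the wrong type."""
    return [name for name, kind in SCHEMA.items()
            if not isinstance(output.get(name), kind)]


def per_field_accuracy(cases: list[dict], agent) -> dict[str, float]:
    """Fraction of test cases in which each field matched its expected value."""
    correct = defaultdict(int)
    for case in cases:
        output = agent(case["input"])
        for name, expected in case["expected"].items():
            if output.get(name) == expected:
                correct[name] += 1
    return {name: correct[name] / len(cases) for name in SCHEMA}


accuracy = per_field_accuracy(TEST_CASES, toy_agent)
print(accuracy)
```

In this toy run the agent gets `intent` right in both cases but `priority` right in only one, so the report separates a field that is reliable from one that is not.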
Enterprise
For teams deploying agents in regulated environments. SSO, audit logs, SOC 2, on-prem deployment, priority support. Book a call.
Quickstart
pip install rigr
rigr init # Creates rigr.yaml + test_cases/
rigr test --agent my_agent # Runs tests against your agent
rigr freeze # Locks current results as baseline
rigr compare # Shows regressions vs baseline
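The idea behind the freeze and compare steps can be sketched in a few lines of Python. This illustrates the concept only. It is not how Rigr stores or diffs baselines, which the source does not describe. The `freeze` and `compare` functions, the JSON layout, and the file name `baseline.json` are assumptions.

```python
import json
import tempfile
from pathlib import Path


def freeze(results: dict[str, bool], path: Path) -> None:
    """Store pass/fail results per test case as the known-good baseline."""
    path.write_text(json.dumps(results, indent=2, sort_keys=True))


def compare(results: dict[str, bool], path: Path) -> dict[str, list[str]]:
    """Diff current results against the frozen baseline."""
    baseline = json.loads(path.read_text())
    # New errors: failing now, and passing (or absent) in the baseline.
    regressions = sorted(case for case, ok in results.items()
                         if not ok and baseline.get(case, True))
    # Resolved errors: passing now, and failing in the baseline.
    resolved = sorted(case for case, ok in results.items()
                      if ok and baseline.get(case) is False)
    return {"regressions": regressions, "resolved": resolved}


with tempfile.TemporaryDirectory() as tmp:
    baseline_path = Path(tmp) / "baseline.json"
    freeze({"refund-1": True, "greet-1": False}, baseline_path)
    report = compare({"refund-1": False, "greet-1": True}, baseline_path)

print(report)
```

Keeping the baseline as a plain, sorted JSON file is one way to make it reviewable in version control alongside the test cases.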
File details
Details for the file rigr-0.1.0.tar.gz.
File metadata
- Download URL: rigr-0.1.0.tar.gz
- Upload date:
- Size: 9.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | c9db31807df94e5bdfbd8a312834d6a6f2ef407cfcc4c7d197d586066189b20f |
| MD5 | 7ec725991e4c5ea4589bc5a3730d9c2e |
| BLAKE2b-256 | 285b3f27da236ee8a0b4912214253c1c41b3a52ba1070a19d080ce6ae174ac1d |
File details
Details for the file rigr-0.1.0-py3-none-any.whl.
File metadata
- Download URL: rigr-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 0709cb2b2e9ca62ab7be8ddcadcad9c389fdc35f0e9762d7afd082d90d8f84f3 |
| MD5 | 1fb8b4ff19ea4de994b84a18d4449dca |
| BLAKE2b-256 | 952cf35f671fc48496447aa6dfd9474fede1eb81319834d7a623cdb167e28e1b |