Skip to main content

Agent evaluation framework. Define expectations. Catch regressions. Prove your agent isn't getting worse.

Project description

Rigr

Agent evaluation for teams that can't afford to be wrong.

Open-source core. Enterprise platform. Prove your AI agents are safe, consistent, and not quietly degrading — before your customers or your compliance team find out.

pip install rigr
rigr init && rigr test

Who this is for

You have AI agents in production. Every model update, prompt change, or retrieval tweak risks breaking them. Your compliance team wants evidence they're safe. Your CTO wants to know if v2 is actually better than v1. You need more than "looks good to me."

What it does

  1. Define expectations — Structured schema for what your agent must output
  2. Write test cases — Inputs with expected outputs. Version-controlled, reviewable
  3. Freeze baselines — Lock known-good results. Every run compares against them
  4. Catch regressions — New errors flagged before deployment. Resolved errors tracked
  5. Generate audit reports — Per-field accuracy, changelog, compliance-ready evidence

Enterprise

For teams deploying agents in regulated environments. SSO, audit logs, SOC 2, on-prem deployment, priority support. Book a call.

Quickstart

pip install rigr
rigr init                     # Creates rigr.yaml + test_cases/
rigr test --agent my_agent   # Runs tests against your agent
rigr freeze                   # Locks current results as baseline
rigr compare                  # Shows regressions vs baseline

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rigr-0.1.0.tar.gz (9.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rigr-0.1.0-py3-none-any.whl (8.8 kB view details)

Uploaded Python 3

File details

Details for the file rigr-0.1.0.tar.gz.

File metadata

  • Download URL: rigr-0.1.0.tar.gz
  • Upload date:
  • Size: 9.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rigr-0.1.0.tar.gz
Algorithm Hash digest
SHA256 c9db31807df94e5bdfbd8a312834d6a6f2ef407cfcc4c7d197d586066189b20f
MD5 7ec725991e4c5ea4589bc5a3730d9c2e
BLAKE2b-256 285b3f27da236ee8a0b4912214253c1c41b3a52ba1070a19d080ce6ae174ac1d

See more details on using hashes here.

File details

Details for the file rigr-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: rigr-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rigr-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0709cb2b2e9ca62ab7be8ddcadcad9c389fdc35f0e9762d7afd082d90d8f84f3
MD5 1fb8b4ff19ea4de994b84a18d4449dca
BLAKE2b-256 952cf35f671fc48496447aa6dfd9474fede1eb81319834d7a623cdb167e28e1b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page