Skip to main content

Automatically generate evals for every AI change

Project description

Parity

PyPI License: MIT Python 3.11+

You changed a prompt. You don't know if you broke something.

Parity automatically generates evals for every AI change — on every pull request, before anything ships.

What it does

Parity runs in CI. For every PR that touches your AI system, it:

  1. Detects changes to prompts, instructions, guardrails, validators, tool descriptions, classifiers, and other artifacts that shape how your agent behaves.
  2. Checks your existing eval coverage against what changed — and finds the gaps.
  3. Generates ranked, targeted test cases for those gaps, including multi-turn conversational evals for conversational agents.
  4. Posts a PR comment with proposed evals for your review.
  5. Writes approved evals to your eval platform only after you explicitly label the PR.

Parity is not an eval runner. It generates eval inputs that plug into LangSmith, Braintrust, Arize Phoenix, Promptfoo, or file-based workflows.

No evals yet? Parity starts from zero. It generates starter evals from your diff, system prompt, and whatever product context you provide. The more context you give it, the sharper it gets.

Quick start

pip install parity-ai
parity init

parity init generates parity.yaml, a GitHub Actions workflow, and context/ stubs. Fill in your context files, add your API keys as GitHub secrets, and open a PR that touches a prompt.

See docs/configuration.md for prerequisites, cost control, and the full configuration reference.

Try it on a real example

Test Parity against a real LangGraph repo with the in-repo demo:

examples/langgraph-agentic-rag | quickstart guide

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parity_ai-0.1.0.tar.gz (45.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

parity_ai-0.1.0-py3-none-any.whl (62.6 kB view details)

Uploaded Python 3

File details

Details for the file parity_ai-0.1.0.tar.gz.

File metadata

  • Download URL: parity_ai-0.1.0.tar.gz
  • Upload date:
  • Size: 45.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.0.tar.gz
Algorithm Hash digest
SHA256 519df0e34b0c4f8ea4f6efb7d57fc0658c829c86c564a9b5b359528739ec2378
MD5 d636a0962cc6125782c6b1a593724a33
BLAKE2b-256 12807b6c2104b38d31711df1cc71b436e9d8c82974b5201df422aeeef9341ac9

See more details on using hashes here.

File details

Details for the file parity_ai-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: parity_ai-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 62.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6b29ecda189fbdae488198953b6312f8c9a8f954e6dee5fccfca97042406f004
MD5 3a800e81fa281370e9df37b2d419a407
BLAKE2b-256 16534f01e1ec15841c5ad8d37aaa1c9e02dcb1038ace4fe69288d0e47ea032f1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page