Skip to main content

Automatically generate evals for every AI change

Project description

Parity

PyPI License: MIT Python 3.11+

You changed a prompt. You don't know if you broke something.

Parity automatically generates evals for every AI change — on every pull request, before anything ships.

What it does

Parity runs in CI. For every PR that touches your AI system, it:

  1. Detects changes to prompts, instructions, guardrails, validators, tool descriptions, classifiers, and other artifacts that shape how your agent behaves.
  2. Checks your existing eval coverage against what changed — and finds the gaps.
  3. Generates ranked, targeted test cases for those gaps, including multi-turn conversational evals for conversational agents.
  4. Posts a PR comment with proposed evals for your review.
  5. Writes approved evals to your eval platform only after you explicitly label the PR.

Parity is not an eval runner. It generates eval inputs that plug into LangSmith, Braintrust, Arize Phoenix, Promptfoo, or file-based workflows.

No evals yet? Parity starts from zero. It generates starter evals from your diff, system prompt, and whatever product context you provide. The more context you give it, the sharper it gets.

Quick start

pip install parity-ai
parity init

parity init generates parity.yaml, a GitHub Actions workflow, and context/ stubs. Fill in your context files, add your API keys as GitHub secrets, and open a PR that touches a prompt.

See docs/configuration.md for prerequisites, cost control, and the full configuration reference.

Try it on a real example

Test Parity against a real LangGraph repo with the in-repo demo:

examples/langgraph-agentic-rag | quickstart guide

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parity_ai-0.1.3.tar.gz (46.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

parity_ai-0.1.3-py3-none-any.whl (63.0 kB view details)

Uploaded Python 3

File details

Details for the file parity_ai-0.1.3.tar.gz.

File metadata

  • Download URL: parity_ai-0.1.3.tar.gz
  • Upload date:
  • Size: 46.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.3.tar.gz
Algorithm Hash digest
SHA256 49092eff9b1abd9f46d047fc03c408b7355aa09aa117b5724692f354070f4c48
MD5 7ecd53fa5fb3796770d3468a01331a32
BLAKE2b-256 b78c75ba7092e9c30cfd477a0cecd35103ec87e0a05586a22c73efeadde2a8b3

See more details on using hashes here.

File details

Details for the file parity_ai-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: parity_ai-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 63.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 fb4e95e62b20b6055d10dcb6524af14b5d5bf3b9e1928e31f504f2353f21826a
MD5 b01c0d010523e56ce456225fa6a640d0
BLAKE2b-256 c1aacc95d7dc4c03f04d9f3fa3f914055ed1d343fb8b192ec83877e43da87505

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page