Skip to main content

Automatically generate evals for every AI change

Project description

Parity

PyPI License: MIT Python 3.11+

You changed a prompt. You don't know if you broke something.

Parity automatically generates evals for every AI change — on every pull request, before anything ships.

What it does

Parity runs in CI. For every PR that touches your AI system, it:

  1. Detects changes to prompts, instructions, guardrails, validators, tool descriptions, classifiers, and other artifacts that shape how your agent behaves.
  2. Checks your existing eval coverage against what changed — and finds the gaps.
  3. Generates ranked, targeted test cases for those gaps, including multi-turn conversational evals for conversational agents.
  4. Posts a PR comment with proposed evals for your review.
  5. Writes approved evals to your eval platform only after you explicitly label the PR.

Parity is not an eval runner. It generates eval inputs that plug into LangSmith, Braintrust, Arize Phoenix, Promptfoo, or file-based workflows.

No evals yet? Parity starts from zero. It generates starter evals from your diff, system prompt, and whatever product context you provide. The more context you give it, the sharper it gets.

Quick start

pip install parity-ai
parity init

parity init generates parity.yaml, a GitHub Actions workflow, and context/ stubs. Fill in your context files, add your API keys as GitHub secrets, and open a PR that touches a prompt.

See docs/configuration.md for prerequisites, cost control, and the full configuration reference.

Try it on a real example

Test Parity against a real LangGraph repo with the in-repo demo:

examples/langgraph-agentic-rag | quickstart guide

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parity_ai-0.1.1.tar.gz (45.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

parity_ai-0.1.1-py3-none-any.whl (62.7 kB view details)

Uploaded Python 3

File details

Details for the file parity_ai-0.1.1.tar.gz.

File metadata

  • Download URL: parity_ai-0.1.1.tar.gz
  • Upload date:
  • Size: 45.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c7c1512c8a6707d7974105e0b6a51b737a4592276c93000ee4991a302ad655e1
MD5 dcacaa727ae5de8dc548f6e6d9619ca0
BLAKE2b-256 eb1ed101293f94b2df0769ed499676e4a401071ec32a3fde3ffab59928f909fd

See more details on using hashes here.

File details

Details for the file parity_ai-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: parity_ai-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 62.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9b1e10c56465e2d03b9cc04664f4ee6065bcccaa4bdf8a0f9878f6566ee95b3e
MD5 4757fee4d23f23fd44af7d4e7f358ede
BLAKE2b-256 2ac2a32acbe0d4a44681ea5555481232467fc6c826ca6787a7f87461ec03496d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page