Skip to main content

Automatically generate evals for every AI change

Project description

Parity

PyPI License: MIT Python 3.11+

You changed a prompt. You don't know if you broke something.

Parity automatically generates evals for every AI change — on every pull request, before anything ships.

What it does

Parity runs in CI. For every PR that touches your AI system, it:

  1. Detects changes to prompts, instructions, guardrails, validators, tool descriptions, classifiers, and other artifacts that shape how your agent behaves.
  2. Checks your existing eval coverage against what changed — and finds the gaps.
  3. Generates ranked, targeted test cases for those gaps, including multi-turn conversational evals for conversational agents.
  4. Posts a PR comment with proposed evals for your review.
  5. Writes approved evals to your eval platform only after you explicitly label the PR.

Parity is not an eval runner. It generates eval inputs that plug into LangSmith, Braintrust, Arize Phoenix, Promptfoo, or file-based workflows.

No evals yet? Parity starts from zero. It generates starter evals from your diff, system prompt, and whatever product context you provide. The more context you give it, the sharper it gets.

Quick start

pip install parity-ai
parity init

parity init generates parity.yaml, a GitHub Actions workflow, and context/ stubs. Fill in your context files, add your API keys as GitHub secrets, and open a PR that touches a prompt.

See docs/configuration.md for prerequisites, cost control, and the full configuration reference.

Try it on a real example

Test Parity against a real LangGraph repo with the in-repo demo:

examples/langgraph-agentic-rag | quickstart guide

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parity_ai-0.1.2.tar.gz (46.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

parity_ai-0.1.2-py3-none-any.whl (62.9 kB view details)

Uploaded Python 3

File details

Details for the file parity_ai-0.1.2.tar.gz.

File metadata

  • Download URL: parity_ai-0.1.2.tar.gz
  • Upload date:
  • Size: 46.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.2.tar.gz
Algorithm Hash digest
SHA256 dd9312558cabb60c6e3b504afa522831c6812a46592789b4be469572d9364887
MD5 59feb6580d4f1db030249f9844f71917
BLAKE2b-256 61048a264b64ca15e3c8b4a528437646be5de47d9f0ebaf565b8e467bb5100d1

See more details on using hashes here.

File details

Details for the file parity_ai-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: parity_ai-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 62.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 4fce92b6f3f19f089b056eb210f7b68c64d2d6f0ca95e0bd1c1ae92c63380b67
MD5 094712cd88285a1b3b0b03d2a743d5a6
BLAKE2b-256 32f7c856681570d95ca92b02bc64982d588a9867cb1a3bf621df6d34b8485714

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page