Skip to main content

Automatically generate evals for every AI change

Project description

Parity

PyPI License: MIT Python 3.11+

Parity discovers how your eval stack actually works, finds the coverage gaps introduced by an AI behavior change, and proposes native eval additions that fit the target suite instead of forcing everything through one generic probe row.

Parity is not an eval runner. It is a method-first eval synthesis system for LangSmith, Braintrust, Arize Phoenix, Promptfoo, and repo-local eval assets.

What Parity Optimizes For

For every PR that touches prompts, instructions, guardrails, judges, validators, or other behavior-defining assets, Parity:

  1. Detects the behavioral change.
  2. Discovers the most relevant existing eval target and how that target actually works.
  3. Validates which gaps are real against the discovered corpus, row shape, and evaluator regime.
  4. Synthesizes ranked native eval additions for that concrete target.
  5. Writes only native_ready evals after explicit approval.

Parity reuses the target's existing active evaluator regime when the platform manages evaluators outside the row itself. It does not create, rebind, or mutate hosted evaluator infrastructure.

Pipeline

  • Stage 1: Behavior Change Analysis
  • Stage 2: Eval Analysis
  • Stage 3: Native Eval Synthesis
  • Deterministic writeback: parity write-evals

The main runtime artifacts are:

  • BehaviorChangeManifest
  • EvalAnalysisManifest
  • EvalProposalManifest

Bootstrap Behavior

If Parity cannot find a safe existing target, it falls back to bootstrap mode. Bootstrap means starter eval generation, not evaluator setup. These results remain proposal-oriented and are not auto-written unless they later become native_ready.

Quick Start

pip install parity-ai
parity init

parity init generates parity.yaml, a GitHub Actions workflow, and context/ stubs. Fill in the context files, add your API keys as GitHub secrets, and open a PR that changes agent behavior.

See docs/configuration.md for config details, docs/spec.md for the technical architecture, and parity.yaml.example for the full schema.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

parity_ai-0.1.5.tar.gz (83.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

parity_ai-0.1.5-py3-none-any.whl (105.6 kB view details)

Uploaded Python 3

File details

Details for the file parity_ai-0.1.5.tar.gz.

File metadata

  • Download URL: parity_ai-0.1.5.tar.gz
  • Upload date:
  • Size: 83.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.5.tar.gz
Algorithm Hash digest
SHA256 5510bf9fa13f573e4534e86f8a292caaf187c40ba72e9624893c9607f97eec0d
MD5 be543e8a820af6c70fdebe3aea184dad
BLAKE2b-256 bc4d0a3b978d98bc16ebe07685d9b7146f1ed046fb45bf94c7b3bac70d68d75b

See more details on using hashes here.

File details

Details for the file parity_ai-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: parity_ai-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 105.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for parity_ai-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 20344c554d972c78f8e684b62dd3a35a68ee98beeb3a1bb88bb7262602b2918f
MD5 8efd5ac5475fb5b1fd8e87b0a650bd30
BLAKE2b-256 92a2d6358525665a67eaed5413e11bed060efee19f47f9c5722c940e85d7f3f1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page