Tamper-evident audit trails for AI systems

These details have not been verified by PyPI

Project links

Project description

Assay

Tamper-evident audit trails for AI systems.

We scanned 30 popular AI projects and found 202 high-confidence LLM call sites. Zero had tamper-evident audit trails. Full results.

Assay adds independently verifiable execution evidence to AI systems: cryptographically signed receipt bundles that a third party can verify offline without trusting your server logs. Two lines of code. Four exit codes.

pip install assay-ai && assay quickstart

Boundary: Assay proves tamper-evident internal consistency and completeness relative to scanned call sites. It does not prevent a fully compromised machine from fabricating a consistent story. That's what trust tiers are for.

Not this: Assay is not a logging framework, an observability dashboard, or a monitoring tool. It produces signed evidence bundles that a third party can verify offline. If you need Datadog, this isn't it.

See It -- Then Understand It

No API key needed. Runs on synthetic data:

assay demo-incident     # two-act scenario: honest PASS vs honest FAIL

Act 1: Agent uses gpt-4 with a guardian check. Integrity PASS, claims PASS. Act 2: Someone swaps the model and drops the guardian. Integrity PASS, claims FAIL.

That second result is an honest failure -- authentic evidence proving the run violated its declared standards. Not a cover-up. Exit code 1.

Exit 1 is audit gold: authentic evidence that a control failed, with no ability to edit history. Auditors love "controls can fail, but failure is detectable and retained."

How that works

Assay separates two questions on purpose:

Integrity: "Were these bytes tampered with after creation?" (signatures, hashes, required files)
Claims: "Does this evidence satisfy our declared governance checks?" (receipt types, counts, field values)

Integrity	Claims	Exit	Meaning
PASS	PASS	0	Evidence checks out, behavior meets standards
PASS	FAIL	1	Honest failure: authentic evidence of a standards violation
FAIL	--	2	Tampered evidence
--	--	3	Bad input (missing files, invalid arguments)

The split is the point. Systems that can prove they failed honestly are more trustworthy than systems that always claim to pass.

Add to Your Project

# 1. Find uninstrumented LLM calls
assay scan . --report

# 2. Patch (one line per SDK, or auto-patch all)
assay patch .

# 3. Run + build a signed evidence pack
assay run -c receipt_completeness -- python my_app.py

# 4. Verify
assay verify-pack ./proof_pack_*/

assay scan . --report finds every LLM call site (OpenAI, Anthropic, LangChain) and generates a self-contained HTML gap report. assay patch inserts the two-line integration. assay run wraps your command, collects receipts, and produces a signed 5-file evidence pack. assay verify-pack checks integrity + claims and exits with one of the four codes above. Then run assay explain on any pack for a plain-English summary.

Why now: EU AI Act Article 12 requires automatic logging for high-risk AI systems; Article 19 requires providers to retain automatically generated logs for at least 6 months. High-risk obligations apply from 2 Aug 2026 (Annex III) and 2 Aug 2027 (regulated products). SOC 2 CC7.2 requires monitoring of system components and analysis of anomalies as security events. "We have logs on our server" is not independently verifiable evidence. Assay produces evidence that is. See compliance citations for exact references.

CI Gate

Three commands, three exit codes, one lockfile:

assay run -c receipt_completeness -- python my_app.py
assay verify-pack ./proof_pack_*/ --lock assay.lock --require-claim-pass
assay diff ./baseline_pack/ ./proof_pack_*/ --gate-cost-pct 25 --gate-errors 0 --gate-strict

The lockfile catches config drift. Verify-pack catches tampering. Diff catches regressions and budget overruns. See Decision Escrow for the protocol model.

# Lock your verification contract
assay lock write --cards receipt_completeness -o assay.lock

Daily use after CI is green

Regression forensics:

assay diff ./proof_pack_*/ --against-previous --why

--against-previous auto-discovers the baseline pack. --why traces receipt chains to explain what regressed and which call sites caused it.

Cost/latency drift (from receipts):

assay analyze --history --since 7

Shows cost, latency percentiles, error rates, and per-model breakdowns from your local trace history.

Trust Model

What Assay detects, what it doesn't, and how to strengthen guarantees.

Assay detects:

Retroactive tampering (edit one byte, verification fails)
Selective omission under a completeness contract
Claiming checks that were never run
Policy drift from a locked baseline

Assay does not prevent:

A fully fabricated false run (attacker controls the machine)
Dishonest receipt content (receipts are self-attested)
Timestamp fraud without an external time anchor

Completeness is enforced relative to the call sites enumerated by the scanner and/or declared by policy. Undetected call sites are a known residual risk, reduced via multi-detector scanning and CI gating.

To strengthen guarantees:

Transparency ledger (independent witness)
CI-held org key + branch protection (separation of signer and committer)
External timestamping (RFC 3161)

The cost of cheating scales with the complexity of the lie. Assay doesn't make fraud impossible -- it makes fraud expensive.

The Evidence Compiler

Assay is an evidence compiler for AI execution. If you've used a build system, you already know the mental model:

Concept	Build System	Assay
Source	`.c` / `.ts` files	Receipts (one per LLM call)
Artifact	Binary / bundle	Evidence pack (5 files, 1 signature)
Tests	Unit / integration tests	Verification (integrity + claims)
Lock	`package-lock.json`	`assay.lock`
Gate	CI deploy check	CI evidence gate

Commands

The core path is 6 commands:

assay quickstart          # discover
assay scan / assay patch  # instrument
assay run                 # produce evidence
assay verify-pack         # verify evidence
assay diff                # catch regressions
assay score               # evidence readiness (0-100, A-F)

Full command reference:

Command	Purpose
`assay quickstart`	One command: demo + scan + next steps
`assay status`	One-screen operational dashboard: am I set up?
`assay start demo`	See Assay in action (quickstart flow)
`assay start ci`	Guided CI evidence gate setup (5 steps)
`assay start mcp`	Guided MCP tool call auditing setup (4 steps)
`assay scan`	Find uninstrumented LLM call sites (`--report` for HTML)
`assay patch`	Auto-insert SDK integration patches into your entrypoint
`assay run`	Wrap command, collect receipts, build signed evidence pack
`assay verify-pack`	Verify an evidence pack (integrity + claims)
`assay explain`	Plain-English summary of an evidence pack
`assay analyze`	Cost, latency, error breakdown from pack or `--history`
`assay diff`	Compare packs: claims, cost, latency (`--against-previous`, `--why`, `--gate-*`)
`assay score`	Evidence Readiness Score (0-100, A-F) with anti-gaming caps
`assay doctor`	Preflight check: is Assay ready here?
`assay mcp-proxy`	Transparent MCP proxy: intercept tool calls, emit receipts
`assay mcp policy init`	Generate a starter MCP policy YAML file
`assay ci init github`	Generate a GitHub Actions workflow
`assay lock write`	Freeze verification contract to lockfile
`assay lock check`	Validate lockfile against current card definitions
`assay key list`	List local signing keys and active signer
`assay key rotate`	Generate a new signer key and switch active signer
`assay key set-active`	Set active signing key for future runs
`assay cards list`	List built-in run cards and their claims
`assay cards show`	Show card details, claims, and parameters
`assay demo-incident`	Two-act scenario: passing run vs failing run
`assay demo-challenge`	CTF-style good + tampered pack pair
`assay demo-pack`	Generate demo packs (no config needed)
`assay onboard`	Guided setup: doctor -> scan -> first run plan

Documentation

Full Picture -- architecture, trust tiers, repo boundaries, release history
Quickstart -- install, golden path, command reference
For Compliance Teams -- what auditors see, evidence artifacts, framework alignment
Compliance Citations -- exact regulatory references (EU AI Act, SOC 2, ISO 42001)
Decision Escrow -- protocol model: agent actions don't settle until verified
Roadmap -- phases, product boundary, execution stack
Repo Map -- what lives where across the Assay ecosystem
Pilot Program -- early adopter program details

Common Issues

"No receipts emitted" after assay run: First, check whether your code has call sites: assay scan . -- if scan finds 0 sites, you may not be using a supported SDK yet. If scan finds sites, check: (1) Is # assay:patched in the file? Run assay scan . --report to see patch status per file. (2) Did you install the SDK extra (pip install assay-ai[openai])? (3) Did you use -- before your command (assay run -- python app.py)? Run assay doctor for a full diagnostic.
LangChain projects: assay patch auto-instruments OpenAI and Anthropic SDKs but not LangChain (which uses callbacks, not monkey-patching). For LangChain, add AssayCallbackHandler() to your chain's callbacks parameter manually. See src/assay/integrations/langchain.py for the handler.
assay run python app.py gives "No command provided": You need the -- separator: assay run -c receipt_completeness -- python app.py. Everything after -- is passed to the subprocess.
Quickstart blocked on large directories: assay quickstart guards against scanning system directories (>10K Python files). Use --force to bypass: assay quickstart --force.

Get Involved

Try it: pip install assay-ai && assay quickstart
Questions / feedback: GitHub Discussions
Bug reports: Issues
Want this in your stack in 2 weeks? Pilot program -- we instrument your AI workflows, set up CI gates, and hand you a working evidence pipeline. Open a pilot inquiry.

Related Repos

Repo	Purpose
assay	Core CLI, SDK, conformance corpus (this repo)
assay-verify-action	GitHub Action for CI verification
assay-ledger	Public transparency ledger

License

Apache-2.0

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.23.0

May 8, 2026

1.22.0

Apr 9, 2026

1.21.0

Apr 7, 2026

1.20.2

Apr 5, 2026

1.20.1

Apr 4, 2026

1.20.0

Mar 30, 2026

1.19.0

Mar 26, 2026

1.18.0

Mar 25, 2026

1.17.0

Mar 15, 2026

1.16.1

Mar 12, 2026

1.16.0

Mar 8, 2026

1.15.1

Mar 4, 2026

1.15.0

Mar 4, 2026

1.14.0

Mar 3, 2026

1.13.0

Mar 3, 2026

1.12.1

Mar 3, 2026

1.11.1

Feb 26, 2026

1.11.0

Feb 26, 2026

1.10.1

Feb 25, 2026

This version

1.6.0

Feb 19, 2026

1.5.4

Feb 19, 2026

1.5.3

Feb 15, 2026

1.5.2

Feb 15, 2026

1.5.1

Feb 15, 2026

1.5.0

Feb 14, 2026

1.4.2

Feb 14, 2026

1.4.1

Feb 14, 2026

1.4.0

Feb 14, 2026

1.3.3

Feb 13, 2026

1.3.2

Feb 13, 2026

1.3.1

Feb 12, 2026

1.3.0

Feb 12, 2026

1.2.2

Feb 9, 2026

1.2.1

Feb 9, 2026

1.2.0

Feb 9, 2026

1.1.0

Feb 8, 2026

1.0.0

Feb 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

assay_ai-1.6.0.tar.gz (184.3 kB view details)

Uploaded Feb 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

assay_ai-1.6.0-py3-none-any.whl (204.1 kB view details)

Uploaded Feb 19, 2026 Python 3

File details

Details for the file assay_ai-1.6.0.tar.gz.

File metadata

Download URL: assay_ai-1.6.0.tar.gz
Upload date: Feb 19, 2026
Size: 184.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for assay_ai-1.6.0.tar.gz
Algorithm	Hash digest
SHA256	`88d1a8306ff07b78d374f014b195a6effe7d140829d4251246c41c0756f25e1c`
MD5	`0f0a274e6a71a4acfa66e1261c538143`
BLAKE2b-256	`dc02bbe7ea623c6e834330e5f991b867558d60e8d3e04a04b107c9c26c5f7dc3`

See more details on using hashes here.

File details

Details for the file assay_ai-1.6.0-py3-none-any.whl.

File metadata

Download URL: assay_ai-1.6.0-py3-none-any.whl
Upload date: Feb 19, 2026
Size: 204.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for assay_ai-1.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a0e43e64d10231b8802732f48ddf023c094cdec79ced49731168ec1843d9893c`
MD5	`2c362abd51fa4fa7d3ec6292e270740e`
BLAKE2b-256	`d8b774c8dea4ac7862407e1f6ffd16509e66d2f3f72bdd9ac8cb1a0cc7e21fa0`

See more details on using hashes here.

assay-ai 1.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Assay

See It -- Then Understand It

How that works

Add to Your Project

CI Gate

Daily use after CI is green

Trust Model

The Evidence Compiler

Commands

Documentation

Common Issues

Get Involved

Related Repos

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes