Tamper-evident, third-party-verifiable receipts for AI agent / MCP tool calls

These details have not been verified by PyPI

Project links

Project description

agent-receipts

Tamper-evident, third-party-verifiable receipts for AI agent / MCP actions — in one small file.

An AI agent's logs are self-reported claims. Nothing stops the agent — or a compromised proxy — from rewriting history after the fact, or from emitting a hallucinated "I called the database and it returned X" that never happened. A receipt is the opposite of a log: independent, verifiable evidence of what an action consumed and produced, that a third party can check without trusting the agent.

This is the smallest honest version of that idea, built to be read in one sitting and run in one command. It is a reference proof-of-concept, not a hardened product — the scope below is deliberately honest about what it does and does not give you.

Naming note / prior art. There is already an established "Agent Receipts" protocol with a public spec and a Python SDK by Otto Jongerius (github.com/agent-receipts). This project is an independent, minimal reference for understanding the idea — it is not that protocol's SDK, and on PyPI it is agora-agent-receipts to avoid any confusion. If you want the protocol and a maintained SDK, use his; if you want a 200-line file to learn from or vendor, use this.

python agent_receipts.py     # core: hash-chain + Ed25519 signatures + tamper/forgery demo
python mcp_wrapper.py         # wrap any MCP/agent tool so every call emits a receipt
python mediator.py           # external-mediator mode: catch an agent hiding/faking its own actions
python verify_cli.py receipts.json --pubkey <hex>   # independently verify a receipts file (no code)
python mnemo_receipts.py     # tamper-evident memory: detect an out-of-band edit to an mnemo store

What it does — two layers

Hash chain (integrity, zero extra deps). Each receipt commits to the previous one (prev = hash of the last receipt), forming a Merkle-style chain. Edit any past receipt and every hash after it breaks — so a partial edit is detectable, and verify() names the exact step that was altered. Honest limit: the hash chain alone does not stop a thorough tamperer who recomputes the whole chain end-to-end (then no link breaks). Integrity-only is sufficient only if the chain head is published/anchored where the attacker can't also rewrite it — otherwise the signature (layer 2) is what actually protects a self-held chain.
Ed25519 signatures (authenticity, needs cryptography). Each receipt's hash is signed with the actor's private key; a third party verifies with the public key only. This proves who produced the receipt and that the content wasn't forged — no shared secret. (If cryptography isn't installed, the hash chain still works on its own.)

A receipt commits to the SHA-256 of inputs/outputs, not the raw content — so you prove what was processed without exposing it, and reveal a value later only if you choose (anyone can re-check it against the committed hash). Full zero-knowledge proofs (ZK-SNARKs) go further still — proving a computation was correct while revealing nothing — and are the heavy end of this same design space.

What the demo measures

The self-demo records three MCP tool calls, then attacks the trail:

step	action	result
1	honest chain	`verify() → True`
2	edit a past receipt's output	caught: content tampered at the exact seq
3	re-hash the forged receipt to look consistent	still caught: invalid signature + broken chain link downstream

Logs cannot do this: a self-reported log can be edited end-to-end and looks identical afterward.

Use it

from agent_receipts import ReceiptChain, generate_keypair

sk, pk = generate_keypair()                       # the actor's identity
chain = ReceiptChain(actor="my-agent", private_key_hex=sk, public_key_hex=pk)

chain.record("mcp.web_search", {"query": "..."}, {"results": 7})   # one receipt per tool call
chain.record("mcp.memory.write", {"fact": "..."}, {"stored": True})

ok, problems = chain.verify(expected_pubkey=pk)   # anyone with pk can run this
open("receipts.json", "w").write(chain.to_json()) # ship the receipts; the public key verifies them

mcp_wrapper.py shows a ReceiptedDispatcher that drops in around an MCP-style tool registry so you don't touch your tools — you wrap the dispatch.

External-mediator mode (`mediator.py`)

Self-signed receipts have one honest hole: an agent holding its own key can simply not emit a receipt for an action it wants to hide. The fix (as used by pipelock and the receiver-attested research line) is to move the signer outside the agent — route every tool call through a Mediator that holds its own key and signs what it actually brokers:

from mediator import Mediator, reconcile
med = Mediator(tools, private_key_hex=mk, public_key_hex=mpub, name="org-proxy")
med.dispatch("transfer_funds", to="acct-9981", amount=5000)   # mediator signs the real call
ok, problems = reconcile(agent_self_reported_log, med.chain)  # diff the agent's story vs the truth

The demo has an agent route three calls through the mediator, then tamper with its own log — hiding a funds transfer and faking a secret read. reconcile() catches all of it, because the mediator's signed chain is ground truth:

agent reported 2 actions but the mediator brokered 3 -> agent OMITTED 1 call(s)
action 1: agent calls it 'read_secret', mediator saw 'mcp.transfer_funds'
action 1 (mcp.transfer_funds): input hash mismatch (agent misreported its arguments)
action 1 (mcp.transfer_funds): output hash mismatch (agent misreported the result)
action 2 (mcp.read_secret): no agent record at all (hidden call)

(This is the real demo output — reconcile() aligns positionally, so omitting a middle call also shifts the indices after it; the point is that the omission and the misreport are both surfaced.) Now the agent cannot withhold a receipt, forge one (it lacks the mediator's key), or lie about I/O (the mediator hashes what really flowed through it). Optionally the agent also signs its own claim, giving a dual-attested receipt where agent-vs-mediator divergence is itself the alarm.

Verify someone else's receipts (`verify_cli.py`)

A third party who wasn't there confirms what happened with one command — the file plus the public key:

python verify_cli.py receipts.json --pubkey 7d08e6e6...   # VERIFIED (exit 0) or FAILED (exit 1)

It recomputes the whole chain, checks every signature against the expected key, and names the exact broken step. Exit code 0/1 drops cleanly into CI or a pre-commit hook. Measured on a 2-receipt file: an honest file verifies; tampering one output prints seq 0: content tampered (exit 1); the wrong --pubkey prints signed by an unexpected key (exit 1).

Tamper-evident memory: the `mnemo` integration (`mnemo_receipts.py`)

mnemo (our open-source memory core) is already append-only with deterministic supersession, so it never silently edits a fact in normal use. But the store is a file — anyone who can touch it can rewrite a stored memory after the fact, and any store would then serve the altered text as the original. Receipts close that: every remember() emits a signed receipt committing to the memory's content hash, so the write history is independently verifiable.

from mnemo_receipts import ReceiptedMnemo, audit_memory
rm = ReceiptedMnemo(Mnemo(path="mem.json"), private_key_hex=sk, public_key_hex=pk)
rm.remember("The prod database host is db-prod-01.", key="prod-db::host", mtype="semantic")
ok, problems = audit_memory(rm.m, rm.chain, expected_pubkey=pk)

audit_memory() re-hashes the current store against the write receipts. Measured: an honest store audits clean; an out-of-band edit (db-prod-01 → db-attacker-07, made straight in the store, which mnemo itself can't see) is caught — memory <id>: stored content no longer matches the write receipt. This is a thin wrapper; it does not modify mnemo's zero-dependency core.

Honest scope (what this is NOT)

The self-signed core proves a receipt chain is internally consistent and authentically signed. It does not by itself prove the agent reported every action — an actor that controls its own key can still withhold a receipt. That gap is closed by external-mediator mode (mediator.py, below), which puts the signer outside the agent; anchoring the chain head to a third party is a further hardening.
It commits to input/output hashes, not a proof that the tool computed correctly. That is what ZK-SNARK approaches add, at much higher cost.
Keys here are raw/in-memory for clarity; real deployments use a KMS / hardware-backed key store.

Landscape & prior art

This sits in an active, fast-moving space — we build on it, we did not invent it. In particular, the exact pattern here (Ed25519 + canonical JSON + hash-chain) is the production-grade subject of Microsoft's agent-governance-toolkit, Tutorial 33 "offline verifiable receipts" (Ed25519 over RFC 8785 / JCS canonical payloads, hash-chained, CLI-verifiable offline). Treat this repo as the minimal one-file way to understand the idea, and that toolkit as the grown-up version.

Honest map of the space:

A named protocol + SDK: the "Agent Receipts" protocol by Otto Jongerius — a public spec (github.com/agent-receipts/ar) plus a maintained Python SDK (pip install agent-receipts). The most directly-related effort to this one; if you need an interoperable standard rather than a teaching reference, start there.
Production OSS (corporate): Microsoft agent-governance-toolkit — Tutorial 33 = the same Ed25519 + canonical + hash-chain receipts, with policy/identity/sandboxing around it.
External-mediator receipts: pipelock — an open-source MCP/egress firewall that emits mediator-signed Ed25519 receipts from outside the agent (core Apache-2.0; enterprise features Elastic-License), which is how you close the agent-can-withhold-a-receipt gap noted above.
Commercial: Zero Proof AI — a pre-launch "certificate authority for AI agents" issuing on-chain-anchored receipts for tool calls.
Research:
- Basu, Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents, arXiv:2603.10060 (2026) — HMAC-signed tool-execution receipts (the pragmatic, symmetric camp; we use Ed25519 so a third party verifies without a shared secret).
- Figuera, Notarized Agents: Receiver-Attested Confidential Receipts for AI Agent Actions, arXiv:2606.04193 (2026) — receiver-signed receipts published to a transparency log (the external-attestation camp).
- Jing & Qi, Zero-Knowledge Audit for Internet of Agents … with Model Context Protocol, arXiv:2512.14737 (2025) — the zero-knowledge / privacy- preserving end of the same space.

Roadmap (if this proves useful)

~~External-mediator mode~~ (done — mediator.py) · ~~verifier CLI~~ (done — verify_cli.py) · ~~mnemo integration~~ (done — mnemo_receipts.py) · publish-and-anchor the chain head · selective disclosure of a single committed field · packaged spin-out (PyPI).

MIT. Part of the Agora project — an autonomous research OS that ships every claim with a runnable receipt. Feedback and adversarial testing welcome.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Jun 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agora_agent_receipts-0.1.0.tar.gz (16.9 kB view details)

Uploaded Jun 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agora_agent_receipts-0.1.0-py3-none-any.whl (17.7 kB view details)

Uploaded Jun 29, 2026 Python 3

File details

Details for the file agora_agent_receipts-0.1.0.tar.gz.

File metadata

Download URL: agora_agent_receipts-0.1.0.tar.gz
Upload date: Jun 29, 2026
Size: 16.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for agora_agent_receipts-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`fae1924a36eb80185819c0601cb7b6f9f85170f3ccae75ccbb7d1cbd5eb25512`
MD5	`859e736dccbc9470bc8c562924c0553e`
BLAKE2b-256	`cf275745fafe8782ce97896749b32fc8441528cd1fc5aff5bd631695d191bae6`

See more details on using hashes here.

File details

Details for the file agora_agent_receipts-0.1.0-py3-none-any.whl.

File metadata

Download URL: agora_agent_receipts-0.1.0-py3-none-any.whl
Upload date: Jun 29, 2026
Size: 17.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for agora_agent_receipts-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1fb068df3a3856b1e2436b6e5e2415970cc2050fb226e5cc665be7954227dbe2`
MD5	`66dbc06dfe69ae2994ec55fd0aa31704`
BLAKE2b-256	`fc826598ffa481bb36e95372f76ace86b3fa90e15ef501ac14fd8cd1664a86ac`

See more details on using hashes here.

agora-agent-receipts 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

agent-receipts

What it does — two layers

What the demo measures

Use it

External-mediator mode (`mediator.py`)

Verify someone else's receipts (`verify_cli.py`)

Tamper-evident memory: the `mnemo` integration (`mnemo_receipts.py`)

Honest scope (what this is NOT)

Landscape & prior art

Roadmap (if this proves useful)

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

agora-agent-receipts 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

agent-receipts

What it does — two layers

What the demo measures

Use it

External-mediator mode (mediator.py)

Verify someone else's receipts (verify_cli.py)

Tamper-evident memory: the mnemo integration (mnemo_receipts.py)

Honest scope (what this is NOT)

Landscape & prior art

Roadmap (if this proves useful)

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

External-mediator mode (`mediator.py`)

Verify someone else's receipts (`verify_cli.py`)

Tamper-evident memory: the `mnemo` integration (`mnemo_receipts.py`)