vaara

Runtime evidence layer for AI agents under the EU AI Act: policy-gated tool calls, hash-chained tamper-evident audit trails with external time anchoring, and independently verifiable attestation plus execution receipts per MCP tool call

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Vaara

These details have not been verified by PyPI

Project links

Project description

Vaara

Your AI agent transferred the funds, wrote the file, called the tool. Later, someone who does not trust you asks you to prove exactly what it did and why. A regulator, an auditor, a customer after an incident. Your own logs will not settle it, because you could have edited them.

Vaara is an open-source evidence layer for AI governance. It checks every agent tool call against your policy, writes the call and its outcome into a hash-chained, signed record, and binds that record to your machine's own TPM 2.0 hardware root. An outside party can verify the whole trail offline, with no access to your system and none of your software. EU AI Act Article 12 record-keeping is what it was built for; it answers any "show me what the agent actually did" just as well.

It runs entirely in your own environment. No SaaS, no telemetry. Python 3.10+, zero runtime dependencies.

Install and first call

pip install vaara

Releases ship SLSA Build Level 3 provenance, verifiable with slsa-verifier verify-artifact. Optional ML classifier: pip install 'vaara[ml]'.

from vaara.pipeline import InterceptionPipeline

pipeline = InterceptionPipeline()
result = pipeline.intercept(
    agent_id="agent-007",
    tool_name="fs.write_file",
    parameters={"path": "/etc/service.yaml", "content": "..."},
    agent_confidence=0.8,
)
if result.allowed:
    pipeline.report_outcome(result.action_id, outcome_severity=0.0)
else:
    print(result.reason)

Every call gets a risk score and an allow / block / escalate decision against your policy, then the call, the decision, and the real outcome are written to the audit trail. report_outcome closes the loop: the scorer reweights based on which signals actually predicted the outcome.

That is the whole loop. The rest of this page is what makes the record worth keeping.

Verify it without trusting the producer

Writing a trail is the easy half. The half that matters is letting someone who does not trust you check it, with no key, no access, and none of your code. Every Vaara record is content-addressed and fail-closed on authenticity, and ships with public conformance vectors plus a standalone checker that imports no Vaara code, so an independent party reproduces every verdict offline.

vaara verify-bundle evidence-bundle.json

ok only when a signature is actually established, not merely present in a log. The same property drives the standards work behind SEP-2828: evidence that holds up for someone who runs none of your software. The full verifier set, the trust model for each verb, and where trust comes from in each case are in docs/verifying-evidence.md.

What the evidence looks like

vaara compliance report --format json against a real trail produces an article-level evidence record an auditor reads directly. Articles with no recorded events return evidence_insufficient, not a rubber stamp.

{
  "system_name": "Acme HR Assistant",
  "overall_status": "evidence_insufficient",
  "trail_integrity": {"size": 105, "chain_intact": true},
  "articles": [
    {"article": "Article 12(1)", "title": "Record-Keeping (Logging)",
     "status": "evidence_sufficient", "strength": "strong", "evidence_count": 105},
    {"article": "Article 15(1)", "title": "Accuracy, Robustness and Cybersecurity",
     "status": "evidence_insufficient", "strength": "absent", "evidence_count": 0}
  ]
}

Each verdict carries the threshold-versus-observed snapshot, the rationale, and the underlying records, so a reviewer traces status back to a concrete event. The same data renders as a Notified-Body PDF, a static HTML dashboard, or a Sigstore-signed handoff envelope. See docs/COMPLIANCE.md.

What you get

Gate every tool call against your own policy: allow, block, or escalate.
A tamper-evident trail an outside party verifies without trusting your stack, with the chain head anchorable to an external RFC 3161 / eIDAS timestamp so its existence is provable against a clock you do not control.
Article-level EU AI Act evidence, honest about the gaps instead of papering over them.
Governance of the model call itself, not only the tools around it: a hardware-rooted inference receipt that a second, different local model cross-checks. This is the sovereign inference harness, new in v1.0.

Where it plugs in

Native adapters route the major Python agent frameworks through the same pipeline, each via the framework's own hook, emitting identical audit events:

Framework	Entry point
LangChain	`VaaraCallbackHandler`, `vaara_wrap_tool`
CrewAI	`VaaraCrewGovernance`
OpenAI Agents SDK	`VaaraToolGuardrail`, `vaara_wrap_function`
MCP server	`vaara.integrations.mcp_server`

To put Vaara in front of an MCP server, run it as a proxy. Every tools/call routes through the pipeline before reaching the upstream; allowed calls forward transparently, blocked calls return an MCP error.

vaara-mcp-proxy \
  --upstream npx --upstream-arg -y --upstream-arg @sap/mdk-mcp-server \
  --db ./mcp_audit.db

Point your MCP client (Claude Code, Cursor, any host) at the proxy instead of the upstream. There is also an HTTP API (pip install 'vaara[server]', vaara serve) and a first-party TypeScript client on npm (@vaara/client) for non-Python agents. Framework details, the cloud and OSS guardrail adapters (Bedrock, Azure, GCP, NeMo, Guardrails AI, LLM Guard, Rebuff), and the multi-tenant proxy are in docs/adapters.md.

How it scores

Each risk score blends five expert signals and keeps adapting as outcomes come back, and it carries a confidence interval with a coverage guarantee that holds regardless of the input distribution. On a held-out adversarial corpus the classifier reaches 84.7% recall (95% Wilson [82.4, 86.7]) at a 4.1% false-positive rate, and 1.2% FPR on benign calls under live injection pressure. The hot-path rule scorer adds 140 µs mean per call on commodity CPU; the ML classifier is opt-in (vaara[ml]) and off that path. Every figure is reproducible via make bench.

Full numbers, corpus, calibration, and chain of custody

12,155-entry adversarial corpus (250 hand-curated + 11,905 LLM-generated), 70/15/15 split stratified by (category, source).
Classifier v9 (236 hand-features + 384-dim MiniLM embeddings) at calibrated threshold 0.9150 on held-out TEST n=1,827: recall 84.7% [82.4, 86.7] at FPR 4.1% [2.9, 5.7].
Cross-model held-out recall 66.8% [64.9, 68.7] over n=2,277 with no eval-set attacker model in TRAIN; the weakest sub-cell is data_exfil against a closed-weight model at 38.9%. This is the honest worst case; the in-distribution number above is the easier denominator.
BIPIA-pressure FPR on benign tool calls 1.2% [0.4, 3.6] across four agent backends (Claude Haiku 4.5, Llama-3.1-8B, Mistral-7B, Qwen-2.5-7B). Down from 35.2% on v8.
Multi-attacker PAIR robustness: 0/25 successes per attacker across Qwen2.5-32B, Qwen2.5-72B, Llama-3.3-70B on identical seeds, Wilson upper 13.3%.
Distribution-free conformal coverage on the score; MWU regret bound O(sqrt(T log N)).
Chain of custody: corpus, split, training commit, and bundle SHAs locked and printed by every script.

Method and per-cell breakdown: docs/architecture.md and bench/.

Standards and attestation

SEP-2828 signed execution records and SEP-2787 request-attestation test vectors, in the MCP standards process. A second independent implementation has reproduced the SEP-2828 conformance vectors from a clean checkout with no shared code.
OVERT 1.0 (overt.is): Vaara is the Arbiter and emits Protocol Profile 1.0 Base Envelopes (canonical CBOR, Ed25519) alongside every record when attestation is on.
Post-quantum: an optional parallel ML-DSA-65 / FIPS 204 signature over the same preimage, so a stripped post-quantum signature is a detectable downgrade rather than a silent loss.
Sovereign inference harness (v1.0): a local model behind a signing proxy that emits a hardware-rooted inference receipt a second local model cross-checks. Developed privately, published here under AGPL-3.0.

Details and the offline checkers for each: docs/standards.md.

Docs

Path	Contents
docs/verifying-evidence.md	Every verifier and its trust model
docs/architecture.md	Scoring, conformal coverage, time anchor, formal properties
docs/standards.md	SEP-2828, SEP-2787, OVERT, the sovereign inference harness
docs/adapters.md	Framework and cloud/OSS guardrail adapters, multi-tenant proxy
docs/COMPLIANCE.md	EU AI Act and DORA article mapping, eval numbers
CHANGELOG.md	Version-by-version evolution
docs/PRIOR_ART.md	When each concept first shipped, plus adjacent work

Acknowledgements

Listed in the industry acknowledgements of the IMDA Model AI Governance Framework for Agentic AI v1.5 (Singapore, 20 May 2026).
The AMD AI Developer Program ran a developer testimonial of Vaara in May 2026.
Article 14 runtime: why oversight of agentic AI has to be evidenced as action, not model, the position post on the EU Apply AI Alliance Futurium.

Vaara helps deployers assemble evidence for their own conformity work. It does not certify compliance or constitute legal advice. Deployers own their obligations under the EU AI Act and other applicable law.

License

AGPL-3.0-or-later. See LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Vaara

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.3.1

Jun 20, 2026

1.3.0

Jun 19, 2026

1.2.1

Jun 19, 2026

1.2.0

Jun 19, 2026

This version

1.1.1

Jun 18, 2026

1.1.0

Jun 18, 2026

1.0.3

Jun 17, 2026

1.0.2

Jun 17, 2026

1.0.1

Jun 17, 2026

1.0.0

Jun 16, 2026

0.70.0

Jun 12, 2026

0.69.0

Jun 11, 2026

0.68.0

Jun 9, 2026

0.67.0

Jun 9, 2026

0.66.0

Jun 9, 2026

0.65.0

Jun 9, 2026

0.64.0

Jun 8, 2026

0.63.0

Jun 8, 2026

0.62.0

Jun 8, 2026

0.61.3

Jun 8, 2026

0.61.2

Jun 8, 2026

0.61.1

Jun 8, 2026

0.61.0

Jun 8, 2026

0.60.0

Jun 7, 2026

0.59.0

Jun 7, 2026

0.58.0

Jun 5, 2026

0.57.0

Jun 5, 2026

0.56.0

Jun 5, 2026

0.55.0

Jun 5, 2026

0.54.0

Jun 5, 2026

0.53.0

Jun 5, 2026

0.52.0

Jun 2, 2026

0.51.0

Jun 2, 2026

0.50.0

Jun 1, 2026

0.49.0

May 31, 2026

0.48.0

May 31, 2026

0.47.0

May 31, 2026

0.46.0

May 31, 2026

0.45.1

May 30, 2026

0.45.0

May 30, 2026

0.44.0

May 30, 2026

0.43.0

May 29, 2026

0.42.0

May 29, 2026

0.41.0

May 28, 2026

0.40.4

May 28, 2026

0.40.3

May 28, 2026

0.40.2

May 28, 2026

0.40.1

May 28, 2026

0.40.0

May 27, 2026

0.39.2

May 27, 2026

0.39.1

May 27, 2026

0.39.0

May 27, 2026

0.38.0

May 27, 2026

0.37.1

May 26, 2026

0.37.0

May 26, 2026

0.36.0

May 25, 2026

0.35.0

May 25, 2026

0.34.0

May 25, 2026

0.33.0

May 25, 2026

0.32.0

May 25, 2026

0.31.0

May 25, 2026

0.30.0

May 24, 2026

0.29.0

May 24, 2026

0.28.0

May 22, 2026

0.27.0

May 22, 2026

0.26.0

May 20, 2026

0.25.0

May 20, 2026

0.24.0

May 20, 2026

0.23.1

May 20, 2026

0.23.0

May 20, 2026

0.22.0

May 20, 2026

0.21.0

May 19, 2026

0.20.0

May 18, 2026

0.19.1

May 18, 2026

0.19.0

May 17, 2026

0.18.1

May 17, 2026

0.18.0

May 17, 2026

0.17.0

May 17, 2026

0.16.0

May 17, 2026

0.15.0

May 17, 2026

0.14.0

May 17, 2026

0.13.0

May 16, 2026

0.12.0

May 16, 2026

0.11.0

May 16, 2026

0.10.0

May 16, 2026

0.9.0

May 15, 2026

0.8.0

May 14, 2026

0.7.0

May 9, 2026

0.6.2

May 5, 2026

0.6.1

Apr 27, 2026

0.6.0

Apr 27, 2026

0.5.3

Apr 26, 2026

0.5.2

Apr 24, 2026

0.5.1 yanked

Apr 23, 2026

0.5.0 yanked

Apr 23, 2026

0.4.4 yanked

Apr 22, 2026

0.4.3

Apr 21, 2026

0.4.2

Apr 20, 2026

0.4.1

Apr 20, 2026

0.4.0

Apr 20, 2026

0.3.25 yanked

Apr 19, 2026

0.3.24 yanked

Apr 19, 2026

0.3.22 yanked

Apr 19, 2026

0.3.21 yanked

Apr 19, 2026

0.3.20 yanked

Apr 19, 2026

0.3.19 yanked

Apr 19, 2026

0.3.18 yanked

Apr 19, 2026

0.3.17 yanked

Apr 19, 2026

0.3.16 yanked

Apr 19, 2026

0.3.15 yanked

Apr 19, 2026

0.3.14 yanked

Apr 19, 2026

0.3.13 yanked

Apr 19, 2026

0.3.12 yanked

Apr 19, 2026

0.3.11 yanked

Apr 19, 2026

0.3.10 yanked

Apr 19, 2026

0.3.9 yanked

Apr 19, 2026

0.3.8 yanked

Apr 19, 2026

0.3.7 yanked

Apr 19, 2026

0.3.6 yanked

Apr 19, 2026

0.3.5 yanked

Apr 19, 2026

0.3.4 yanked

Apr 19, 2026

0.3.3 yanked

Apr 19, 2026

0.3.2 yanked

Apr 19, 2026

0.3.1 yanked

Apr 19, 2026

0.3.0 yanked

Apr 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vaara-1.1.1.tar.gz (1.3 MB view details)

Uploaded Jun 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vaara-1.1.1-py3-none-any.whl (1.0 MB view details)

Uploaded Jun 18, 2026 Python 3

File details

Details for the file vaara-1.1.1.tar.gz.

File metadata

Download URL: vaara-1.1.1.tar.gz
Upload date: Jun 18, 2026
Size: 1.3 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for vaara-1.1.1.tar.gz
Algorithm	Hash digest
SHA256	`637fa189d7f8173c179015da2085922c861d4dd7a1164dfc0a638af263cb4d7b`
MD5	`2bc62b71948d1ac3f16981ce4e8c16ec`
BLAKE2b-256	`2d09db9eddb7e8799207a7da8ce61b17ca4ee530008e2b8442b3a4e3cf74296b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vaara-1.1.1.tar.gz:

Publisher: release.yml on vaaraio/vaara

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vaara-1.1.1.tar.gz
- Subject digest: 637fa189d7f8173c179015da2085922c861d4dd7a1164dfc0a638af263cb4d7b
- Sigstore transparency entry: 1859835506
- Sigstore integration time: Jun 18, 2026
Source repository:
- Permalink: vaaraio/vaara@088a869d20fe577719175251588ae66b871d1cef
- Branch / Tag: refs/tags/v1.1.1
- Owner: https://github.com/vaaraio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@088a869d20fe577719175251588ae66b871d1cef
- Trigger Event: push

File details

Details for the file vaara-1.1.1-py3-none-any.whl.

File metadata

Download URL: vaara-1.1.1-py3-none-any.whl
Upload date: Jun 18, 2026
Size: 1.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for vaara-1.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7fbb9c763803f4ccfcfe91e3aadc65125f2ab7ab5e16df0e100b9aca62860583`
MD5	`4310293633aa1a6e1f5c8cd908e5408d`
BLAKE2b-256	`74056480d96eb6cabcbf83d113eee3d2cfdab91e74e460340039b7aa44bef732`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vaara-1.1.1-py3-none-any.whl:

Publisher: release.yml on vaaraio/vaara

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vaara-1.1.1-py3-none-any.whl
- Subject digest: 7fbb9c763803f4ccfcfe91e3aadc65125f2ab7ab5e16df0e100b9aca62860583
- Sigstore transparency entry: 1859835602
- Sigstore integration time: Jun 18, 2026
Source repository:
- Permalink: vaaraio/vaara@088a869d20fe577719175251588ae66b871d1cef
- Branch / Tag: refs/tags/v1.1.1
- Owner: https://github.com/vaaraio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@088a869d20fe577719175251588ae66b871d1cef
- Trigger Event: push

vaara 1.1.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

Install and first call

Verify it without trusting the producer

What the evidence looks like

What you get

Where it plugs in

How it scores

Standards and attestation

Docs

Acknowledgements

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance