vaara

Adaptive AI Agent Execution Layer — risk scoring, audit trails, regulatory compliance

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Vaara

These details have not been verified by PyPI

Project links

Homepage

Project description

Vaara — Built to see over the noise

Adaptive AI agent execution layer. Sits between agents and actions, scores risk in real time, and produces evidence artefacts that support EU AI Act Article 14 human-oversight and Article 12 logging obligations.

Article-by-article evidence mapping: COMPLIANCE.md.

Position paper on the European Commission's AI Alliance platform: Article 14 at runtime.

Vaara is a tool that helps deployers assemble evidence for their own conformity work. It does not itself conduct conformity assessments, certify compliance, or constitute legal advice. Deployers remain responsible for their obligations under the EU AI Act and other applicable law.

Three questions for every agent action:

Should this happen? (adaptive risk scoring with conformal prediction)
What is this? (action taxonomy with regulatory classification)
What happened and why? (hash-chained audit trail)

Why Vaara

AI governance tools audit models. Vaara governs actions.

Models are scored once at deployment. Agents act continuously at runtime -- calling tools, moving money, modifying infrastructure. Individual actions may be safe, but sequences can be catastrophic. read_data + export_data + delete_data is a data exfiltration pattern where each step alone is benign.

Vaara catches this. It learns which risk signals predict bad outcomes, adapts its scoring online, and wraps every estimate in a distribution-free confidence interval. No retraining. No manual threshold tuning.

Install

pip install vaara

Python 3.10+. Zero runtime dependencies for the default install.

For the optional ML adversarial classifier (XGBoost-based, ships with a 295 KB pre-trained bundle):

pip install vaara[ml]

This pulls in xgboost, scikit-learn, joblib, and numpy. See the Adversarial Classifier section below.

Quick Start

from vaara.pipeline import InterceptionPipeline

pipeline = InterceptionPipeline()

# Agent wants to execute a tool
result = pipeline.intercept(
    agent_id="agent-007",
    tool_name="fs.write_file",
    parameters={"path": "/etc/service.yaml", "content": "..."},
    agent_confidence=0.8,
)

if result.allowed:
    execute_tool("fs.write_file", {"path": "/etc/service.yaml", "content": "..."})
    # Report outcome so the scorer learns
    pipeline.report_outcome(result.action_id, outcome_severity=0.0)
else:
    print(f"Blocked: {result.reason}")
    # result.decision is "deny" or "escalate"
    # result.risk_score and result.risk_interval available

Adversarial Classifier (optional)

Vaara 0.5.0 ships an opt-in XGBoost classifier trained on 200 hand-crafted adversarial seeds across 8 attack categories plus 1945 LLM-generated variants and 635 benign variants. Install with pip install vaara[ml].

from vaara.adversarial_classifier import AdversarialClassifier

clf = AdversarialClassifier()  # loads bundled model, threshold=0.8 default

# Score a proposed tool call
prob = clf.score(
    tool_name="http_get",
    parameters={"url": "http://169.254.169.254/latest/meta-data/"},
    context={},
)  # -> 0.99 for SSRF to cloud metadata

if clf.is_malicious("http_get", {"url": "https://api.github.com/user"}, {}):
    # threshold-based decision using the bundle's default_threshold
    ...

Numbers (by-seed held-out test, no leakage)

At threshold 0.8 on 600 held-out test entries (50 adversarial seeds + their variants, never seen in training):

	Attack recall	Benign FPR	Balanced acc
Heuristic (default)	44%	25%	50%
Classifier @ 0.8	61%	13%	74%

Latency: 140 µs mean, 210 µs p99 (commodity CPU, no GPU at inference).

Caveats

Classifier over-triggers on legitimate uses of powerful tools (http_post, send_email, shell_exec). Live dogfood on an end-to-end LangChain-style agent loop with Qwen-generated prompts showed FPR around 50%, higher than the 13% held-out figure.
Recommended: ship with decision="escalate" (send to human-in-loop), not decision="deny".
Threshold is configurable per call. See AdversarialClassifier(threshold=0.7) for the higher-recall / higher-FPR operating point.
Reproduce: python scripts/classifier_vs_heuristic.py (requires vaara[ml], MI300X optional).

How It Works

Risk Scoring (Adaptive)

Five expert signals scored per action, combined via Multiplicative Weight Update:

Signal	What it measures
Taxonomy base	Static risk from action metadata (reversibility, blast radius, urgency)
Agent history	This agent's track record (denial rate, bad outcomes)
Sequence pattern	Dangerous action sequences (data exfiltration, privilege escalation, unauthorized tool chaining)
Burst detection	Unusual action frequency spikes
Confidence gap	High confidence on high-risk actions = suspicious

MWU learns which signals actually predict bad outcomes and shifts weight toward them. It carries a theoretical regret bound of O(√(T log N)) against the best fixed expert in hindsight (Arora–Hazan–Kale 2012).

Conformal Prediction

Every risk score is wrapped in a conformal prediction interval:

P(true_risk in [lower, upper]) >= 1 - alpha

No distributional assumptions. No model retraining. The decision uses the upper bound -- conservative by construction. Under distribution shift, FACI adaptive alpha maintains long-run coverage.

Decisions

ALLOW     — upper bound < 0.3 (configurable)
ESCALATE  — between 0.3 and 0.7 → route to human
DENY      — upper bound > 0.7

Cold start is maximally cautious: wide intervals route most actions through human review. As outcomes accumulate, intervals tighten and the system becomes autonomous.

Framework Integrations

LangChain

from vaara.integrations.langchain import VaaraCallbackHandler

pipeline = InterceptionPipeline()
handler = VaaraCallbackHandler(pipeline, agent_id="my-agent")
agent = create_react_agent(llm, tools)

result = agent.invoke(
    {"messages": [("user", "...")]},
    config={"callbacks": [handler]},
)

OpenAI Agents SDK

from vaara.integrations.openai_agents import VaaraToolGuardrail

pipeline = InterceptionPipeline()
guardrail = VaaraToolGuardrail(pipeline)
agent = Agent(name="my-agent", tools=[...], output_guardrails=[guardrail])

CrewAI

from vaara.integrations.crewai import VaaraCrewGovernance

pipeline = InterceptionPipeline()
gov = VaaraCrewGovernance(pipeline)
safe_crew = gov.governed_kickoff(crew)

MCP Server (Claude Code, Cursor)

python -m vaara.integrations.mcp_server

Add to Claude Code settings:

{
  "mcpServers": {
    "vaara": {
      "command": "python",
      "args": ["-m", "vaara.integrations.mcp_server"]
    }
  }
}

Compliance evidence

Vaara collects and maps evidence artefacts to specific article references in the EU AI Act and DORA. The output is evidence, not a conformity verdict — the deployer, with a Notified Body where applicable, owns the conformity decision.

report = pipeline.run_compliance_assessment(
    system_name="My Agent System",
    system_version="1.0.0",
)

# Article-by-article evidence mapping
for article in report.articles:
    print(f"{article.requirement.article}: {article.status.value}")
    # Article 9(1): evidence_sufficient
    # Article 12(1): evidence_sufficient
    # ...

Article references covered:

EU AI Act: Articles 9, 11–15, 61 (risk management, documentation, logging, transparency, human oversight, accuracy, post-market monitoring)
DORA: Articles 10, 12, 13 (ICT risk management, incident detection, incident response)

Full article-by-article mapping, evidence types, CLI handoff flow, and what the deployer still owns: COMPLIANCE.md.

The audit trail is hash-chained (SHA-256) and tamper-evident, which supports Article 12(1) record-keeping obligations when configured with the deployer's required log content.

Cold Start

Generate synthetic traces to pre-calibrate the scorer:

from vaara.sandbox.trace_gen import TraceGenerator

gen = TraceGenerator()
traces = gen.generate(n_traces=100)
gen.pre_calibrate(pipeline, traces)
# Calibration in minutes instead of hours

Three agent archetypes (benign, careless, adversarial) with realistic outcome distributions.

Architecture

Agent (LangChain / OpenAI / CrewAI / MCP)
    |
    v
InterceptionPipeline.intercept()
    |
    +-- ActionRegistry     ->  classify tool_name to ActionType
    +-- AdaptiveScorer     ->  MWU + conformal risk interval
    +-- AuditTrail         ->  hash-chained immutable log
    +-- ComplianceEngine   ->  EU AI Act + DORA evidence mapping
    |
    v
InterceptionResult { allowed, risk_score, risk_interval, reason }
    |
    v
Execute or Block
    |
    v
report_outcome()  ->  closes feedback loop, MWU learns

Persistence

from vaara.audit.sqlite_backend import SQLiteAuditBackend

backend = SQLiteAuditBackend("audit.db")
trail = AuditTrail(on_record=backend.write_record)
pipeline = InterceptionPipeline(trail=trail)

WAL-mode SQLite, append-only, hash chain verified on load.

Formal Specification

See docs/formal_specification.md for the mathematical foundations: MWU regret bounds, conformal coverage guarantees, convergence rates, and security properties.

Tests

pip install vaara[dev]
pytest

200+ tests, runs in <1s.

License

See LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Vaara

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.37.1

May 26, 2026

0.37.0

May 26, 2026

0.36.0

May 25, 2026

0.35.0

May 25, 2026

0.34.0

May 25, 2026

0.33.0

May 25, 2026

0.32.0

May 25, 2026

0.31.0

May 25, 2026

0.30.0

May 24, 2026

0.29.0

May 24, 2026

0.28.0

May 22, 2026

0.27.0

May 22, 2026

0.26.0

May 20, 2026

0.25.0

May 20, 2026

0.24.0

May 20, 2026

0.23.1

May 20, 2026

0.23.0

May 20, 2026

0.22.0

May 20, 2026

0.21.0

May 19, 2026

0.20.0

May 18, 2026

0.19.1

May 18, 2026

0.19.0

May 17, 2026

0.18.1

May 17, 2026

0.18.0

May 17, 2026

0.17.0

May 17, 2026

0.16.0

May 17, 2026

0.15.0

May 17, 2026

0.14.0

May 17, 2026

0.13.0

May 16, 2026

0.12.0

May 16, 2026

0.11.0

May 16, 2026

0.10.0

May 16, 2026

0.9.0

May 15, 2026

0.8.0

May 14, 2026

0.7.0

May 9, 2026

0.6.2

May 5, 2026

0.6.1

Apr 27, 2026

0.6.0

Apr 27, 2026

0.5.3

Apr 26, 2026

0.5.2

Apr 24, 2026

This version

0.5.1 yanked

Apr 23, 2026

0.5.0 yanked

Apr 23, 2026

0.4.4 yanked

Apr 22, 2026

0.4.3

Apr 21, 2026

0.4.2

Apr 20, 2026

0.4.1

Apr 20, 2026

0.4.0

Apr 20, 2026

0.3.25 yanked

Apr 19, 2026

0.3.24 yanked

Apr 19, 2026

0.3.22 yanked

Apr 19, 2026

0.3.21 yanked

Apr 19, 2026

0.3.20 yanked

Apr 19, 2026

0.3.19 yanked

Apr 19, 2026

0.3.18 yanked

Apr 19, 2026

0.3.17 yanked

Apr 19, 2026

0.3.16 yanked

Apr 19, 2026

0.3.15 yanked

Apr 19, 2026

0.3.14 yanked

Apr 19, 2026

0.3.13 yanked

Apr 19, 2026

0.3.12 yanked

Apr 19, 2026

0.3.11 yanked

Apr 19, 2026

0.3.10 yanked

Apr 19, 2026

0.3.9 yanked

Apr 19, 2026

0.3.8 yanked

Apr 19, 2026

0.3.7 yanked

Apr 19, 2026

0.3.6 yanked

Apr 19, 2026

0.3.5 yanked

Apr 19, 2026

0.3.4 yanked

Apr 19, 2026

0.3.3 yanked

Apr 19, 2026

0.3.2 yanked

Apr 19, 2026

0.3.1 yanked

Apr 19, 2026

0.3.0 yanked

Apr 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vaara-0.5.1.tar.gz (222.1 kB view details)

Uploaded Apr 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vaara-0.5.1-py3-none-any.whl (194.0 kB view details)

Uploaded Apr 23, 2026 Python 3

File details

Details for the file vaara-0.5.1.tar.gz.

File metadata

Download URL: vaara-0.5.1.tar.gz
Upload date: Apr 23, 2026
Size: 222.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for vaara-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`887d967e1d149f5cee670b6081835e8163a935b78468067d7167cc840826f8de`
MD5	`6d103f401f36e67c70fcec031a031cdd`
BLAKE2b-256	`a1f42921d68c7bca2bc4535ba75f2e2a598a8db2d7bf8dabe58b9c6104ea0519`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vaara-0.5.1.tar.gz:

Publisher: release.yml on vaaraio/vaara

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vaara-0.5.1.tar.gz
- Subject digest: 887d967e1d149f5cee670b6081835e8163a935b78468067d7167cc840826f8de
- Sigstore transparency entry: 1364518372
- Sigstore integration time: Apr 23, 2026
Source repository:
- Permalink: vaaraio/vaara@0f5460047d75ed3d8cd3672d7162467f2d7f7873
- Branch / Tag: refs/tags/v0.5.1
- Owner: https://github.com/vaaraio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@0f5460047d75ed3d8cd3672d7162467f2d7f7873
- Trigger Event: push

File details

Details for the file vaara-0.5.1-py3-none-any.whl.

File metadata

Download URL: vaara-0.5.1-py3-none-any.whl
Upload date: Apr 23, 2026
Size: 194.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for vaara-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`49693846963a720b4f703ccfec51bfd20be3c81f80778e946ffe1f7a74de6d85`
MD5	`022d2d9f3768df9a542d8dcf56266f17`
BLAKE2b-256	`7d0180c46a42990b911db52f41c2844d0ee52e1e4bffdeafb5b617440fd453d7`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vaara-0.5.1-py3-none-any.whl:

Publisher: release.yml on vaaraio/vaara

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vaara-0.5.1-py3-none-any.whl
- Subject digest: 49693846963a720b4f703ccfec51bfd20be3c81f80778e946ffe1f7a74de6d85
- Sigstore transparency entry: 1364518382
- Sigstore integration time: Apr 23, 2026
Source repository:
- Permalink: vaaraio/vaara@0f5460047d75ed3d8cd3672d7162467f2d7f7873
- Branch / Tag: refs/tags/v0.5.1
- Owner: https://github.com/vaaraio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@0f5460047d75ed3d8cd3672d7162467f2d7f7873
- Trigger Event: push

vaara 0.5.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Why Vaara

Install

Quick Start

Adversarial Classifier (optional)

Numbers (by-seed held-out test, no leakage)

Caveats

How It Works

Risk Scoring (Adaptive)

Conformal Prediction

Decisions

Framework Integrations

LangChain

OpenAI Agents SDK

CrewAI

MCP Server (Claude Code, Cursor)

Compliance evidence

Cold Start

Architecture

Persistence

Formal Specification

Tests

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance