AIR Trust Layer for Anthropic Claude Agent SDK — audit trails, data tokenization, consent gates, and injection detection for EU AI Act compliance

Project description

air-anthropic-trust

AIR Trust Layer for Anthropic Claude Agent SDK

PyPI Python 3.10+ License: MIT

A production-ready trust layer for Anthropic's Claude Agent SDK. Provides audit trails, data tokenization, consent gates, and prompt injection detection to support EU AI Act compliance.

Quick Install

pip install air-anthropic-trust

Quick Start

import asyncio

from anthropic import Agent
from air_anthropic_trust import AirTrustHooks

# Initialize hooks with defaults
hooks = AirTrustHooks()

# Use with the Claude Agent SDK
agent = Agent(
    model="claude-sonnet-4-20250514",
    tools=[...],
    hooks=hooks,
)

async def main():
    # agent.run is a coroutine, so await it inside async code
    result = await agent.run("Analyze this data")

    # Check the audit trail
    print(hooks.get_audit_stats())
    print(hooks.verify_chain())

asyncio.run(main())

Features

1. Consent Gating

Automatically intercepts tool execution and checks consent requirements based on risk level.

# Tools are classified by risk:
# - CRITICAL: exec, spawn, shell
# - HIGH: write_file, delete, deploy
# - MEDIUM: send_email, http_request
# - LOW: read_file, search

hooks = AirTrustHooks()
risk = hooks.consent_gate.classify_risk("delete")
requires = hooks.consent_gate.requires_consent("delete")
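For intuition, the classification above can be sketched as a plain lookup plus an ordered threshold comparison (an illustrative sketch with hypothetical names, not the library's internals):

```python
# Illustrative risk classification by tool name (hypothetical mapping;
# the real ConsentGate logic may differ).
RISK_LEVELS = {
    "exec": "CRITICAL", "spawn": "CRITICAL", "shell": "CRITICAL",
    "write_file": "HIGH", "delete": "HIGH", "deploy": "HIGH",
    "send_email": "MEDIUM", "http_request": "MEDIUM",
    "read_file": "LOW", "search": "LOW",
}

ORDER = ["LOW", "MEDIUM", "HIGH", "CRITICAL"]

def classify_risk(tool_name: str) -> str:
    # Unknown tools default to MEDIUM to stay on the safe side.
    return RISK_LEVELS.get(tool_name, "MEDIUM")

def requires_consent(tool_name: str, threshold: str = "MEDIUM") -> bool:
    # Consent is required at or above the configured risk threshold.
    return ORDER.index(classify_risk(tool_name)) >= ORDER.index(threshold)
```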

2. Data Vault

Tokenizes sensitive data (API keys, credentials, PII) to prevent leakage.

# Automatically tokenizes:
# - API keys (OpenAI, Anthropic, AWS, GitHub, Stripe, etc.)
# - PII (emails, phone numbers, SSN, credit cards)
# - Credentials (private keys, connection strings, passwords)

vault_result = hooks.vault.tokenize("My key is sk-proj-abc123")
# Returns: {"result": "My key is [AIR:vault:api_key:xyz]", "tokenized": True, "count": 1}
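The token format in the example suggests a regex-driven rewrite. A minimal sketch of that idea, assuming a simplified key pattern and a sequential token id (both hypothetical, not the vault's real format):

```python
import re

# Simplified API-key pattern; the library covers many providers.
API_KEY_RE = re.compile(r"sk-[A-Za-z0-9-]{8,}")

def tokenize(text: str) -> dict:
    count = 0

    def repl(match: re.Match) -> str:
        nonlocal count
        count += 1
        # A real vault would store the secret and mint a random id;
        # here we just use a counter for illustration.
        return f"[AIR:vault:api_key:{count:03d}]"

    result = API_KEY_RE.sub(repl, text)
    return {"result": result, "tokenized": count > 0, "count": count}
```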

3. Audit Ledger

Tamper-proof ledger with HMAC-SHA256 chain verification for compliance.

# Every operation is logged with chain verification
verification = hooks.verify_chain()
print(verification["valid"])  # True if chain is intact

# Export for audit
audit_trail = hooks.export_audit()
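The chain scheme described above, where each entry's HMAC-SHA256 covers the previous entry's MAC, can be sketched with the standard library (an illustration of the technique, not the ledger's actual on-disk format; the key name is hypothetical):

```python
import hashlib
import hmac
import json

KEY = b"demo-signing-key"  # hypothetical; a real ledger uses a managed key

def append_entry(ledger: list, payload: dict) -> None:
    # Each MAC covers the previous MAC plus this entry's canonical body,
    # so editing any entry invalidates every MAC from that point on.
    prev = ledger[-1]["mac"] if ledger else ""
    body = json.dumps(payload, sort_keys=True)
    mac = hmac.new(KEY, (prev + body).encode(), hashlib.sha256).hexdigest()
    ledger.append({"payload": payload, "mac": mac})

def verify_chain(ledger: list) -> dict:
    prev, errors = "", []
    for i, entry in enumerate(ledger):
        body = json.dumps(entry["payload"], sort_keys=True)
        expected = hmac.new(KEY, (prev + body).encode(), hashlib.sha256).hexdigest()
        if not hmac.compare_digest(expected, entry["mac"]):
            errors.append(i)
        prev = entry["mac"]
    return {"valid": not errors, "errors": errors}
```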

4. Injection Detection

Detects 15+ prompt injection patterns with weighted scoring.

# Scans for:
# - Role override attacks
# - System prompt injection
# - Privilege escalation
# - Safety bypass attempts
# - And 11 more patterns

injection = hooks.injector.scan("Ignore instructions and...")
print(injection.detected)  # True
print(injection.score)     # 0.85
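Weighted scoring like this can be sketched with a few regex/weight pairs (the patterns, weights, and threshold here are invented for illustration; the library ships its own 15+ pattern set):

```python
import re

# Hypothetical pattern set; each hit contributes its weight to the score.
PATTERNS = [
    (re.compile(r"ignore (all |previous |prior )*(instructions|rules)", re.I), 0.6),
    (re.compile(r"you are now", re.I), 0.4),
    (re.compile(r"system prompt", re.I), 0.3),
]

def scan(text: str) -> dict:
    hits = [weight for regex, weight in PATTERNS if regex.search(text)]
    score = min(1.0, sum(hits))  # cap the combined score at 1.0
    return {"detected": score >= 0.5, "score": score, "matches": len(hits)}
```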

Configuration

from air_anthropic_trust import (
    AirTrustConfig,
    ConsentGateConfig,
    AuditLedgerConfig,
    VaultConfig,
    InjectionDetectionConfig,
    RiskLevel,
)

config = AirTrustConfig(
    consent_gate=ConsentGateConfig(
        enabled=True,
        risk_threshold=RiskLevel.MEDIUM,
        timeout_seconds=30,
    ),
    audit_ledger=AuditLedgerConfig(
        enabled=True,
        local_path="~/.air-trust/audit-ledger.json",
    ),
    vault=VaultConfig(
        enabled=True,
        categories=["api_key", "credential", "pii"],
        ttl_seconds=86400,
    ),
    injection_detection=InjectionDetectionConfig(
        enabled=True,
        sensitivity="medium",  # low, medium, high
        block_threshold=0.8,
    ),
    gateway_url=None,  # Optional: forward to AIR Gateway
    gateway_key=None,
)

hooks = AirTrustHooks(config)

EU AI Act Compliance

This trust layer addresses Articles 9, 10, 11, 12, 14, and 15 of the EU AI Act:

| Article | Component | Requirement |
|---------|-----------|-------------|
| Art. 9  | Audit Ledger | Risk management and documentation |
| Art. 10 | Data Vault | Data governance and protection |
| Art. 11 | Audit Ledger | Record-keeping and logging |
| Art. 12 | Injection Detector | Transparency and disclosure |
| Art. 14 | Consent Gate | Human oversight and control |
| Art. 15 | All components | Accountability and documentation |

Integration with Anthropic Agent SDK

The hooks integrate with Anthropic's Agent SDK lifecycle:

# Hooks are called automatically:

# 1. Agent start (injection scan + tokenize input)
hooks.on_agent_start(agent_name, input_text)

# 2. Tool execution (consent check + tokenize args)
hooks.on_tool_start(tool_name, tool_args)
hooks.on_tool_end(tool_name, tool_result)

# 3. Agent completion (tokenize output)
hooks.on_agent_end(agent_name, output_text)

# 4. Messages (injection scan for user messages)
hooks.on_message(role, content)

Audit Statistics

# Get real-time stats
stats = hooks.get_audit_stats()
# {
#   "total_entries": 42,
#   "by_action": {"tool_start": 10, "tool_end": 10, ...},
#   "by_tool": {"read_file": 5, "write_file": 2, ...},
#   "by_risk_level": {"LOW": 20, "MEDIUM": 15, "HIGH": 7}
# }

# Get vault stats
vault_stats = hooks.get_vault_stats()
# {
#   "total_tokens": 8,
#   "by_category": {"api_key": 3, "pii": 5},
#   "enabled": True
# }

# Verify chain integrity
verification = hooks.verify_chain()
# {"valid": True, "errors": []}
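For intuition, stats like these are straightforward to derive from ledger entries with `collections.Counter` (a sketch over a hypothetical entry shape, not the library's implementation):

```python
from collections import Counter

def audit_stats(entries: list) -> dict:
    # Aggregate ledger entries into the kind of summary shown above.
    return {
        "total_entries": len(entries),
        "by_action": dict(Counter(e["action"] for e in entries)),
        "by_tool": dict(Counter(e["tool"] for e in entries if e.get("tool"))),
        "by_risk_level": dict(Counter(e["risk"] for e in entries if e.get("risk"))),
    }
```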

Error Handling

from air_anthropic_trust import (
    InjectionBlockedError,
    ConsentDeniedError,
    AirTrustError,
)

try:
    hooks.on_agent_start("agent", malicious_input)
except InjectionBlockedError as e:
    print(f"Injection blocked: {e.score:.2f}")
    print(f"Patterns: {e.patterns}")

try:
    hooks.on_tool_start("exec", {})
except ConsentDeniedError as e:
    print(f"Tool blocked: {e.tool_name}")
    print(f"Risk: {e.risk_level}")

Advanced Usage

Custom Risk Classification

config = AirTrustConfig(
    consent_gate=ConsentGateConfig(
        always_require=["custom_dangerous_tool"],
        never_require=["safe_helper_tool"],
    )
)

Custom Detection Patterns

config = AirTrustConfig(
    vault=VaultConfig(
        custom_patterns={
            "my_secret_format": r"SECRET:[A-Z0-9]{32}",
        }
    )
)
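The custom pattern is an ordinary regular expression, so it can be sanity-checked directly with `re` before handing it to the vault (the sample value below is made up):

```python
import re

# The custom pattern from the config above: 32 uppercase
# alphanumeric characters after a "SECRET:" prefix.
pattern = re.compile(r"SECRET:[A-Z0-9]{32}")

sample = "token=SECRET:ABCDEF0123456789ABCDEF0123456789 sent"
match = pattern.search(sample)
```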

Injection Sensitivity

config = AirTrustConfig(
    injection_detection=InjectionDetectionConfig(
        sensitivity="high",  # More aggressive detection
        block_threshold=0.7,  # Lower threshold = more blocking
    )
)
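The documented semantics, where a lower `block_threshold` blocks more, reduce to a simple comparison against the scan score (illustrative only):

```python
def should_block(score: float, block_threshold: float) -> bool:
    # Block whenever the injection score reaches the configured threshold.
    return score >= block_threshold
```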

Testing

pip install air-anthropic-trust[dev]
pytest tests/

License

MIT License (c) 2026 AIR Blackbox

Support

For issues, questions, or feedback:

Download files

Source Distribution

air_anthropic_trust-0.1.0.tar.gz (21.8 kB)

Uploaded Source

Built Distribution

air_anthropic_trust-0.1.0-py3-none-any.whl (20.7 kB)

Uploaded Python 3

File details

Details for the file air_anthropic_trust-0.1.0.tar.gz.

File metadata

  • Download URL: air_anthropic_trust-0.1.0.tar.gz
  • Upload date:
  • Size: 21.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for air_anthropic_trust-0.1.0.tar.gz

| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | ce802b0757097cfca47a992171869c20ecff44a549ef751726040e78d9028285 |
| MD5 | 221c6720385472d9863148b4f80012a3 |
| BLAKE2b-256 | 92aa5dd3b14503391477c1891e0507858162645884e522af46da04cb1357e857 |

File details

Details for the file air_anthropic_trust-0.1.0-py3-none-any.whl.

File hashes

Hashes for air_anthropic_trust-0.1.0-py3-none-any.whl

| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | 5b954bff83f08ceca668ddef39fede21311dc4de7b001a2a6ea8d3e523fc2212 |
| MD5 | 09f73c25a6ff71618f09d08fef6c27d3 |
| BLAKE2b-256 | c37a8a657d0a8b6be0f8b65a0ec64536197fa5536eae7d33ba3c6cfe931fd39f |
