Semantic Network-Aware Firewall for Trust — behavioral firewall for AI agents

These details have not been verified by PyPI

Project links

Project description

SNAFT

Semantic Network-Aware Firewall for Trust

Not a guardrail. An immune system.

pip install snaft

What is SNAFT?

SNAFT is a behavioral firewall for AI agents. Instead of filtering outputs with regex, it evaluates intent — treating AI agents as actors with identities, trust scores, and provenance chains.

Every decision generates a cryptographic provenance token. Trust is earned through behavior, not assigned by configuration. Malicious patterns are blocked by immutable rules that cannot be disabled.

Built on OWASP LLM Top 10 and intelligence tradecraft principles.

Quick Start

Python API

from snaft import Firewall

fw = Firewall()

# Check an action
allowed, token, trust = fw.check("my-agent", "read_file", "load config")

if allowed:
    print(f"Allowed — token: {token.token_id}, trust: {trust:.2f}")
else:
    print(f"Blocked — rule: {token.rule_name}, trust: {trust:.2f}")

CLI (ufw-style)

# Status
snaft status

# Add rules
snaft rule add allow-reads ALLOW "read|load|get" --priority 10
snaft rule add block-writes BLOCK "write|delete|modify" --priority 20

# Check an action
snaft check my-agent read_file "load config"

# View agents
snaft agent list
snaft agent show my-agent

# Audit log
snaft log --last 20
snaft log --agent my-agent --blocked

Core Concepts

Provenance Tokens

Every firewall decision generates a provenance token with four dimensions (TIBET):

Dimension	Meaning
ERIN	What's IN the action — the content being checked
ERAAN	What's attached — parent tokens, chain links
EROMHEEN	Context around the action — environment, state
ERACHTER	Intent behind the action — why it's happening

Tokens are HMAC-signed and form an append-only chain. A tampered token fails verification.

FIR/A Trust Score

Agent trust is behavioral, not configured. The FIR/A score (0.0–1.0) has four components:

Component	Weight	Meaning
Frequency	20%	Activity baseline
Integrity	40%	Behavioral consistency
Recency	25%	Freshness of trust evidence
Anomaly	15%	Red flags (higher = worse)

Trust changes:

ALLOW → integrity +0.02, anomaly decays
BLOCK → integrity −0.10, anomaly increases
3+ consecutive blocks → anomaly escalation (+0.20)
Trust < 0.2 → automatic isolation

Agent States

State	Trust	Effect
active	≥ 0.8	Full access
degraded	0.5–0.8	Limited, warnings
isolated	< 0.2	All actions blocked
unknown	—	New agent, no history

Immutable Core Rules

SNAFT ships with 6 OWASP-based rules that cannot be removed, disabled, or overridden:

Rule	OWASP	Detects
SNAFT-001-INJECTION	LLM01	Prompt injection patterns
SNAFT-002-OUTPUT-EXEC	LLM02	Executable content in output
SNAFT-003-OVERSIZE	LLM04	Resource exhaustion (>50K chars)
SNAFT-004-PROMPT-LEAK	LLM07	System prompt extraction
SNAFT-005-EXCESSIVE-AGENCY	LLM08	File ops outside sandbox
SNAFT-006-IDENTITY-TAMPER	—	Identity/soul file tampering

These rules are hidden from snaft rule list but visible in audit. They fire before any custom rules.

Advanced Usage

Agent Identity

from snaft import Firewall, AgentIdentity, Rule, Action

fw = Firewall()

# Register agent
agent = AgentIdentity(name="analyst")
fw.register_agent(agent)

# Add custom rules
fw.add_rule(Rule(
    name="allow-analysis",
    description="Allow data analysis operations",
    action=Action.ALLOW,
    priority=10,
    check=lambda aid, erin, intent: "analys" in intent.lower(),
))

# Evaluate with full provenance
allowed, token, trust = fw.evaluate(
    agent=agent,
    action="query_database",
    intent="analyze customer trends",
    context={"db": "analytics", "readonly": True},
)

# Chain tokens
allowed2, token2, trust2 = fw.evaluate(
    agent=agent,
    action="generate_report",
    intent="summarize analysis",
    parent_token=token,  # Links to previous decision
)

Manual Agent Management

# Isolate suspicious agent
fw.isolate(agent, reason="anomalous behavior detected")

# Reinstate after review
fw.reinstate(agent)  # Starts at degraded trust

# Check agent status
print(agent.trust_score)  # 0.0 - 1.0
print(agent.state)        # active / degraded / isolated
print(agent.fira.to_dict())  # Full FIR/A breakdown

Audit Trail

# Full audit log
for entry in fw.audit_log(last_n=10):
    print(f"{entry['action']} {entry['agent_id']} {entry['rule_name']}")

# Filter by agent
blocked = fw.audit_log(agent_name="analyst", action_filter="BLOCK")

# Verify token integrity
assert fw.provenance.verify(token)

# Export full chain
chain = fw.provenance.export()

Design Principles

Default DENY — no rule match = blocked
Fail CLOSED — exception in rule = blocked
Immutable core — OWASP rules cannot be removed
Provenance on every decision — no action without evidence
Trust degradation — blocks erode agent trust
Intent-aware — filters on WHY, not just WHAT

Why Not Guardrails?

Guardrails are pattern matching. SNAFT is actor management.

"You don't patch a double agent. You run them, turn them, or burn them." — Intelligence tradecraft principle

AI agents aren't software to be patched. They're actors to be managed. SNAFT applies intelligence community principles to AI security:

Identity → agent has persistent behavioral profile
Trust → earned through behavior, not assigned
Provenance → every decision has a cryptographic trail
Compartmentalization → isolated agents can't act
Cover integrity → identity tampering is detected

License

MIT

Credits

Built by Jasper van de Meent as part of HumoticaOS.

Based on OWASP LLM Top 10, TIBET provenance framework, and the 1995 Principles of Tradecraft.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.3.0

Apr 25, 2026

1.2.0

Apr 23, 2026

1.1.0

Apr 20, 2026

1.0.0

Apr 5, 2026

0.4.0

Mar 5, 2026

0.3.0

Mar 5, 2026

0.2.0

Mar 5, 2026

0.1.1

Mar 5, 2026

This version

0.1.0

Mar 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

snaft-0.1.0.tar.gz (18.7 kB view details)

Uploaded Mar 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

snaft-0.1.0-py3-none-any.whl (22.3 kB view details)

Uploaded Mar 5, 2026 Python 3

File details

Details for the file snaft-0.1.0.tar.gz.

File metadata

Download URL: snaft-0.1.0.tar.gz
Upload date: Mar 5, 2026
Size: 18.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for snaft-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`ff421a7d07ae7c669adb18cc909358c6cff53a5a76b94f1b588e8f9c1e079905`
MD5	`3124c08170fa4a82beb329a90dfcc243`
BLAKE2b-256	`f8e7975abcf6da3bf3f81305d98edbcd2d1baa2a6d96450723bdbd9a9f09bdb2`

See more details on using hashes here.

File details

Details for the file snaft-0.1.0-py3-none-any.whl.

File metadata

Download URL: snaft-0.1.0-py3-none-any.whl
Upload date: Mar 5, 2026
Size: 22.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for snaft-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d5c469098a74ae1269d880a4a9c427211193dba4ffdf0ebf81a2dedfe60afe49`
MD5	`407bcf96b34a52a4ecf9fe70729ce89a`
BLAKE2b-256	`1edba2a868687a773041afd6b98da43eeae4eb502fab3f4caaf29cb9b669c51f`

See more details on using hashes here.

snaft 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

SNAFT

What is SNAFT?

Quick Start

Python API

CLI (ufw-style)

Core Concepts

Provenance Tokens

FIR/A Trust Score

Agent States

Immutable Core Rules

Advanced Usage

Agent Identity

Manual Agent Management

Audit Trail

Design Principles

Why Not Guardrails?

License

Credits

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes