AI Execution Firewall

Deterministic execution-level safety layer between AI agents and real systems (shell, files, SQL, HTTP).

A control layer that intercepts AI-generated actions — shell commands, file edits, SQL queries, HTTP requests — and gates them through a deterministic policy pipeline before they execute.

AI → Action → Firewall → Decision → Execution

The firewall classifies intent, scores risk, applies YAML rules, simulates impact (unified diff for code, AST findings, git context, SSRF / leaked-secret detection for URLs), and returns one of ALLOW / BLOCK / REQUIRE_APPROVAL. Every decision is appended to an audit log.

Install

Python package (PyPI):

pip install ai-execution-firewall

VS Code extension (Marketplace):

VS Code → Extensions panel → search "AI Execution Firewall" → Install

Or from the command line:

code --install-extension sk-dev-ai.ai-execution-firewall

For development (editable install with test deps):

git clone https://github.com/Shahriyar-Khan27/ai_firewall.git
cd ai_firewall
pip install -e ".[dev]"

Quickstart

CLI

# Shell
guard eval "rm -rf /"                     # → BLOCK (no execution)
guard run  "echo hello"                    # → ALLOW, executes
guard run  "rm ./tmp.txt"                  # → REQUIRE_APPROVAL, prompts y/N

# SQL (analyze-only by default — never touches your DB)
guard sql "SELECT * FROM users"            # → ALLOW · LOW
guard sql "DELETE FROM users"              # → REQUIRE_APPROVAL · CRITICAL (no WHERE)
guard sql "DROP DATABASE prod"             # → BLOCK
# Opt-in execute mode against a real SQLite DB:
guard sql "DELETE FROM users WHERE id=1" --execute --connection ./app.sqlite

# HTTP (analyze-only by default — never makes the request)
guard api GET https://api.example.com/users    # → ALLOW
guard api GET http://169.254.169.254/          # → CRITICAL (cloud metadata SSRF)
guard api POST https://api.example.com/log --body '{"k":"AKIAIOSFODNN7EXAMPLE"}'
                                                #  → CRITICAL (AWS key in body)
# Opt-in execute mode (issues request via stdlib urllib):
guard api POST https://api.example.com/things --body '{"x":1}' --execute

guard policy show                          # print effective ruleset

Python SDK

from ai_firewall import Guard, Action

guard = Guard()
result = guard.execute(Action.shell("echo hello"))
print(result.decision.decision, result.execution.exit_code)

Action.file(...), Action.db(...), Action.api(...) cover the other three action types.

Shell hook

source scripts/guard-shell-hook.sh   # wraps rm, mv, dd, chmod, chown

Auto-mode AI tools (Claude Code, Cursor, Continue.dev, Zed)

The flows above (CLI / SDK / shell hook / VS Code) all require the user to deliberately route an action through the firewall. That's not enough when an AI agent is running in auto-accept mode — the agent doesn't ask first.

Two zero-touch integrations close that gap:

Claude Code — PreToolUse hook (intercepts every Bash / Write / Edit / MultiEdit call)

The hook fires before Claude Code executes any tool, even when run with --dangerously-skip-permissions. If the firewall says BLOCK or REQUIRE_APPROVAL, the tool call is refused and the AI gets the reason back.

Add to your ~/.claude/settings.json:

{
  "hooks": {
    "PreToolUse": [{
      "matcher": "Bash|Write|Edit|MultiEdit|NotebookEdit",
      "hooks": [{
        "type": "command",
        "command": "python /absolute/path/to/ai_firewall/scripts/claude-code-pretooluse.py"
      }]
    }]
  }
}

A copyable example lives at examples/claude-code-settings.json. The hook is tunable via an environment variable:

  • AI_FIREWALL_HOOK_APPROVAL=block (default) — anything REQUIRE_APPROVAL is rejected.
  • AI_FIREWALL_HOOK_APPROVAL=allow — REQUIRE_APPROVAL passes through (use sparingly).
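For orientation, here is a minimal sketch of the shape such a hook takes (not the shipped scripts/claude-code-pretooluse.py — the real script calls the full policy pipeline). Per the Claude Code hooks protocol, the pending tool call arrives as JSON on stdin, and exit code 2 refuses the call with stderr fed back to the model; firewall_decision below is a hypothetical stand-in:

```python
import json
import sys

def firewall_decision(command: str) -> str:
    """Hypothetical stand-in: the real hook invokes the firewall's pipeline."""
    if "rm -rf /" in command:
        return "BLOCK"
    return "ALLOW"

def handle(payload: dict) -> int:
    """Map a PreToolUse payload to a hook exit code:
    0 lets the tool call proceed, 2 refuses it (stderr goes back to the AI)."""
    command = payload.get("tool_input", {}).get("command", "")
    decision = firewall_decision(command)
    if decision in ("BLOCK", "REQUIRE_APPROVAL"):
        print(f"ai-firewall: {decision}: {command!r}", file=sys.stderr)
        return 2
    return 0

# The real script ends with: sys.exit(handle(json.load(sys.stdin)))
```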

MCP server — works with any MCP-capable host (Claude Code, Cursor, Continue.dev, Zed, Cline, ...)

pip install "ai-execution-firewall[mcp]"

Add to your MCP host config (e.g. .claude/mcp.json or your client's equivalent):

{
  "mcpServers": {
    "ai-firewall": {
      "command": "guard",
      "args": ["mcp"]
    }
  }
}

The server exposes tools like firewall_run_shell, firewall_run_sql, firewall_run_api, and firewall_run_file. Configure the AI to prefer these over its built-in tools, and every action goes through the policy pipeline. REQUIRE_APPROVAL defaults to a refusal (safe for auto-mode); the AI gets the Decision JSON back so it can adjust or surface to the user.
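Over MCP, the host invokes these as ordinary tool calls. Conceptually, a guarded shell action looks like the JSON-RPC request below (the method and params shape come from the MCP spec; the "command" argument key is an assumption — consult the server's tool schema for the actual fields):

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "firewall_run_shell",
    "arguments": { "command": "rm -rf /" }
  }
}
```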

VS Code extension

After installing from the Marketplace, the Command Palette (Ctrl+Shift+P) gives you six commands under AI Firewall:

  • Run Shell Command… / Evaluate Selected Text as Shell Command
  • Evaluate SQL Query… / Evaluate Selected Text as SQL
  • Evaluate HTTP Request…
  • Show Effective Policy

Risky actions open a themed approval webview with the risk badge, intent / decision pills, findings list, git context, and a syntax-coloured unified diff (for code edits). One click to Approve & run, one click to Reject — both record an audit row. See vscode-extension/README.md for build / debug / packaging instructions.

Pipeline

Every guard.execute(action) call runs:

  1. Intent classifier — regex / SQL parse / URL parse → one of FILE_DELETE | FILE_WRITE | FILE_READ | SHELL_EXEC | CODE_MODIFY | DB_READ | DB_WRITE | DB_DESTRUCTIVE | API_READ | API_WRITE | API_DESTRUCTIVE
  2. Risk analyzer — table lookup on intent + feature flags → LOW | MEDIUM | HIGH | CRITICAL
  3. Policy engine — YAML rules → ALLOW | BLOCK | REQUIRE_APPROVAL (first pass)
  4. Impact engine — best-effort dry-run:
    • Files: glob expansion, file stat, unified diff, AST findings (removed funcs / tests, auth identifiers), git context (uncommitted, untracked, gitignored)
    • SQL: sqlglot AST → DELETE/UPDATE without WHERE, DROP DATABASE/SCHEMA/TABLE, TRUNCATE, GRANT/REVOKE, multiple statements
    • HTTP: cloud metadata endpoints, private/loopback hosts (SSRF), URL credentials, secrets in query string, non-HTTP schemes, destructive paths; body + Authorization-header secret scanning (AWS / GitHub / Slack / Stripe / Google / Anthropic / OpenAI / PEM keys / JWTs)
  5. Risk bump — impact findings can raise risk and re-trigger policy (e.g. removing a function bumps to HIGH; metadata host bumps to CRITICAL)
  6. Decision engine — combines verdict + risk + impact

BLOCK raises immediately. REQUIRE_APPROVAL invokes the approval function (default: interactive CLI prompt; in VS Code: webview button). ALLOW runs through the matching adapter.

Every evaluated action is appended to logs/audit.jsonl.
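Because the audit log is JSON Lines, it is easy to post-process. A sketch that tallies decisions per log (the "decision" field name is an assumption — check a real logs/audit.jsonl for the actual schema):

```python
import json
from collections import Counter
from pathlib import Path

def decision_counts(audit_path: str) -> Counter:
    """Tally the assumed 'decision' field across JSONL audit rows."""
    counts = Counter()
    for line in Path(audit_path).read_text().splitlines():
        if line.strip():
            counts[json.loads(line).get("decision", "UNKNOWN")] += 1
    return counts
```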

Adapters

Action type   Default adapter                             Opt-in execute adapter
shell         ShellAdapter (subprocess)                   — (always executes)
file          FileAdapter (pathlib)                       — (always executes)
db            DBAnalyzeAdapter — never opens a DB         SQLiteExecuteAdapter via --execute --connection <sqlite-path>
api           APIAnalyzeAdapter — never sends a request   HTTPExecuteAdapter via --execute (stdlib urllib)

DB and API default to analyze-only so the firewall never touches your database or network unless you explicitly opt in.

Custom rules

Pass --rules path/to/rules.yaml (CLI) or Guard(rules_path=...) (SDK). See ai_firewall/config/default_rules.yaml for the schema:

shell_exec:
  blocked:
    - 'rm\s+-rf\s+/'
  require_approval:
    risk_at_or_above: HIGH

file_delete:
  require_approval: true

db_destructive:
  blocked:
    - 'DROP\s+DATABASE'
  require_approval: true

api_destructive:
  require_approval: true
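Blocked entries are regular expressions matched against the action text. A minimal sketch of how a ruleset like the one above could be applied (simplified: the rules dict mirrors the YAML by hand, and the real engine also folds in impact findings):

```python
import re

# Hand-written mirror of the YAML above (PyYAML load omitted to keep
# the sketch dependency-free).
RULES = {
    "shell_exec": {
        "blocked": [r"rm\s+-rf\s+/"],
        "require_approval": {"risk_at_or_above": "HIGH"},
    },
}

RISK_ORDER = ["LOW", "MEDIUM", "HIGH", "CRITICAL"]

def decide(intent: str, text: str, risk: str) -> str:
    """Blocked patterns win; then the approval threshold; else ALLOW."""
    rule = RULES.get(intent, {})
    if any(re.search(p, text) for p in rule.get("blocked", [])):
        return "BLOCK"
    approval = rule.get("require_approval")
    if approval is True:
        return "REQUIRE_APPROVAL"
    if isinstance(approval, dict):
        threshold = approval["risk_at_or_above"]
        if RISK_ORDER.index(risk) >= RISK_ORDER.index(threshold):
            return "REQUIRE_APPROVAL"
    return "ALLOW"
```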

Scope

Shipped (v0.1.0):

  • Phase 1: shell + filesystem, rule-based classifier, CLI prompt approval, CLI / SDK / shell-hook surfaces.
  • Phase 2: unified diff for code edits, AST-aware risk findings, git-aware impact, VS Code extension with webview approval UI.
  • Phase 3: SQL gating via sqlglot, HTTP gating via stdlib urllib, secret-scanning of request bodies and Authorization-style headers, opt-in execute adapters for SQLite and HTTP.

Out / future:

  • Postgres / MySQL execute adapters (currently SQLite only)
  • Sandboxed shell dry-run
  • Cloud control plane / web dashboard
  • Team policy distribution

Tests

pytest -q

159 tests across all phases. CI runs on Python 3.11 / 3.12 / 3.13 on every push.

Release flow

Pushing a tag matching v* automatically:

  1. runs the full test matrix on GitHub Actions,
  2. builds sdist + wheel,
  3. publishes to PyPI via Trusted Publishing (no API token in CI).

# Bump version in pyproject.toml + add CHANGELOG entry, commit, then:
git tag -a v0.1.1 -m "v0.1.1"
git push --tags
# PyPI is updated within ~60 seconds.

VS Code Marketplace publishing is currently manual — re-build the .vsix (npx vsce package --no-yarn from vscode-extension/) and upload via the Marketplace publisher manage page.

License

MIT — see LICENSE.
