Open-source permission control plane for AI agents. Scan, enforce, and audit every tool call.

These details have not been verified by PyPI

Project links

Project description

AgentWard Architecture

AgentWard

Open-source permission control plane for AI agents.
Scan, enforce, and audit every tool call.

AgentWard Demo

Telling an agent "don't touch the stove" is a natural-language guardrail that can be circumvented. AgentWard puts a physical lock on the stove — code-level enforcement that prompt injection can't override.

AgentWard sits between AI agents and their tools (MCP servers, HTTP gateways, function calls) to enforce least-privilege policies, inspect data flows at runtime, and generate compliance audit trails. Policies are enforced in code, outside the LLM context window — the model never sees them, can't override them, can't be tricked into ignoring them.

Why AgentWard?

AI agents now have access to your email, calendar, filesystem, shell, databases, and APIs. The tools exist to give agents these capabilities. But nothing exists to control what they do with them.

What exists today	What it does	What it doesn't do
Static scanners (mcp-scan, Cisco Skill Scanner)	Scan tool definitions, report risks	No runtime enforcement. Scan and walk away.
Guardrails frameworks (NeMo, Guardrails AI)	Filter LLM inputs/outputs	Don't touch tool calls. An agent can still `rm -rf /`.
Prompt-based rules (SecureClaw)	Inject safety instructions into agent context	Vulnerable to prompt injection. The LLM can be tricked into ignoring them.
IAM / OAuth	Control who can access what	Control humans, not agents. An agent with your OAuth token has your full permissions.

The gap: No tool-level permission enforcement that actually runs in code, outside the LLM, at the point of every tool call. Scanners find problems but don't fix them. Guardrails protect the model but not the tools. Prompt rules are suggestions, not enforcement.

AgentWard fills this gap. It's a proxy that sits between agents and tools, evaluating every tools/call against a declarative policy — in code, at runtime, where prompt injection can't reach.

Quick Start

pip install agentward

1. Scan your tools

agentward scan

Auto-discovers MCP configs (Claude Desktop, Cursor, Windsurf, VS Code), Python tool definitions (OpenAI, LangChain, CrewAI), and ClawdBot/OpenClaw skills. Outputs a permission map with risk ratings and security recommendations.

 Server          Tool                 Risk    Data Access
 ─────────────── ──────────────────── ─────── ──────────────────
 filesystem      read_file            MEDIUM  File read
 filesystem      write_file           HIGH    File write
 github          create_issue         MEDIUM  GitHub API
 shell-executor  run_command          CRITICAL Shell execution

2. Generate a policy

agentward configure

Generates a smart-default agentward.yaml with security-aware rules based on what scan found — skill restrictions, approval gates, and chaining rules tailored to your setup.

# agentward.yaml (generated)
version: "1.0"
skills:
  filesystem:
    read_file: { action: allow }
    write_file: { action: approve }   # requires human approval
  shell-executor:
    run_command: { action: block }    # blocked entirely
require_approval:
  - send_email
  - delete_file

3. Wire it in

# MCP servers (Claude Desktop, Cursor, etc.)
agentward setup --policy agentward.yaml

# Or for ClawdBot gateway
agentward setup --gateway clawdbot

Rewrites your MCP configs so every tool call routes through the AgentWard proxy. For ClawdBot, swaps the gateway port so AgentWard sits as an HTTP reverse proxy.

4. Enforce at runtime

# MCP stdio proxy
agentward inspect --policy agentward.yaml -- npx @modelcontextprotocol/server-filesystem /tmp

# HTTP gateway proxy
agentward inspect --gateway clawdbot --policy agentward.yaml

Every tool call is now intercepted, evaluated against your policy, and either allowed, blocked, or flagged for approval. Full audit trail logged.

 [ALLOW]  filesystem.read_file        /tmp/notes.txt
 [BLOCK]  shell-executor.run_command   rm -rf /
 [APPROVE] gmail.send_email            → waiting for human approval

How It Works

AgentWard operates as a transparent proxy between agents and their tools:

Agent Host                    AgentWard                     Tool Server
(Claude, Cursor, etc.)        (Proxy + Policy Engine)       (MCP, Gateway)

    tools/call ──────────►  Intercept ──► Policy check
                              │                │
                              │    ALLOW ──────┼──────► Forward to server
                              │    BLOCK ──────┼──────► Return error
                              │    APPROVE ────┼──────► Wait for human
                              │                │
                              └── Audit log ◄──┘

Two proxy modes, same policy engine:

Mode	Transport	Intercepts	Use Case
Stdio	JSON-RPC 2.0 over stdio	`tools/call`	MCP servers (Claude Desktop, Cursor, Windsurf, VS Code)
HTTP	HTTP reverse proxy + WebSocket	`POST /tools-invoke`	ClawdBot gateway, HTTP-based tools

CLI Commands

Command	Description
`agentward scan`	Static analysis — discover tools, generate permission maps, risk ratings
`agentward configure`	Generate smart-default policy YAML from scan results
`agentward setup`	Wire proxy into MCP configs or gateway ports
`agentward inspect`	Start runtime proxy with live policy enforcement
`agentward comply`	Compliance evaluation against regulatory frameworks (coming soon)

Policy Actions

Action	Behavior
`allow`	Tool call forwarded transparently
`block`	Tool call rejected, error returned to agent
`approve`	Tool call held for human approval before forwarding
`log`	Tool call forwarded, but logged with extra detail
`redact`	Tool call forwarded with sensitive data stripped

What AgentWard Is NOT

Not a static scanner — Scanners like mcp-scan analyze and walk away. AgentWard scans and enforces at runtime.
Not a guardrails framework — NeMo Guardrails and Guardrails AI focus on LLM input/output. AgentWard controls the tool calls.
Not prompt-based enforcement — Injecting safety rules into the LLM context is vulnerable to prompt injection. AgentWard enforces policies in code, outside the context window.
Not an IAM system — AgentWard complements IAM. It controls what agents can do with the permissions they already have.

Supported Platforms

MCP Hosts (stdio proxy):

Claude Desktop
Claude Code
Cursor
Windsurf
VS Code Copilot
Any MCP-compatible client

HTTP Gateways:

ClawdBot (with WebSocket passthrough for UI)
Extensible to other HTTP-based tool gateways

Python Tool Scanning:

OpenAI SDK (@tool decorators)
LangChain (@tool, StructuredTool)
CrewAI (@tool)
Anthropic SDK

Development

# Clone and set up
git clone https://github.com/agentward-ai/agentward.git
cd agentward
python3 -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

# Run tests
pytest

# Lint
ruff check agentward/

Roadmap

MCP stdio proxy with policy enforcement
HTTP reverse proxy with WebSocket passthrough
Static scanner (MCP configs, Python tools, OpenClaw skills)
Smart-default policy generation
MCP config wrapping (agentward setup)
Audit logging (JSON Lines + rich stderr)
Skill chaining analysis and enforcement
Human-in-the-loop approval flow
Compliance frameworks (HIPAA, SOX, GDPR, PCI-DSS)
Data classifier (PII/PHI detection)
Data boundary enforcement
Skill Compliance Registry

License

Apache 2.0

agentward.ai · GitHub

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.4.0

Mar 20, 2026

0.3.2

Mar 4, 2026

0.3.1

Feb 27, 2026

0.3.0

Feb 26, 2026

0.2.6

Feb 24, 2026

0.2.5

Feb 23, 2026

0.2.4

Feb 23, 2026

0.2.3

Feb 22, 2026

0.2.2

Feb 21, 2026

0.2.1

Feb 21, 2026

This version

0.2.0

Feb 21, 2026

0.1.0

Feb 20, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentward-0.2.0.tar.gz (3.0 MB view details)

Uploaded Feb 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentward-0.2.0-py3-none-any.whl (93.4 kB view details)

Uploaded Feb 21, 2026 Python 3

File details

Details for the file agentward-0.2.0.tar.gz.

File metadata

Download URL: agentward-0.2.0.tar.gz
Upload date: Feb 21, 2026
Size: 3.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for agentward-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`859e2180f65d0df534afe7cd8aefa15f4801d7056237301c57c961a00527fd1d`
MD5	`933013ab263c47ceec4a69318be154dd`
BLAKE2b-256	`f6ec96701225d880afddf038f9e30c02d852d971e440343bd31a82c771a7dff6`

See more details on using hashes here.

File details

Details for the file agentward-0.2.0-py3-none-any.whl.

File metadata

Download URL: agentward-0.2.0-py3-none-any.whl
Upload date: Feb 21, 2026
Size: 93.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for agentward-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`06807ad1eb09d891f484188f85e658a8ec2e1f0cd4090d5c339890eb4ec63d43`
MD5	`743040cca3ce1e0130b940e1f0e35122`
BLAKE2b-256	`a37cb23d0d68f6edd7f9b0085fb5d66247a776ba2adb4daaae94b8b700a5723f`

See more details on using hashes here.

agentward 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AgentWard

Why AgentWard?

Quick Start

1. Scan your tools

2. Generate a policy

3. Wire it in

4. Enforce at runtime

How It Works

CLI Commands

Policy Actions

What AgentWard Is NOT

Supported Platforms

Development

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes