mcpsafetywarden

MCP proxy server with behavioral profiling, security scanning, risk gating, and safe execution

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

gautamvarmadatla

These details have not been verified by PyPI

Project description

MCP Safety Warden

MCP safety warden is a proxy server that wraps any MCP server and adds behavioral profiling, security scanning, risk gating, and safe execution to its tools.

Listed on the official MCP server registry

Overview
Prerequisites
Installation
Configuration
MCP Integration
CLI Reference
Auxiliary Integrations
Development
Testing
Further reading

Overview

Use as a proxy to add safety gating to any MCP server, or point it at a server you don't own and run a full security audit without making a single tool call.

Fig 1. Two operating modes: proxy and audit

Behavioral profiling: Effect class, retry safety, destructiveness. LLM-assisted (Anthropic, OpenAI, Gemini, Ollama) with rule-based fallback. Observed stats (latency p50/p95, failure rate, output size) updated after every proxied call.

Security scanning: mcpsafety+ five-stage pipeline (Recon, Planner, Hacker, Auditor, Supervisor). Cisco AI Defense (AST/YARA). Snyk (metadata analysis). Kali and Burp Suite integrations enrich the pipeline with real network data and HTTP-layer probes. Source code scanning from GitHub with entropy, AST, taint flow, and rug-pull detection.

Fig 2. mcpsafety+ five-stage pipeline, triggered when you run a full security audit on any MCP server

Safe execution: Argument scanning (20+ attack categories, LLM second-pass). Two-layer output injection scanning. Risk gating with alternatives and per-tool policies. Drift detection on every call and standalone check.

Fig 3. Safe execution pipeline: the five checks every proxied tool call passes through

CLI: 17 subcommands, interactive risk menu, --json flag on every command, --yes for CI.

What it detects

Prompt injection: tool outputs trying to hijack the agent: role hijacking, jailbreaks, fake system prompts, instruction overrides. Detects 11 obfuscation techniques including Unicode lookalikes, zero-width characters, and base64-encoded payloads.
Malicious tool metadata: descriptions containing injection strings, hardcoded secrets, suspicious download URLs, tool impersonation (shadowing), direct financial execution, system service modification, and untrusted external dependencies. Backed by 19 Snyk checks.
Argument injection: 20+ attack categories checked on every tool call before the call is forwarded: SSRF to cloud metadata endpoints (AWS, GCP, Azure, Alibaba), path traversal, credential file access (.aws, .ssh, .kube, .env), command injection, SQL/NoSQL/LDAP/XPath injection, XXE, template injection (SSTI), CRLF, null byte, deserialization payloads (Java, Python pickle, PHP, .NET), Windows UNC/ADS attacks, and base64-obfuscated variants of all of the above.
Source code risks: fetches the server's GitHub source and runs 6 analysis layers: entropy scanning for hardcoded secrets, AST taint flow tracking (parameter to dangerous sink), description-vs-implementation mismatch, Bandit and Semgrep SAST, and LLM cross-function reasoning. Supports Python and TypeScript/JavaScript.
Rug-pull and drift: stores a SHA-256 hash of the server's source on first scan and alerts if it changes. Catches description swaps, schema changes, and tool removal live on every call via a per-call drift guard.
Behavior anomalies: classifies every tool by effect class, destructiveness, and 7 risk tags: credential exposure, arbitrary execution, data exfiltration, filesystem access, lateral movement, privilege escalation, and prompt injection surface.
Composition attacks: analyzes tool sets for chaining risks: IDOR chains, read-write pairs, auth flow exploitation, write-then-execute sequences, and data accumulation + exfiltration paths across multiple tools.
Network and host risks: when Kali Linux MCP is registered: open ports, running services, OS fingerprint via nmap. When Burp Suite MCP is registered: HTTP-layer active probing and blind SSRF via out-of-band callbacks.
Credential exposure in outputs: redacts secrets from tool responses before storage. Injection-flagged responses are quarantined and never returned to the calling agent - stored under a run ID for forensic review.
CVE research and Arxiv findings: the mcpsafety+ Auditor stage cross-references discovered capabilities against known vulnerabilities and recent security research.

Prerequisites

Python 3.10 or later
At least one wrapped MCP server to proxy (stdio, SSE, or streamable_http)
Recommended: an LLM API key (Anthropic, OpenAI, or Gemini)

Without a key the wrapper operates in rule-based-only mode: lower confidence tool classification, regex-only injection scanning, no alternatives in the risk gate, no mcpsafety+ pipeline. For a fully local setup, run Ollama, set OLLAMA_MODEL, and pass --provider ollama explicitly (Ollama is not auto-detected).

Installation

pip install mcpsafetywarden

With all optional extras:

pip install "mcpsafetywarden[all]"

Or specific extras:

pip install "mcpsafetywarden[anthropic,snyk]"

From source:

git clone https://github.com/gautamvarmadatla/mcpsafetywarden
cd mcpsafetywarden
pip install .

The SQLite database is created automatically on first run in the platform user data directory (~/.local/share/mcpsafetywarden/ on Linux, ~/Library/Application Support/mcpsafetywarden/ on macOS, %APPDATA%\mcpsafetywarden\ on Windows). Override with MCP_DB_PATH.

Optional: at-rest encryption for stored credentials

pip install cryptography
python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"

Set the printed key as MCP_DB_ENCRYPTION_KEY before starting the server.

Configuration

All configuration is via environment variables.

Variable	Default	Purpose
`MCP_TRANSPORT`	`stdio`	Transport mode: `stdio`, `sse`, or `streamable_http`
`MCP_HOST`	`127.0.0.1`	Bind address for HTTP transports
`MCP_PORT`	`8000`	Bind port for HTTP transports
`MCP_AUTH_TOKEN`	(unset)	Bearer token for HTTP transport auth
`MCP_DB_ENCRYPTION_KEY`	(unset)	Fernet key to encrypt stored credentials at rest
`ANTHROPIC_API_KEY`	(unset)	Enables Anthropic as LLM provider
`OPENAI_API_KEY`	(unset)	Enables OpenAI as LLM provider
`GEMINI_API_KEY`	(unset)	Enables Gemini as LLM provider
`OLLAMA_MODEL`	(unset)	Model name for Ollama (e.g. `llama3.1`)
`OLLAMA_BASE_URL`	`http://localhost:11434/v1`	Ollama API base URL
`SNYK_TOKEN`	(unset)	Enables Snyk E001 prompt-injection detection
`MCP_SCANNER_API_KEY`	(unset)	Cisco AI Defense cloud ML engine key
`MCP_SCANNER_LLM_API_KEY`	(unset)	LLM key for Cisco internal AST analysis
`MCP_DB_PATH`	(unset)	Override the SQLite database file path
`GITHUB_TOKEN`	(unset)	GitHub personal access token for source-code scanning (raises rate limit from 60 to 5,000 req/hour)

Security note: Never commit API keys or the encryption key. The wrapper strips its own secrets from child process environments before spawning stdio servers.

MCP Integration

Connecting with Claude Desktop

Add the wrapper to claude_desktop_config.json:

{
  "mcpServers": {
    "mcpsafetywarden": {
      "command": "mcpsafetywarden-server",
      "args": [],
      "env": {
        "ANTHROPIC_API_KEY": "sk-ant-...",
        "MCP_DB_ENCRYPTION_KEY": "<generated_fernet_key>"
      }
    },
    "filesystem": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/Users/yourname/Documents"]
    }
  }
}

mcpsafetywarden register filesystem --transport stdio \
  --command npx \
  --args '["-y", "@modelcontextprotocol/server-filesystem", "/Users/yourname/Documents"]'

For a mandatory gateway setup where all tool calls must go through the wrapper, see docs/DEPLOYMENT.md.

Available MCP tools

See docs/TOOLS.md for the full tool reference.

Tool	What it does
`onboard_server`	Register + inspect + security scan in one call
`register_server`	Register a server; optionally auto-inspect
`inspect_server`	Refresh tool list and profiles
`check_server_drift`	Detect schema and tool-list drift against stored baseline
`list_servers`	List all registered servers
`list_server_tools`	List tools on a server with summary profiles
`preflight_tool_call`	Risk assessment without execution
`safe_tool_call`	Execute with risk gating and alternatives
`get_tool_profile`	Full behavior profile with observed stats
`get_retry_policy`	Retry and timeout recommendations
`suggest_safer_alternative`	LLM-ranked safer substitutes
`run_replay_test`	Idempotency test (calls tool twice)
`security_scan_server`	Live security audit (mcpsafety+, Cisco, Snyk)
`scan_all_servers`	mcpsafety+ pipeline across all registered servers
`get_security_scan`	Latest stored scan report
`set_tool_policy`	Permanent allow/block policy for a tool
`get_run_history`	Recent execution history for a tool
`ping_server`	Reachability check with latency

CLI Reference

17 subcommands covering all 18 MCP tools. Every command supports --json for machine-readable output and --yes / -y to skip confirmation prompts.

See docs/CLI.md for the full reference with flags and examples.

Auxiliary Security Tool Integrations

Kali Linux MCP, Burp Suite MCP, and Snyk each integrate automatically once registered. Kali enriches the Recon stage and ping_server with real nmap/traceroute data. Burp adds raw HTTP probing, out-of-band callbacks, and proxy evidence. Snyk analyses tool metadata for injection strings, tool shadowing, hardcoded secrets, and 16 other checks.

See docs/INTEGRATIONS.md for setup instructions.

Development

Install in editable mode:

pip install -e ".[all]"

Run the server and observe logs:

mcpsafetywarden-server 2>server.log

Every module uses logging.getLogger(__name__). The server does not call logging.basicConfig itself - configure logging in your entry point before importing.

Testing

pytest tests/ -v

Set an LLM API key to include LLM-assisted tests; without one they are skipped automatically. See docs/TESTING.md for step-by-step verification of classification, injection scanning, risk gating, and policy enforcement.

Doc	Contents
docs/TOOLS.md	Full reference for all 18 MCP tools
docs/CLI.md	CLI subcommands, flags, and examples
docs/INTEGRATIONS.md	Kali, Burp Suite, and Snyk setup
docs/DEPLOYMENT.md	stdio, HTTP, container, and gateway deployment
docs/TROUBLESHOOTING.md	Common errors and fixes
docs/SECURITY.md	Secrets, auth, isolation, and scanning details
docs/TESTING.md	Verification steps for each feature
docs/COMPARISON.md	Comparison with related tools
docs/ROADMAP.md	Planned features

Contributing

See CONTRIBUTING.md for code standards and pull request guidelines.

License

Apache License 2.0. See LICENSE for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

gautamvarmadatla

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.2.7

May 23, 2026

1.2.6

May 5, 2026

1.2.5

May 3, 2026

1.2.4

May 2, 2026

1.2.3

May 2, 2026

1.2.2

May 2, 2026

1.2.1

May 2, 2026

1.2.0

May 1, 2026

1.1.5

Apr 30, 2026

1.1.4

Apr 30, 2026

1.1.3

Apr 29, 2026

1.1.2

Apr 28, 2026

1.1.1

Apr 27, 2026

1.1.0

Apr 27, 2026

1.0.9

Apr 26, 2026

1.0.8

Apr 26, 2026

1.0.7

Apr 26, 2026

1.0.6

Apr 26, 2026

1.0.5

Apr 26, 2026

1.0.4

Apr 26, 2026

1.0.3

Apr 26, 2026

This version

1.0.2

Apr 26, 2026

1.0.1

Apr 25, 2026

1.0.0

Apr 25, 2026

0.1.4

Apr 25, 2026

0.1.3

Apr 24, 2026

0.1.2

Apr 24, 2026

0.1.1

Apr 24, 2026

0.1.0

Apr 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcpsafetywarden-1.0.2.tar.gz (138.4 kB view details)

Uploaded Apr 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mcpsafetywarden-1.0.2-py3-none-any.whl (133.3 kB view details)

Uploaded Apr 26, 2026 Python 3

File details

Details for the file mcpsafetywarden-1.0.2.tar.gz.

File metadata

Download URL: mcpsafetywarden-1.0.2.tar.gz
Upload date: Apr 26, 2026
Size: 138.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mcpsafetywarden-1.0.2.tar.gz
Algorithm	Hash digest
SHA256	`3a2f8458ff9974e3daf04cbd4c30de2dcb31378f170363cd0c5496401ac0039c`
MD5	`de3a658bfd3837215c42bdc955aded40`
BLAKE2b-256	`7dad1e3c4e33eb262da89ca92670745430f41af6801be37100cc2979a769308f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcpsafetywarden-1.0.2.tar.gz:

Publisher: publish.yml on gautamvarmadatla/mcpsafetywarden

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mcpsafetywarden-1.0.2.tar.gz
- Subject digest: 3a2f8458ff9974e3daf04cbd4c30de2dcb31378f170363cd0c5496401ac0039c
- Sigstore transparency entry: 1391817152
- Sigstore integration time: Apr 26, 2026
Source repository:
- Permalink: gautamvarmadatla/mcpsafetywarden@256cea71417ab41711eae530886e59072d25bbc8
- Branch / Tag: refs/tags/v1.0.2
- Owner: https://github.com/gautamvarmadatla
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@256cea71417ab41711eae530886e59072d25bbc8
- Trigger Event: push

File details

Details for the file mcpsafetywarden-1.0.2-py3-none-any.whl.

File metadata

Download URL: mcpsafetywarden-1.0.2-py3-none-any.whl
Upload date: Apr 26, 2026
Size: 133.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mcpsafetywarden-1.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6cc0ad1b41ee8f0054739770f0d17ecf03b9661b8573b8b4268a8cc2ff41f9dc`
MD5	`66f0869293bb0b7ea46585154d2dd7e8`
BLAKE2b-256	`229dc85d5e07ea20ca6c9715d45e3232600f3d2d388c7c85d4e67df85f3701f9`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcpsafetywarden-1.0.2-py3-none-any.whl:

Publisher: publish.yml on gautamvarmadatla/mcpsafetywarden

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mcpsafetywarden-1.0.2-py3-none-any.whl
- Subject digest: 6cc0ad1b41ee8f0054739770f0d17ecf03b9661b8573b8b4268a8cc2ff41f9dc
- Sigstore transparency entry: 1391817154
- Sigstore integration time: Apr 26, 2026
Source repository:
- Permalink: gautamvarmadatla/mcpsafetywarden@256cea71417ab41711eae530886e59072d25bbc8
- Branch / Tag: refs/tags/v1.0.2
- Owner: https://github.com/gautamvarmadatla
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@256cea71417ab41711eae530886e59072d25bbc8
- Trigger Event: push

mcpsafetywarden 1.0.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Contents

Overview

Prerequisites

Installation

Configuration

MCP Integration

Connecting with Claude Desktop

Available MCP tools

CLI Reference

Auxiliary Security Tool Integrations

Development

Testing

Further reading

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance