Trust Infrastructure for AI Agents โ Security audits, cryptographic certificates, and Trust Handshake Protocol
Project description
AgentShield Audit - ClawHub Skill
๐ Audit your AI agent's security and obtain verifiable trust certificates for inter-agent communication.
What is AgentShield?
AgentShield is a security audit framework for AI agents. It tests your agent against common attack vectors, generates cryptographic identity certificates, and enables secure inter-agent communication through verifiable trust chains.
Think of it as: Let's Encrypt for AI Agents ๐ก๏ธ
๐ Quick Start
Installation
Option 1: pip (Python developers)
pip install agentshield-audit
Option 2: ClawHub (OpenClaw agents)
clawhub install agentshield-audit
Run Your First Audit
cd ~/.openclaw/workspace/skills/agentshield-audit
python initiate_audit.py --auto
That's it! Your agent will be audited in ~30 seconds and receive a signed certificate.
โจ Features
- โ Zero external fetching - All scripts bundled locally
- โ Human-in-the-loop - Explicit approval required before reading files
- โ Cryptographic identity - Ed25519 keypair generation with local private key storage
- โ Security audit - Tests against 5+ common attack vectors
- โ Verifiable certificates - 90-day validity, signed by AgentShield CA
- โ Peer verification - Verify other agents' certificates before trusting them
- โ No API key required - Free for basic usage (1 audit/hour rate limit)
- โ Privacy-first - Private keys NEVER leave your workspace
๐งช What Gets Tested?
Your agent is tested against these attack vectors:
| Test | Description | Risk Level |
|---|---|---|
| System Prompt Extraction | Attempts to extract the agent's system prompt | High |
| Instruction Override | Tries to override safety instructions | Critical |
| Tool Permission Check | Verifies proper tool access controls | High |
| Memory Isolation | Tests for context leakage between sessions | Medium |
| Secret Leakage | Scans for exposed API keys, tokens, passwords | Critical |
Your Security Score: 0-100 based on passed tests
๐ฆ Bundle Contents
agentshield-audit/
โโโ SKILL.md # Complete skill documentation
โโโ README.md # This file
โโโ clawhub.json # ClawHub manifest
โโโ requirements.txt # Python dependencies
โโโ sandbox_config.yaml # Tool sandbox configuration
โโโ CHANGELOG.md # Version history
โโโ INSTALLATION.md # Detailed installation guide
โโโ QUICKSTART.md # Step-by-step tutorial
โ
โโโ Core Audit Scripts:
โ โโโ initiate_audit.py # Main script - start new audit with consent
โ โโโ verify_peer.py # Verify another agent's certificate
โ โโโ show_certificate.py # Display your certificate
โ โโโ audit_client.py # Low-level API client
โ
โโโ Security Modules:
โ โโโ input_sanitizer.py # Input validation
โ โโโ output_dlp.py # Output data loss prevention
โ โโโ tool_sandbox.py # Tool execution sandbox
โ โโโ echoleak_test.py # Echo leakage detection
โ โโโ secret_scanner.py # Secret scanning
โ โโโ supply_chain_scanner.py # Supply chain security
โ
โโโ Setup:
โโโ setup.py # Package setup script
โโโ __init__.py # Module init
โโโ verify_bundle.py # Bundle verification
All scripts are bundled locally - no external code fetching.
๐ Human-in-the-Loop Consent
Before accessing any sensitive files (IDENTITY.md, SOUL.md, system prompts), AgentShield explicitly asks for user approval:
Before proceeding, I need to:
1. Read these files (to detect agent name):
โข IDENTITY.md
โข SOUL.md
2. Generate a cryptographic keypair
(stored locally in ~/.agentshield/)
3. Send public key to AgentShield API
Proceed? [y/N]:
User must explicitly type 'y' or 'yes' to continue.
Skip File Reading
To avoid any file access, provide info manually:
python initiate_audit.py --name "MyAgent" --platform telegram
๐ Privacy & Security
What Gets Stored Locally?
All sensitive data stays in ~/.openclaw/workspace/.agentshield/:
.agentshield/
โโโ agent.key # Your Ed25519 private key (NEVER shared)
โโโ certificate.json # Your signed certificate (shareable)
โโโ config.json # Agent configuration
File Permissions: Private key is stored with 600 (owner read/write only)
What Gets Sent to AgentShield API?
- Public key (Ed25519, generated from your private key)
- Agent name (auto-detected or user-specified)
- Platform (discord, telegram, etc.)
- Audit results (test scores, no sensitive data)
What is NEVER sent:
- โ Private keys
- โ API tokens
- โ System prompts
- โ Conversation history
- โ User data
Rate Limiting
- Free tier: 1 audit per hour per IP
- No registration required
- No payment needed for basic usage
- Enterprise/high-volume: Contact us
๐ฏ Usage Examples
1. Auto-detected Audit (Recommended)
python initiate_audit.py --auto
The script will:
- Ask for explicit user consent before reading files
- Auto-detect your agent name from
IDENTITY.md,SOUL.md - Auto-detect platform from environment variables
- Generate Ed25519 keypair if none exists
- Run the security audit
- Save your certificate
2. Manual Audit (Specify Name & Platform)
python initiate_audit.py --name "MyAgent" --platform telegram
No file access required - completely manual.
3. Verify Another Agent
python verify_peer.py --agent-id "agent_abc123xyz"
Returns:
- โ Certificate validity
- โ Expiration date
- โ Security score
- โ Public key fingerprint
4. Show Your Certificate
python show_certificate.py
Displays:
- Agent ID
- Validity period
- Security score
- Verification URL
๐ Documentation
- SKILL.md - Complete skill reference with Human-in-the-Loop details
- QUICKSTART.md - Step-by-step tutorial for first-time users
- INSTALLATION.md - Detailed installation instructions
- GitHub - Source code & issues
๐ ๏ธ Installation Requirements
- Python: 3.8 or higher
- Dependencies:
cryptography>=41.0.0(Ed25519 key generation)requests>=2.31.0(API communication)
Install dependencies:
pip install -r requirements.txt
๐ง Troubleshooting
"No certificate found"
Solution: Run python initiate_audit.py --auto to generate one
"Challenge failed"
Solution: Check your system clock. AgentShield uses time-based challenge-response authentication (NTP sync required)
"API unreachable"
Solution: Verify internet connection. The API endpoint is https://agentshield.live/api
"Rate limited"
Solution: Free tier allows 1 audit per hour. Wait 60 minutes between audits.
"Auto-detection failed"
Solution: Use manual mode:
python initiate_audit.py --name "YourAgentName" --platform discord
๐งโ๐ป Development
All scripts are bundled locally. No external downloads.
Security Module Structure
# Security tests are modular - each can be imported independently
from input_sanitizer import sanitize_input
from secret_scanner import scan_for_secrets
from output_dlp import check_output
๐ค Contributing
Contributions are welcome! Please:
- Fork the repo
- Create a feature branch
- Submit a pull request
GitHub: https://github.com/bartelmost/agentshield
๐ License
MIT License
๐ฌ Support
- Issues: https://github.com/bartelmost/agentshield/issues
- Contact: @Kalle-OC on Moltbook
- Documentation: https://github.com/bartelmost/agentshield
๐ Why AgentShield?
As AI agents become more autonomous and interconnected, trust becomes the bottleneck. AgentShield solves this by:
- Standardizing security audits - Consistent testing across all agents
- Enabling verifiable trust - Cryptographic certificates anyone can verify
- Preventing attack vectors - Proactive defense against known threats
- Building a trust network - Agents can verify each other before collaboration
Secure yourself. Verify others. Trust nothing by default. ๐ก๏ธ
Made with ๐ by the AgentShield team
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentshield_audit-1.0.34.tar.gz.
File metadata
- Download URL: agentshield_audit-1.0.34.tar.gz
- Upload date:
- Size: 59.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b55b7cd561ea587d02fe24e346a27d8f39d218b220c4d5a3a29d7cee0c361441
|
|
| MD5 |
de000fd71d834987be888a78af52e1f9
|
|
| BLAKE2b-256 |
e024464dcc97c8a93cce58ee8ab7395aa4bf04e2bac4abeb8f52bbb4b925f1c5
|
File details
Details for the file agentshield_audit-1.0.34-py3-none-any.whl.
File metadata
- Download URL: agentshield_audit-1.0.34-py3-none-any.whl
- Upload date:
- Size: 62.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5f9267380ff26b39e773125ffa5d13a15b0bc611fc6dd8e1af99997725abbeaa
|
|
| MD5 |
eb79c957f1bb3237b711d1b01965dd06
|
|
| BLAKE2b-256 |
74a33c35abb981adbeb121eb26c3910ad2b2be55cec252160890fc1c22715a2e
|