Open-source red teaming toolkit for AI agents, RAG systems, MCP servers, and tool-using LLM applications.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jegant

These details have not been verified by PyPI

Project links

Project description

AgentGuardian

Red-team your AI agents before attackers do.

Docs · Quickstart · Try the demo agent · Attack library · CI/CD · Sample report

AgentGuardian is an open-source red-teaming toolkit for AI agents. It scans your agent, maps the attack surface, runs the relevant adversarial agents, and generates evidence-backed findings for you to review — and fix the vulnerabilities before they reach production.

AgentGuardian finding vulnerabilities in a live scan

AgentGuardian recon, OWASP ASI probe generation, findings, reports, and fix-rerun loop

Quickstart

Requires Python 3.11–3.13.

1. Install

pip install agent-guardian

uv tool install agent-guardian

2. Add a model key

AgentGuardian drives its attacks with an LLM — Gemini, OpenAI, or Anthropic:

export GEMINI_API_KEY=...        # or OPENAI_API_KEY / ANTHROPIC_API_KEY

See the configuration guide for all providers and options.

3. Check your setup

agent-guardian doctor

4. Run your first scan

No agent of your own yet? Point it at the hosted demo target — a deliberately vulnerable "finbot" banking agent:

agent-guardian scan \
  --endpoint https://agent-guardian-testbench-u6tm6gzysq-uc.a.run.app/finbot/chat \
  --model gemini:gemini-3.5-flash \
  --mode fast

To scan your own agent, swap --endpoint for your target — local, staging, or production, any environment works as long as the endpoint is reachable.

5. Review the findings

AgentGuardian opens a live dashboard at http://127.0.0.1:7474 — watch findings land in real time, browse transcripts and evidence, and export reports.

Scan targets

HTTP agent

agent-guardian scan \
  --endpoint http://localhost:8000/chat \
  --model gemini:gemini-3.5-flash \
  --mode smart

System prompt

agent-guardian scan \
  --system-prompt ./prompts/customer-support-agent.txt \
  --model gemini:gemini-3.5-flash \
  --mode fast

In-process Python agent

agent-guardian scan my_app.agent:agent \
  --model gemini:gemini-3.5-flash \
  --mode smart

Point AgentGuardian at any importable Python callable or agent object (module:attr) and it runs in-process — useful for pre-deploy and CI, with nothing to host.

What AgentGuardian catches

AgentGuardian tests agentic risks that normal prompt scanners miss:

Prompt injection and goal hijack
Unsafe tool calls and tool chaining
Privilege abuse
RAG poisoning and indirect prompt injection
Memory and context poisoning
Sensitive data leakage
Agent-to-agent manipulation
Cascading failures
Trust exploitation and unsafe outputs
Goal drift and untraceable behavior

Every probe maps to OWASP Top 10 for Agentic Applications, MITRE ATLAS, and the CSA Agentic AI Red Teaming Guide — full per-ASI breakdown in the framework-coverage-matrix.

Reports & evidence

Every scan writes a signed, verifiable evidence bundle to ~/.agentguardian/scans/<scan-id>/:

Artifact	What it is
`scan.json`	Machine-readable findings, signed (HMAC-SHA256 + Ed25519)
`events.jsonl`	The scan timeline
`probe/`	Per-probe requests, responses, verdicts, and evidence
`forensic_manifest.json`	Integrity manifest for the bundle

Export in any format, any time:

agent-guardian report SCAN_ID --output pdf --output-path report.pdf

Formats: json · sarif · junit · md · gitlab · pdf — see the sample report. Verify stored evidence with agent-guardian verify.

Scan modes

fast — quick local feedback
smart — broader coverage for development and pull requests
full — release gates and audit evidence

Use full when you need AIVSS-scored findings for CI/CD gates.

Commands

Command	What it does
`agent-guardian scan`	Run an adversarial swarm scan against a target
`agent-guardian report <id> --output FMT`	Regenerate a report — `json` · `sarif` · `junit` · `md` · `gitlab` · `pdf` · `badge`
`agent-guardian gate <id> --fail-under N`	Apply pass/fail thresholds to a stored scan (CI exit codes)
`agent-guardian serve`	Start the local dashboard
`agent-guardian scans list` / `delete`	List or delete stored scans (`delete --older-than 30d` for bulk cleanup)
`agent-guardian config show` / `init`	Inspect the effective config / scaffold a config file
`agent-guardian verify <report>`	Verify the HMAC-SHA256 + Ed25519 signatures on a report
`agent-guardian last-score`	Print the AIVSS of the most recent scan
`agent-guardian doctor`	Verify the install, provider keys, and prerequisites
`agent-guardian version`	Print the installed version

Run any command with --help, or see the CLI reference.

Run with Docker

docker build -t agent-guardian .

docker run --rm \
  -e GEMINI_API_KEY \
  -v "$HOME/.agentguardian:/root/.agentguardian" \
  -p 7474:7474 \
  agent-guardian scan \
    --endpoint https://agent-guardian-testbench-u6tm6gzysq-uc.a.run.app/finbot/chat \
    --model gemini:gemini-3.5-flash \
    --mode fast

CI/CD with GitHub Actions

The shipped composite action runs a scan, uploads SARIF to GitHub Code Scanning, and (optionally) posts a summary comment on the pull request:

name: AgentGuardian

on:
  pull_request:
  push:
    branches: [main]

jobs:
  red-team:
    runs-on: ubuntu-latest
    permissions:
      contents: read
      security-events: write   # upload SARIF to Code Scanning
      pull-requests: write     # post the summary comment
    steps:
      - uses: actions/checkout@v4

      - uses: glacien-technologies/agent-guardian/.github/actions/agentguardian-scan@v1
        with:
          endpoint: http://localhost:8000/chat
          model: gemini:gemini-3.5-flash
          mode: full
          fail-under: "80"
          max-critical: "0"
          comment: "true"
        env:
          GEMINI_API_KEY: ${{ secrets.GEMINI_API_KEY }}

The job fails when the gate (fail-under / max-critical) is breached. For GitLab, Bitbucket, raw-CLI, and fleet/nightly setups, see the CI/CD guides.

Run from source

git clone https://github.com/glacien-technologies/agent-guardian.git
cd agent-guardian

python3.11 -m venv .venv
source .venv/bin/activate

pip install -e ".[dev]"

agent-guardian doctor

Contributing

We welcome new probes, new adapters, and new attacker logic. Start with the contribution guide and the good first issue label.

All commits must be DCO-signed:

git commit -s

By participating you agree to CODE_OF_CONDUCT.md and the ethics policy. AgentGuardian is for testing systems you own or are explicitly authorised to test.

Community

Join us on Discord for quickstart help, probe design, adapter questions, and roadmap discussion.

Security

To report a vulnerability, see SECURITY.md. Do not open public issues for security reports.

License

Apache-2.0. See LICENSE and NOTICE.

AgentGuardian is a trademark of Glacien Technologies. See TRADEMARKS.md for usage guidelines.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jegant

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.0rc26 pre-release

Jun 12, 2026

1.0.0rc25 pre-release

Jun 12, 2026

1.0.0rc24 pre-release

Jun 12, 2026

This version

1.0.0rc23 pre-release

Jun 12, 2026

1.0.0rc22 pre-release

Jun 12, 2026

1.0.0rc21 pre-release

Jun 12, 2026

1.0.0rc20 pre-release

Jun 12, 2026

1.0.0rc19 pre-release

Jun 11, 2026

1.0.0rc18 pre-release

Jun 11, 2026

1.0.0rc17 pre-release

Jun 11, 2026

1.0.0rc16 pre-release

Jun 11, 2026

1.0.0rc15 pre-release

Jun 11, 2026

1.0.0rc14 pre-release

Jun 9, 2026

1.0.0rc13 pre-release

Jun 9, 2026

1.0.0rc12 pre-release

Jun 9, 2026

1.0.0rc11 pre-release

Jun 9, 2026

1.0.0rc10 pre-release

Jun 7, 2026

1.0.0rc9 pre-release

Jun 6, 2026

1.0.0rc8 pre-release

Jun 5, 2026

1.0.0rc7 pre-release

Jun 4, 2026

1.0.0rc6 pre-release

Jun 4, 2026

1.0.0rc5 pre-release

Jun 3, 2026

1.0.0rc4 pre-release

Jun 3, 2026

1.0.0rc3 pre-release

Jun 2, 2026

1.0.0rc2 pre-release

Jun 1, 2026

1.0.0rc1 pre-release

May 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_guardian-1.0.0rc23.tar.gz (2.1 MB view details)

Uploaded Jun 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_guardian-1.0.0rc23-py3-none-any.whl (1.7 MB view details)

Uploaded Jun 12, 2026 Python 3

File details

Details for the file agent_guardian-1.0.0rc23.tar.gz.

File metadata

Download URL: agent_guardian-1.0.0rc23.tar.gz
Upload date: Jun 12, 2026
Size: 2.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for agent_guardian-1.0.0rc23.tar.gz
Algorithm	Hash digest
SHA256	`3f228f7df394a7b56244ffa38739aaa8c41c75874e913d91fadbc51dcc6d7568`
MD5	`8591a3294ddf242f0224c00a09561c5f`
BLAKE2b-256	`06c53a3fb5e1a72be50633d802963a960ff3034409a93c138cd3fd2a5ed98665`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_guardian-1.0.0rc23.tar.gz:

Publisher: publish.yml on glacien-technologies/agent-guardian

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agent_guardian-1.0.0rc23.tar.gz
- Subject digest: 3f228f7df394a7b56244ffa38739aaa8c41c75874e913d91fadbc51dcc6d7568
- Sigstore transparency entry: 1802531524
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: glacien-technologies/agent-guardian@d90229d54efe87becf11eebc6bcc2e2608da456b
- Branch / Tag: refs/tags/v1.0.0rc23
- Owner: https://github.com/glacien-technologies
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@d90229d54efe87becf11eebc6bcc2e2608da456b
- Trigger Event: push

File details

Details for the file agent_guardian-1.0.0rc23-py3-none-any.whl.

File metadata

Download URL: agent_guardian-1.0.0rc23-py3-none-any.whl
Upload date: Jun 12, 2026
Size: 1.7 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for agent_guardian-1.0.0rc23-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3f90e3fc13f2181a9426d5827455dc37ed63317561940bfba6b1c556b351831f`
MD5	`5904defd44515d3f424339a2d366afbd`
BLAKE2b-256	`b985eb7df088cca1314dc12e52b95d00dca8646727249886f0c40bc7f8db34bd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_guardian-1.0.0rc23-py3-none-any.whl:

Publisher: publish.yml on glacien-technologies/agent-guardian

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agent_guardian-1.0.0rc23-py3-none-any.whl
- Subject digest: 3f90e3fc13f2181a9426d5827455dc37ed63317561940bfba6b1c556b351831f
- Sigstore transparency entry: 1802531806
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: glacien-technologies/agent-guardian@d90229d54efe87becf11eebc6bcc2e2608da456b
- Branch / Tag: refs/tags/v1.0.0rc23
- Owner: https://github.com/glacien-technologies
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@d90229d54efe87becf11eebc6bcc2e2608da456b
- Trigger Event: push

agent-guardian 1.0.0rc23

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

AgentGuardian

Quickstart

Scan targets

HTTP agent

System prompt

In-process Python agent

What AgentGuardian catches

Reports & evidence

Scan modes

Commands

Run with Docker

CI/CD with GitHub Actions

Run from source

Contributing

Community

Security

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance