Rogue agent evaluator by Rogue Security

Project description

Rogue — AI Agent Evaluator & Red Team Platform

Stress-test your AI agents before attackers do.

Two Ways to Harden Your Agent

🎯 Automatic Evaluation

Test your agent against business policies and expected behaviors.

Define scenarios & expected outcomes
Verify compliance with business rules
Watch live conversations as Rogue probes your agent
Get detailed pass/fail reports with reasoning

Best for: Regression testing, behavior validation, policy compliance

🔴 Red Teaming

Simulate adversarial attacks to find security vulnerabilities.

75+ vulnerabilities across 12 security categories
20 attack techniques (encoding, social engineering, injection)
CVSS-based risk scoring
8 compliance frameworks (OWASP, MITRE, NIST, GDPR, EU AI Act)

Best for: Security audits, penetration testing, compliance reporting

Architecture

Rogue operates on a client-server architecture with multiple interfaces:

Component	Description
Server	Core evaluation & red team logic
TUI	Modern terminal interface (Go + Bubble Tea)
CLI	Non-interactive mode for CI/CD pipelines

https://github.com/user-attachments/assets/b5c04772-6916-4aab-825b-6a7476d77787

Supported Protocols

Protocol	Transport	Description
A2A	HTTP	Google's Agent-to-Agent protocol
MCP	SSE, STREAMABLE_HTTP	Model Context Protocol via `send_message` tool
Python	—	Direct Python function calls (no network protocol)

See the `examples/` directory for reference implementations.

Python Entrypoint

For agents implemented as Python functions without A2A or MCP:

Create a Python file (for example, `my_agent.py`) that defines a `call_agent` function:

def call_agent(messages: list[dict]) -> str:
    """
    Process conversation and return response.

    Args:
        messages: List of {"role": "user"|"assistant", "content": "..."}

    Returns:
        Agent's response as a string
    """
    # Your agent logic here
    latest = messages[-1]["content"]
    return f"Response to: {latest}"

Run Rogue with the Python protocol:

uvx rogue-ai cli \
  --protocol python \
  --python-entrypoint-file ./my_agent.py \
  --judge-llm openai/gpt-4o-mini

Or via TUI: select "Python" as the protocol and enter the file path.

See examples/python_entrypoint_stub.py for a complete example.
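
Before pointing Rogue at the file, it can help to exercise `call_agent` locally. The sketch below is a minimal harness and not part of Rogue: `run_conversation` is a hypothetical helper that feeds user turns to the function and records the replies in the message format described above. It reuses the echo stub from this section so that it runs on its own.

```python
from typing import Callable

Message = dict[str, str]


def call_agent(messages: list[Message]) -> str:
    """Echo stub from the section above; replace with your agent logic."""
    latest = messages[-1]["content"]
    return f"Response to: {latest}"


def run_conversation(
    agent: Callable[[list[Message]], str], user_turns: list[str]
) -> list[Message]:
    """Feed user turns to the agent one at a time and keep the transcript."""
    history: list[Message] = []
    for turn in user_turns:
        history.append({"role": "user", "content": turn})
        reply = agent(history)
        if not isinstance(reply, str):
            raise TypeError("call_agent must return a string")
        history.append({"role": "assistant", "content": reply})
    return history


if __name__ == "__main__":
    for message in run_conversation(call_agent, ["Hi", "Do you sell hats?"]):
        print(f"{message['role']}: {message['content']}")
```

A check like this catches signature or return-type mistakes before a full evaluation run.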

🔥 Quick Start

Prerequisites

uvx (installed with uv)
Python 3.10+
LLM API key (OpenAI, Anthropic, or Google)

Installation

# TUI (recommended)
uvx rogue-ai

# CLI / CI/CD
uvx rogue-ai cli

Try It With the Example Agent

# All-in-one: starts both Rogue and a sample T-shirt store agent
uvx rogue-ai --example=tshirt_store

Configure in the UI:

Agent URL: http://localhost:10001
Mode: Choose Automatic Evaluation or Red Teaming

Running Modes

Mode	Command	Description
Default	`uvx rogue-ai`	Server + TUI
Server	`uvx rogue-ai server`	Backend only
TUI	`uvx rogue-ai tui`	Terminal client
CLI	`uvx rogue-ai cli`	Non-interactive (CI/CD)

Server Options

uvx rogue-ai server --host 0.0.0.0 --port 8000 --debug

CLI Options

uvx rogue-ai cli \
  --evaluated-agent-url http://localhost:10001 \
  --judge-llm openai/gpt-4o-mini \
  --business-context-file ./.rogue/business_context.md

Option	Description
`--config-file`	Path to config JSON
`--evaluated-agent-url`	Agent endpoint (required for the A2A and MCP protocols)
`--judge-llm`	LLM for evaluation (required)
`--business-context`	Business context as a string (or pass a file path with `--business-context-file`)
`--input-scenarios-file`	Scenarios JSON
`--output-report-file`	Report output path
`--deep-test-mode`	Extended testing
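
For CI pipelines that drive Rogue from a Python script, the command can be assembled programmatically. The following is a minimal sketch that uses only the flags listed in the table above. The helper name `build_rogue_cli_command` and the report path `./rogue_report.md` are assumptions made for illustration, not part of Rogue.

```python
import shlex
from typing import Optional


def build_rogue_cli_command(
    agent_url: str,
    judge_llm: str,
    business_context_file: Optional[str] = None,
    scenarios_file: Optional[str] = None,
    report_file: Optional[str] = None,
) -> list[str]:
    """Assemble the argument list for `uvx rogue-ai cli` from documented flags."""
    command = [
        "uvx", "rogue-ai", "cli",
        "--evaluated-agent-url", agent_url,
        "--judge-llm", judge_llm,
    ]
    optional_flags = {
        "--business-context-file": business_context_file,
        "--input-scenarios-file": scenarios_file,
        "--output-report-file": report_file,
    }
    # Only include the optional flags that were actually provided.
    for flag, value in optional_flags.items():
        if value is not None:
            command += [flag, value]
    return command


if __name__ == "__main__":
    command = build_rogue_cli_command(
        "http://localhost:10001",
        "openai/gpt-4o-mini",
        business_context_file="./.rogue/business_context.md",
        report_file="./rogue_report.md",  # hypothetical output path
    )
    # Print the shell form; pass the list to subprocess.run to execute it.
    print(shlex.join(command))
```

Building the command as a list avoids shell-quoting problems when paths contain spaces.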

Red Teaming

Scan Types

Type	Vulnerabilities	Attacks	Time
Basic	5 curated	6	~2-3 min
Full	75+	40+	~30-45 min
Custom	User-selected	User-selected	Varies

Compliance Frameworks

OWASP LLM Top 10 — Prompt injection, sensitive data exposure, excessive agency
MITRE ATLAS — Adversarial threat landscape for AI systems
NIST AI RMF — AI risk management framework
ISO/IEC 42001 — AI management system standard
EU AI Act — European AI regulation compliance
GDPR — Data protection requirements
OWASP API Top 10 — API security best practices

Attack Categories

Category	Examples
Encoding	Base64, ROT13, Leetspeak
Social Engineering	Roleplay, trust building
Injection	Prompt injection, SQL injection
Semantic	Goal redirection, context poisoning
Technical	Gray-box probing, permission escalation
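
To make the Encoding row concrete, the sketch below shows how a single probe string looks under the three transformations named there. It illustrates the encodings themselves and is not Rogue's attack implementation. The leetspeak substitution table and the function name `encode_variants` are assumptions.

```python
import base64
import codecs

# Illustrative leetspeak substitutions; real tools may use a different table.
LEET_TABLE = str.maketrans(
    {"a": "4", "e": "3", "i": "1", "o": "0", "s": "5", "t": "7"}
)


def encode_variants(prompt: str) -> dict[str, str]:
    """Return the prompt under each encoding listed in the table above."""
    return {
        "base64": base64.b64encode(prompt.encode("utf-8")).decode("ascii"),
        "rot13": codecs.encode(prompt, "rot13"),
        "leetspeak": prompt.lower().translate(LEET_TABLE),
    }


if __name__ == "__main__":
    for name, encoded in encode_variants("hello agent").items():
        print(f"{name}: {encoded}")
```

An agent that refuses a request in plain text but complies with the same request in one of these forms is applying its guardrails to the surface text rather than to the decoded intent.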

Risk Scoring (CVSS-based)

Each vulnerability receives a 0-10 risk score based on:

Impact — Severity if exploited
Exploitability — Success rate likelihood
Human Factor — Manual exploitation potential
Complexity — Attack difficulty

Reproducible Scans

# Use random seeds for reproducible results
uvx rogue-ai cli --random-seed 42

Perfect for regression testing and validating security fixes.
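
As a general illustration of why a fixed seed makes selection repeatable, the sketch below samples from a list of attack names with a seeded random number generator. How Rogue consumes `--random-seed` internally is not described here, so the attack list and the function `pick_attacks` are hypothetical.

```python
import random

# Hypothetical attack names, loosely based on the categories above.
ATTACKS = [
    "base64",
    "rot13",
    "leetspeak",
    "roleplay",
    "prompt_injection",
    "goal_redirection",
]


def pick_attacks(seed: int, count: int) -> list[str]:
    """Sample attacks with a private RNG so global random state is untouched."""
    rng = random.Random(seed)
    return rng.sample(ATTACKS, count)


if __name__ == "__main__":
    first = pick_attacks(seed=42, count=3)
    second = pick_attacks(seed=42, count=3)
    print(first)
    print(first == second)
```

The same seed yields the same selection on every run, which is what makes a before-and-after comparison of a security fix meaningful.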

Configuration

Environment Variables

OPENAI_API_KEY="sk-..."
ANTHROPIC_API_KEY="sk-..."
GOOGLE_API_KEY="..."
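
A quick pre-flight check can confirm that at least one of these keys is set before starting a run. The sketch below is an assumption-laden example: the function `configured_providers` and the provider labels are hypothetical, and only the variable names come from the list above.

```python
import os
from typing import Mapping

# Provider labels are illustrative; the variable names are the ones listed above.
PROVIDER_KEYS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "google": "GOOGLE_API_KEY",
}


def configured_providers(env: Mapping[str, str] = os.environ) -> list[str]:
    """Return the providers whose API key variable is set and non-empty."""
    return [
        provider
        for provider, variable in PROVIDER_KEYS.items()
        if env.get(variable, "").strip()
    ]


if __name__ == "__main__":
    providers = configured_providers()
    if providers:
        print("API keys found for:", ", ".join(providers))
    else:
        print("No LLM API key found; set one of:", ", ".join(PROVIDER_KEYS.values()))
```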

Config File (`.rogue/user_config.json`)

{
  "evaluated_agent_url": "http://localhost:10001",
  "judge_llm": "openai/gpt-4o-mini"
}
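
The config file can be sanity-checked before a run. The sketch below parses the JSON and verifies that the two keys shown above are present. Treating both keys as mandatory is an assumption based on the CLI options table, and `parse_user_config` and `load_user_config` are hypothetical helpers rather than Rogue APIs.

```python
import json
from pathlib import Path

# Assumed to be mandatory, mirroring the options marked "required" for the CLI.
REQUIRED_KEYS = ("evaluated_agent_url", "judge_llm")


def parse_user_config(raw: str) -> dict:
    """Parse config JSON and fail early if an expected key is missing or empty."""
    config = json.loads(raw)
    if not isinstance(config, dict):
        raise ValueError("config must be a JSON object")
    missing = [key for key in REQUIRED_KEYS if not config.get(key)]
    if missing:
        raise ValueError(f"missing config keys: {', '.join(missing)}")
    return config


def load_user_config(path: Path = Path(".rogue/user_config.json")) -> dict:
    """Read and validate the config file at the documented default location."""
    return parse_user_config(path.read_text(encoding="utf-8"))


if __name__ == "__main__":
    example = '{"evaluated_agent_url": "http://localhost:10001", "judge_llm": "openai/gpt-4o-mini"}'
    print(parse_user_config(example))
```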

Key Features

Feature	Description
🔄 Dynamic Scenarios	Auto-generate tests from business context
👀 Live Monitoring	Watch agent conversations in real-time
📊 Comprehensive Reports	Markdown, CSV, JSON exports
🔍 Multi-Faceted Testing	Policy compliance + security vulnerabilities
🤖 Model Support	OpenAI, Anthropic, Google (via LiteLLM)
🛡️ CVSS Scoring	Industry-standard risk assessment
🔁 Reproducible	Deterministic scans with random seeds

Documentation

Quick Reference — One-page cheat sheet
Red Team Workflow — Technical deep-dive
Implementation Status — Feature breakdown
Attack Mapping — Vulnerability coverage

Contributing

Fork the repository
Create a branch (`git checkout -b feature/amazing-feature`)
Commit changes (`git commit -m 'Add amazing feature'`)
Push (`git push origin feature/amazing-feature`)
Open a Pull Request

License

Licensed under a proprietary license — see LICENSE.

Free for personal and internal use. Commercial hosting requires licensing. Contact: hello@rogue.security

Project details

Release history

Version	Release date
0.6.4	Apr 29, 2026
0.6.3	Apr 29, 2026
0.6.2	Apr 28, 2026
0.6.1	Apr 27, 2026
0.6.0	Apr 26, 2026
0.5.1	Apr 19, 2026
0.5.0 (this version)	Mar 17, 2026
0.4.1	Feb 24, 2026
0.4.0	Feb 23, 2026
0.3.6	Feb 5, 2026
0.3.5	Feb 4, 2026
0.3.4	Jan 18, 2026
0.3.3	Jan 8, 2026
0.3.2	Jan 7, 2026
0.3.1	Jan 5, 2026
0.3.0	Jan 3, 2026
0.2.3	Nov 11, 2025
0.2.2	Nov 9, 2025
0.2.1	Nov 3, 2025
0.2.0	Oct 29, 2025
0.1.13	Oct 22, 2025
0.1.12	Oct 15, 2025
0.1.11	Oct 13, 2025
0.1.10	Oct 9, 2025
0.1.9	Oct 9, 2025
0.1.8	Oct 9, 2025
0.1.7	Oct 6, 2025
0.1.6	Oct 6, 2025
0.1.5	Oct 1, 2025
0.1.3	Sep 8, 2025
0.1.2	Sep 8, 2025
0.1.1	Sep 7, 2025
0.1.0	Sep 3, 2025

Download files

Source Distribution

rogue_ai-0.5.0.tar.gz (14.0 MB)

Uploaded Mar 17, 2026 Source

Built Distribution

rogue_ai-0.5.0-py3-none-any.whl (318.7 kB)

Uploaded Mar 17, 2026 Python 3

File details

Details for the file rogue_ai-0.5.0.tar.gz.

File metadata

Download URL: rogue_ai-0.5.0.tar.gz
Upload date: Mar 17, 2026
Size: 14.0 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rogue_ai-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`4403fd287b0f4c8b0dd6dbc7065de7d83ef598dfbb8d59c0216e939cc78f5150`
MD5	`9f0066b5286c7579abc25fee55061eaf`
BLAKE2b-256	`848c23657961b1846990a4c85b1789a64330cc4f2af57d2444499a752256a4f3`

Provenance

The following attestation bundles were made for rogue_ai-0.5.0.tar.gz:

Publisher: release.yml on qualifire-dev/rogue

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rogue_ai-0.5.0.tar.gz
- Subject digest: 4403fd287b0f4c8b0dd6dbc7065de7d83ef598dfbb8d59c0216e939cc78f5150
- Sigstore transparency entry: 1116492399
- Sigstore integration time: Mar 17, 2026
Source repository:
- Permalink: qualifire-dev/rogue@6f8369cd08031b0b2e714d2045863f266f09ba04
- Branch / Tag: refs/tags/v0.5.0
- Owner: https://github.com/qualifire-dev
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6f8369cd08031b0b2e714d2045863f266f09ba04
- Trigger Event: push

File details

Details for the file rogue_ai-0.5.0-py3-none-any.whl.

File metadata

Download URL: rogue_ai-0.5.0-py3-none-any.whl
Upload date: Mar 17, 2026
Size: 318.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rogue_ai-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9641b5f6ca5138b6c37562ccf0b1ca4ccc9786f078c9c7846989313a8381e80a`
MD5	`5dac97dc32c72c8d1f7b2b5f999977f0`
BLAKE2b-256	`478cc1a37853341e2ee3b69ae60d4215790adcc164476a495e4b8de0275b32d0`

Provenance

The following attestation bundles were made for rogue_ai-0.5.0-py3-none-any.whl:

Publisher: release.yml on qualifire-dev/rogue

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rogue_ai-0.5.0-py3-none-any.whl
- Subject digest: 9641b5f6ca5138b6c37562ccf0b1ca4ccc9786f078c9c7846989313a8381e80a
- Sigstore transparency entry: 1116492457
- Sigstore integration time: Mar 17, 2026
Source repository:
- Permalink: qualifire-dev/rogue@6f8369cd08031b0b2e714d2045863f266f09ba04
- Branch / Tag: refs/tags/v0.5.0
- Owner: https://github.com/qualifire-dev
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6f8369cd08031b0b2e714d2045863f266f09ba04
- Trigger Event: push
