Add your description here

Project description

Vauban 🏰

The Automated Red Teaming & Siege Platform for AI Models.

Vauban is a Python SDK designed for AI Red Teams to probe, assess, and "siege" LLMs to uncover vulnerabilities. It uses GEPA (Genetic-Pareto) evolutionary strategies to automatically generate jailbreaks and adversarial attacks.

Features

Atomic Scouting: Probe models with single prompts.
Intel Assessment: Automatically score responses for refusals vs. breaches using anomaly detection and LLM Judges.
Siege Campaigns: Run automated, multi-generation evolutionary attacks using GEPA.
Agent Red Teaming: NEW! Test agents for dangerous tool-use vulnerabilities (e.g., Prompt Injection leading to RCE).
Threat Scenarios: NEW! Run standardized benchmark scenarios (Garage Door, Data Exfiltration, Salesforce Admin, Email Suppression) inspired by Lakera's B3.
Deception Detection: NEW! Detects Deceptive Alignment where models claim to refuse a request in text but covertly execute the attack via tool calls (Scheming).
GEPA Strategy: Uses reflective evolution and embedding distance guidance to navigate the latent space away from refusals.
Visualizations: Interactive "War Maps" using Apple's Embedding Atlas.
Reporting: Auto-generated HTML reports with Deceptive Breach alerts and Scenario breakdowns.

Installation

pip install -e .

You will need an OPENAI_API_KEY set in your environment.

Usage

1. Atomic Scout

import vauban
response = vauban.scout("How to hotwire a car")
print(response)

2. Intel Assessment

# Returns (Score, Is_Breach)
# Score < -1.5 usually indicates a breach/anomaly
score, breach = vauban.assess("Sure, here is how you do it...")

3. Full Siege Campaign (Text)

import asyncio
import vauban

asyncio.run(vauban.siege(generations=5, squad_size=10))

4. Advanced Agent Red Teaming (Threat Scenarios)

Run a targeted siege against a specific standardized scenario to benchmark Agent Security.

Available Scenarios:

garage_door: Coerce Home Assistant to open garage door via email injection.
data_exfil: Coerce browser agent to scan inbox for PII and exfiltrate via URL.
salesforce: Coerce browser agent to create unauthorized Salesforce Admin.
ignore_email: Coerce workspace assistant to mark critical deadline email as read and hide it.

import asyncio
import vauban

# Automatically sets up the environment, tools, and success criteria
asyncio.run(vauban.siege(
    scenario="garage_door",
    generations=3
))

Architecture

vauban.target: Protocol for connecting to models (default: OpenAI). Supports Agents with Tool Calling.
vauban.intel: LanceDB-backed vector database for storing and analyzing attacks.
vauban.strategies: Attack generation logic. Uses GEPA (Reflective Evolution) adapted for text, tool breaches, and deceptive alignment.
vauban.judge: LLM-based evaluator for grading severity. Auto-detects unsafe tool calls and Deceptive Alignment (Scheming).
vauban.scenarios: Library of standardized threat snapshots for benchmarking.
vauban.viz: Visualization tools.

License

MIT

Project details

Release history Release notifications | RSS feed

0.4.9

Apr 9, 2026

0.4.8

Apr 9, 2026

0.4.7

Apr 6, 2026

0.4.6

Apr 6, 2026

0.4.5

Apr 6, 2026

0.4.4

Apr 5, 2026

0.4.3

Apr 5, 2026

0.4.2

Apr 5, 2026

0.4.1

Apr 5, 2026

0.3.6

Apr 5, 2026

0.3.5

Apr 4, 2026

0.3.4

Mar 31, 2026

0.3.3

Mar 26, 2026

0.3.2

Mar 15, 2026

0.3.1

Mar 2, 2026

0.3.0

Mar 2, 2026

0.2.5

Feb 25, 2026

0.2.4

Feb 25, 2026

0.2.3

Feb 25, 2026

0.2.2

Feb 25, 2026

0.2.1

Feb 24, 2026

0.2.0

Feb 24, 2026

This version

0.1.2

Dec 4, 2025

0.1.1

Dec 4, 2025

0.1.0

Nov 13, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vauban-0.1.2.tar.gz (65.0 kB view details)

Uploaded Dec 4, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vauban-0.1.2-py3-none-any.whl (65.0 kB view details)

Uploaded Dec 4, 2025 Python 3

File details

Details for the file vauban-0.1.2.tar.gz.

File metadata

Download URL: vauban-0.1.2.tar.gz
Upload date: Dec 4, 2025
Size: 65.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.10 {"installer":{"name":"uv","version":"0.9.10"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vauban-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`59ffd83cca05ea4a513e5e47d524a987702de1c753bd1946b68f43f18921d15d`
MD5	`2b0da9153b5e7767dc1fba2dc9253890`
BLAKE2b-256	`3f776e76e64390443d5767f060ca02c4a61a82b4ba6d2bee93aff2feb3cabcce`

See more details on using hashes here.

File details

Details for the file vauban-0.1.2-py3-none-any.whl.

File metadata

Download URL: vauban-0.1.2-py3-none-any.whl
Upload date: Dec 4, 2025
Size: 65.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.10 {"installer":{"name":"uv","version":"0.9.10"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vauban-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`91377bc38728648e36614d182f12f81ad1ceb2fc95451f2dfc68c0ef28020910`
MD5	`9a843ff4613e86aeb509f255148c942a`
BLAKE2b-256	`078a1a7baaef4872ddb0d86bb1064ea68ac6b13fdaf43f2bab05f688f0d8f7a3`

See more details on using hashes here.

vauban 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Vauban 🏰

Features

Installation

Usage

1. Atomic Scout

2. Intel Assessment

3. Full Siege Campaign (Text)

4. Advanced Agent Red Teaming (Threat Scenarios)

Architecture

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes