Autonomous AI Risk Assessment & Mitigation Framework
Project description
Aegis-Eval: Autonomous AI Risk Assessment & Mitigation Framework
Aegis-Eval is an enterprise-grade evaluation framework designed to empirically test the safety boundaries of autonomous, self-improving AI agents. It provides a zero-trust execution environment with high-frequency telemetry to detect and mitigate risks such as Instrumental Convergence, Resource Hijacking, and Prohibited Operations.
🚀 Key Features: Aegis Cognitive Firewall V2.0
V2.0 introduces advanced threat mitigation strategies, moving from simple containment to proactive deception and swarm security.
- Contextual Access Control (C-ACLs): Sitting between the agent and the host, the Gatekeeper uses Safety DNA to block access to sensitive files based on the agent's current role.
- Deception Engineering (Honeypots): [NEW] Automatically injects fake secrets (honeypots) when an agent attempts to access forbidden resources, definitively proving malicious intent via exfiltration tracking.
- Semantic Network Proxy: [NEW] Allows controlled egress for AI agents. Analyzes outgoing data for high-entropy secrets or honeypot signatures before allowing external connections.
- Swarm Security (Inter-Agent Firewall): [NEW] Sanitizes communication between multiple agents in a swarm, preventing sensitive data leakage across privilege boundaries.
- Intent-Symbolic Interceptor (ISI): An AST-based safety layer that blocks malicious intent before execution.
- Structural Loop Detection: Detects infinite loops by comparing the semantic "skeleton" of the code.
🔬 Testing Philosophy: Containment over Alignment
We believe that making models "act nice" (Alignment) is insufficient. Aegis-Eval focuses on Containment—ensuring that models are technically unable to violate safety policies.
🛠 Installation
You can now install Aegis-Eval directly into your own projects:
pip install aegis-ai-eval
💻 Quick Start (Developer API)
Integrate Aegis into your own Agentic workflows (LangChain, CrewAI, etc.):
from aegis import CognitiveFirewall
# Initialize the firewall with your Safety DNA
firewall = CognitiveFirewall(dna_path="prompts/safety_dna.json")
# Verify code before execution
report = await firewall.verify_code("import os; os.system('rm -rf /')")
if report['intent']['status'] == 'violation':
print(f"Blocked: {report['intent']['violations']}")
📂 Architecture
aegis/core/orchestrator.py: The central nervous system managing the agent lifecycle.aegis/monitors/context_gatekeeper.py: Enforces the data perimeter and honeypots.aegis/monitors/network_proxy.py: Intercepts and sanitizes network egress.aegis/monitors/inter_agent_firewall.py: Secures inter-agent communication.tools/policy_gen.py: Automatically generates Safety DNA policies from your workspace.
🛡 Disclaimer
Aegis-Eval is built for security research and AI alignment testing. Always run in a controlled environment.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file aegis_ai_eval-2.0.2.tar.gz.
File metadata
- Download URL: aegis_ai_eval-2.0.2.tar.gz
- Upload date:
- Size: 20.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bbd8dd73eb5202ea7f92055fe83c400ad7a16e2bfa1bd3b864990da15413a2c1
|
|
| MD5 |
6cb619c2c7058725e1c5766865e4938f
|
|
| BLAKE2b-256 |
8c4abbd0a7e8bec3cf2bb391b85be6086ef4ac8901014b7421dc6aa198d6790f
|
File details
Details for the file aegis_ai_eval-2.0.2-py3-none-any.whl.
File metadata
- Download URL: aegis_ai_eval-2.0.2-py3-none-any.whl
- Upload date:
- Size: 22.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3aeb239e4c5afe8ed302cf4a595e18db8ec9e668f305670f3ab9a63e70febe15
|
|
| MD5 |
6f308dbedff885b356c5f1b17ac5a948
|
|
| BLAKE2b-256 |
b7f63dbb77c8c422accfa591e30f6caffc3c4872e9be714e9befa14ddb104f1a
|