Autonomous AI Risk Assessment & Mitigation Framework
Project description
Aegis-Eval: Autonomous AI Risk Assessment & Mitigation Framework
Aegis-Eval is an enterprise-grade evaluation framework designed to empirically test the safety boundaries of autonomous, self-improving AI agents. It provides a zero-trust execution environment with high-frequency telemetry to detect and mitigate risks such as Instrumental Convergence, Resource Hijacking, and Prohibited Operations.
🚀 Key Features: Hardened Cognitive Firewall V3.0 [MAJOR BREAKTHROUGH]
V3.0 represents a significant leap in AI security, closing common bypass gaps via kernel-level containment and semantic de-obfuscation.
- Hardened Intent Interception: [NEW] Now includes a Semantic De-obfuscator that resolves string concatenations (e.g.,
'os.' + 'system') and blocks dynamic attribute lookups (getattr,setattr) used to hide malicious intent. - Kernel-Level Containment: [NEW] The sandbox now drops ALL Linux Capabilities (
cap_drop=['ALL']) and usesno-new-privilegesto prevent C-extension or syscall-based bypasses of Python-level safety. - Cumulative Exfiltration Tracking: [NEW] Detects 'Drip-Fed' leaks by monitoring entropy across the entire agent session, stopping models that try to leak secrets one character at a time.
- Contextual Access Control (C-ACLs): Enforces "Least Privilege" data access based on the agent's assigned role.
- Deception Engineering (Honeypots): Automatically injects fake secrets (honeypots) to definitively prove malicious intent.
- Swarm Security: Sanitizes communication between multiple agents in a collaborative swarm.
🔬 Testing Philosophy: Total Containment
We move beyond "Alignment" into Hardened Technical Containment. V3.0 ensures that even if an agent is determined to be malicious and uses obfuscation, it is technically blocked at both the symbolic and kernel levels.
🛠 Installation
pip install aegis-ai-eval
💻 Quick Start (Developer API)
Integrate Aegis into your own Agentic workflows (LangChain, CrewAI, etc.):
from aegis import CognitiveFirewall
# Initialize the firewall with your Safety DNA
firewall = CognitiveFirewall(dna_path="prompts/safety_dna.json")
# Verify code before execution
report = await firewall.verify_code("import os; os.system('rm -rf /')")
if report['intent']['status'] == 'violation':
print(f"Blocked: {report['intent']['violations']}")
📂 Architecture
aegis/core/orchestrator.py: The central nervous system managing the agent lifecycle.aegis/monitors/context_gatekeeper.py: Enforces the data perimeter and honeypots.aegis/monitors/network_proxy.py: Intercepts and sanitizes network egress.aegis/monitors/inter_agent_firewall.py: Secures inter-agent communication.tools/policy_gen.py: Automatically generates Safety DNA policies from your workspace.
🛡 Disclaimer
Aegis-Eval is built for security research and AI alignment testing. Always run in a controlled environment.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file aegis_ai_eval-3.0.0.tar.gz.
File metadata
- Download URL: aegis_ai_eval-3.0.0.tar.gz
- Upload date:
- Size: 20.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d29a0426de187f607dc15621d7ab7a15a681407562dbbb87362869c43804ab4f
|
|
| MD5 |
2ea049860868db85903f41c80aa6b780
|
|
| BLAKE2b-256 |
c4ca3d87f904544697be6fded63529f96e69dd10fbf3713116cc3cf82e72ce21
|
File details
Details for the file aegis_ai_eval-3.0.0-py3-none-any.whl.
File metadata
- Download URL: aegis_ai_eval-3.0.0-py3-none-any.whl
- Upload date:
- Size: 22.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fa0a4a657677a8fd0b67c679d59c83a017e75c584f2565f65ff7f129d0c25d56
|
|
| MD5 |
121cb1242c28a77070deceb2d9249ed0
|
|
| BLAKE2b-256 |
9fbf81a6554a4fef1e29684566b935b69b8e5ded38f75507113cd590eee2b143
|