Skip to main content

Autonomous AI Risk Assessment & Mitigation Framework

Project description

Aegis-Eval: Autonomous AI Risk Assessment & Mitigation Framework

Aegis-Eval is an enterprise-grade evaluation framework designed to empirically test the safety boundaries of autonomous, self-improving AI agents. It provides a zero-trust execution environment with high-frequency telemetry to detect and mitigate risks such as Instrumental Convergence, Resource Hijacking, and Prohibited Operations.

🚀 Key Features: Hardened Cognitive Firewall V3.0 [MAJOR BREAKTHROUGH]

V3.0 represents a significant leap in AI security, closing common bypass gaps via kernel-level containment and semantic de-obfuscation.

  • Hardened Intent Interception: [NEW] Now includes a Semantic De-obfuscator that resolves string concatenations (e.g., 'os.' + 'system') and blocks dynamic attribute lookups (getattr, setattr) used to hide malicious intent.
  • Kernel-Level Containment: [NEW] The sandbox now drops ALL Linux Capabilities (cap_drop=['ALL']) and uses no-new-privileges to prevent C-extension or syscall-based bypasses of Python-level safety.
  • Cumulative Exfiltration Tracking: [NEW] Detects 'Drip-Fed' leaks by monitoring entropy across the entire agent session, stopping models that try to leak secrets one character at a time.
  • Contextual Access Control (C-ACLs): Enforces "Least Privilege" data access based on the agent's assigned role.
  • Deception Engineering (Honeypots): Automatically injects fake secrets (honeypots) to definitively prove malicious intent.
  • Swarm Security: Sanitizes communication between multiple agents in a collaborative swarm.

🔬 Testing Philosophy: Total Containment

We move beyond "Alignment" into Hardened Technical Containment. V3.0 ensures that even if an agent is determined to be malicious and uses obfuscation, it is technically blocked at both the symbolic and kernel levels.


🛠 Installation

pip install aegis-ai-eval

💻 Quick Start (Developer API)

Integrate Aegis into your own Agentic workflows (LangChain, CrewAI, etc.):

from aegis import CognitiveFirewall

# Initialize the firewall with your Safety DNA
firewall = CognitiveFirewall(dna_path="prompts/safety_dna.json")

# Verify code before execution
report = await firewall.verify_code("import os; os.system('rm -rf /')")
if report['intent']['status'] == 'violation':
    print(f"Blocked: {report['intent']['violations']}")

📂 Architecture

  • aegis/core/orchestrator.py: The central nervous system managing the agent lifecycle.
  • aegis/monitors/context_gatekeeper.py: Enforces the data perimeter and honeypots.
  • aegis/monitors/network_proxy.py: Intercepts and sanitizes network egress.
  • aegis/monitors/inter_agent_firewall.py: Secures inter-agent communication.
  • tools/policy_gen.py: Automatically generates Safety DNA policies from your workspace.

🛡 Disclaimer

Aegis-Eval is built for security research and AI alignment testing. Always run in a controlled environment.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aegis_ai_eval-3.0.0.tar.gz (20.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aegis_ai_eval-3.0.0-py3-none-any.whl (22.4 kB view details)

Uploaded Python 3

File details

Details for the file aegis_ai_eval-3.0.0.tar.gz.

File metadata

  • Download URL: aegis_ai_eval-3.0.0.tar.gz
  • Upload date:
  • Size: 20.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for aegis_ai_eval-3.0.0.tar.gz
Algorithm Hash digest
SHA256 d29a0426de187f607dc15621d7ab7a15a681407562dbbb87362869c43804ab4f
MD5 2ea049860868db85903f41c80aa6b780
BLAKE2b-256 c4ca3d87f904544697be6fded63529f96e69dd10fbf3713116cc3cf82e72ce21

See more details on using hashes here.

File details

Details for the file aegis_ai_eval-3.0.0-py3-none-any.whl.

File metadata

  • Download URL: aegis_ai_eval-3.0.0-py3-none-any.whl
  • Upload date:
  • Size: 22.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for aegis_ai_eval-3.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fa0a4a657677a8fd0b67c679d59c83a017e75c584f2565f65ff7f129d0c25d56
MD5 121cb1242c28a77070deceb2d9249ed0
BLAKE2b-256 9fbf81a6554a4fef1e29684566b935b69b8e5ded38f75507113cd590eee2b143

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page