Skip to main content

Autonomous AI Risk Assessment & Mitigation Framework

Project description

Aegis-Eval: Autonomous AI Risk Assessment & Mitigation Framework

Aegis-Eval is an enterprise-grade evaluation framework designed to empirically test the safety boundaries of autonomous, self-improving AI agents. It provides a zero-trust execution environment with high-frequency telemetry to detect and mitigate risks such as Instrumental Convergence, Resource Hijacking, and Prohibited Operations.

🚀 Key Features: Aegis Cognitive Firewall V2.0

V2.0 introduces advanced threat mitigation strategies, moving from simple containment to proactive deception and swarm security.

  • Contextual Access Control (C-ACLs): Sitting between the agent and the host, the Gatekeeper uses Safety DNA to block access to sensitive files based on the agent's current role.
  • Deception Engineering (Honeypots): [NEW] Automatically injects fake secrets (honeypots) when an agent attempts to access forbidden resources, definitively proving malicious intent via exfiltration tracking.
  • Semantic Network Proxy: [NEW] Allows controlled egress for AI agents. Analyzes outgoing data for high-entropy secrets or honeypot signatures before allowing external connections.
  • Swarm Security (Inter-Agent Firewall): [NEW] Sanitizes communication between multiple agents in a swarm, preventing sensitive data leakage across privilege boundaries.
  • Intent-Symbolic Interceptor (ISI): An AST-based safety layer that blocks malicious intent before execution.
  • Structural Loop Detection: Detects infinite loops by comparing the semantic "skeleton" of the code.

🔬 Testing Philosophy: Containment over Alignment

We believe that making models "act nice" (Alignment) is insufficient. Aegis-Eval focuses on Containment—ensuring that models are technically unable to violate safety policies.


🛠 Installation

You can now install Aegis-Eval directly into your own projects:

pip install aegis-ai-eval

💻 Quick Start (Developer API)

Integrate Aegis into your own Agentic workflows (LangChain, CrewAI, etc.):

from aegis import CognitiveFirewall

# Initialize the firewall with your Safety DNA
firewall = CognitiveFirewall(dna_path="prompts/safety_dna.json")

# Verify code before execution
report = await firewall.verify_code("import os; os.system('rm -rf /')")
if report['intent']['status'] == 'violation':
    print(f"Blocked: {report['intent']['violations']}")

📂 Architecture

  • aegis/core/orchestrator.py: The central nervous system managing the agent lifecycle.
  • aegis/monitors/context_gatekeeper.py: Enforces the data perimeter and honeypots.
  • aegis/monitors/network_proxy.py: Intercepts and sanitizes network egress.
  • aegis/monitors/inter_agent_firewall.py: Secures inter-agent communication.
  • tools/policy_gen.py: Automatically generates Safety DNA policies from your workspace.

🛡 Disclaimer

Aegis-Eval is built for security research and AI alignment testing. Always run in a controlled environment.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aegis_ai_eval-2.0.2.tar.gz (20.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aegis_ai_eval-2.0.2-py3-none-any.whl (22.5 kB view details)

Uploaded Python 3

File details

Details for the file aegis_ai_eval-2.0.2.tar.gz.

File metadata

  • Download URL: aegis_ai_eval-2.0.2.tar.gz
  • Upload date:
  • Size: 20.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for aegis_ai_eval-2.0.2.tar.gz
Algorithm Hash digest
SHA256 bbd8dd73eb5202ea7f92055fe83c400ad7a16e2bfa1bd3b864990da15413a2c1
MD5 6cb619c2c7058725e1c5766865e4938f
BLAKE2b-256 8c4abbd0a7e8bec3cf2bb391b85be6086ef4ac8901014b7421dc6aa198d6790f

See more details on using hashes here.

File details

Details for the file aegis_ai_eval-2.0.2-py3-none-any.whl.

File metadata

  • Download URL: aegis_ai_eval-2.0.2-py3-none-any.whl
  • Upload date:
  • Size: 22.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for aegis_ai_eval-2.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3aeb239e4c5afe8ed302cf4a595e18db8ec9e668f305670f3ab9a63e70febe15
MD5 6f308dbedff885b356c5f1b17ac5a948
BLAKE2b-256 b7f63dbb77c8c422accfa591e30f6caffc3c4872e9be714e9befa14ddb104f1a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page