Enterprise-grade System 2 security layer for autonomous AI agents. Protects against instruction smuggling, semantic camouflage, and supply-chain attacks.
Project description
Aletheia Cyber-Defense (ACD)
Enterprise-Grade System 2 Security for AI Agents
๐ Documentation โข
GitHub โข
The Problem
Autonomous AI agents increasingly manage CI/CD pipelines, financial transactions, and critical infrastructure. The LiteLLM supply-chain attack demonstrated that a single compromised dependency can silently exfiltrate credentials from thousands of production environments. Existing guardrails operate at the token level โ they cannot detect semantically camouflaged instructions or verify policy integrity at runtime.
Aletheia provides a System 2 reasoning layer that interposes between AI agents and the actions they request. Every action is verified against a cryptographically signed policy manifest, analyzed for semantic similarity to known attack patterns, and logged with a tamper-evident audit receipt โ before it is allowed to execute.
Security Guarantees
The following properties are cryptographically or architecturally enforced:
| # | Guarantee | Mechanism |
|---|---|---|
| 1 | Tamper-proof policy manifest | Ed25519 detached signature verified before every policy load. Invalid or missing signature causes a hard crash (ManifestTamperedError). |
| 2 | Semantic intent veto | SentenceTransformer (all-MiniLM-L6-v2) cosine similarity against 50+ camouflage phrases. Configurable threshold (default 0.55). |
| 3 | Grey-zone escalation | Payloads in the ambiguous similarity band (0.40โ0.55) are second-pass classified via keyword heuristics. Two or more high-risk keyword hits trigger a veto. |
| 4 | Action sandbox | Regex-based pattern scanner blocks subprocess exec, raw socket, eval, filesystem destruction, and privilege-escalation patterns before dispatch. |
| 5 | Daily alias rotation | Semantic alias phrase order is deterministically shuffled daily (SHA-256 seed from date + manifest hash) to prevent reverse-engineering via probing. |
| 6 | Embedding pre-warming | Model loaded eagerly at FastAPI startup to eliminate cold-start latency on the first request. |
| 7 | Audit trail integrity | Every decision produces a structured JSON log line and an HMAC-signed TMR receipt (decision + policy hash + signature). |
| 8 | Input hardening | NFKC homoglyph collapse, zero-width character strip, recursive Base64 decode, and URL percent-encoding decode โ all applied before any agent sees the payload. |
| 9 | Rate limiting | In-memory sliding-window limiter, default 10 requests per second per IP. |
| 10 | No stack-trace leakage | Global FastAPI exception handler returns an opaque error in production mode. |
| 11 | Config-driven defense modes | active / shadow / monitor โ switchable via environment variable or config.yaml without code changes. |
Additional guarantees:
- API Key Authentication โ
X-API-Keyheader required whenALETHEIA_API_KEYSis configured - Real Client IP โ rate limiting derived from network layer, never from request body
- Payload Privacy โ audit logs store SHA-256 hash + length only; no plaintext content in active mode
- Receipt Signing โ HMAC receipts use
ALETHEIA_RECEIPT_SECRET; falls back toUNSIGNED_DEV_MODE - Health Endpoint โ
GET /healthreturns version, uptime, and manifest verification status
Key Features
- Cryptographic Policy Integrity โ Ed25519-signed security manifest; tamper triggers an instant hard veto
- Semantic Intent Analysis โ Cosine similarity replaces string matching; catches camouflaged fund transfers, privilege escalation, and data exfiltration
- Grey-Zone Second-Pass Classifier โ Keyword heuristics catch creative paraphrases that fall below the primary threshold
- Action Sandbox โ Pattern-based scanner blocks subprocess, eval, raw socket, and filesystem-destruction payloads
- Polymorphic Defense โ Config-driven deterministic rotation across LINEAGE, INTENT, and SKEPTIC modes
- Structured Audit Trail โ JSON-line logging with HMAC-signed TMR receipts on every decision
- Rate Limiting โ Sliding-window limiter (10 req/s per IP, configurable)
- Input Hardening โ Homoglyph normalization, Base64 and URL-encoding recursive decode, control-character strip
- Daily Alias Rotation โ Alias bank order shuffled deterministically per day to resist probing
- Swarm-Resistant Triage โ Scout agent clusters diversionary noise and prioritizes high-blast-radius threats
Quick Start
Install
pip install -r requirements.txt
Optional Consciousness Proximity Module
To enable the optional proximity feature set:
pip install -r requirements-proximity.txt
export CONSCIOUSNESS_PROXIMITY_ENABLED=true
The proximity module is gated behind CONSCIOUSNESS_PROXIMITY_ENABLED=true and includes optional runtime dependencies for governance monitoring and relay scoring.
Sign the manifest (required before first run)
python main.py sign-manifest
Run a local audit
python main.py
Start the API server
uvicorn bridge.fastapi_wrapper:app --host 0.0.0.0 --port 8000
Run the test suite
pytest tests/ -v --ignore=tests/test_api.py
Architecture
Aletheia operates via a tri-agent consensus model:
Incoming Request
โ
โโ Input Hardening (NFKC, Base64, URL decode)
โ
โผ
โโโโโโโโโโโโโโโโโโโ
โ Scout โ Threat intelligence, swarm detection, IP scoring
โโโโโโโโโโฌโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโ
โ Nitpicker โ Polymorphic intent analysis, lineage tracing,
โ โ semantic blocked-pattern detection
โโโโโโโโโโฌโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโ
โ Judge โ Manifest signature verification, policy veto,
โ โ semantic alias veto, grey-zone escalation,
โ โ action sandbox check
โโโโโโโโโโฌโโโโโโโโโ
โ
PROCEED / DENY
โ
โผ
Audit Log + TMR Receipt
API Reference
POST /v1/audit
Request:
{
"payload": "string (max 10,000 chars)",
"origin": "trusted_admin | untrusted_metadata | external_file",
"action": "string",
"ip": "string"
}
Response:
{
"decision": "PROCEED | DENIED | RATE_LIMITED | SANDBOX_BLOCKED",
"metadata": {
"threat_level": 1.2,
"latency_ms": 14.0,
"redacted_payload": "string",
"client_id": "ALETHEIA_ENTERPRISE"
},
"receipt": {
"decision": "PROCEED",
"policy_hash": "sha256...",
"signature": "hmac-sha256...",
"issued_at": "ISO-8601"
}
}
Project Structure
aletheia-cyber-core/
โโโ agents/
โ โโโ scout_v2.py # Threat intelligence + swarm detection
โ โโโ nitpicker_v2.py # Polymorphic intent sanitization + embeddings
โ โโโ judge_v1.py # Policy enforcement + semantic veto
โโโ bridge/
โ โโโ fastapi_wrapper.py # Production REST API (rate-limited, audited)
โ โโโ config.py # Legacy config shim
โ โโโ utils.py # Input hardening (homoglyphs, Base64, URL)
โโโ core/
โ โโโ config.py # Centralized settings (env / yaml / defaults)
โ โโโ embeddings.py # Shared SentenceTransformer service
โ โโโ audit.py # Structured JSON logging + TMR receipts
โ โโโ rate_limit.py # Sliding-window rate limiter
โ โโโ sandbox.py # Action sandbox pattern scanner
โโโ manifest/
โ โโโ security_policy.json # Ground truth veto rules
โ โโโ security_policy.json.sig # Ed25519 detached signature
โ โโโ security_policy.ed25519.pub # Public verification key
โ โโโ signing.py # Manifest signing and verification
โโโ tests/
โ โโโ test_core.py # Integration tests
โ โโโ test_judge.py # Judge unit + adversarial tests
โ โโโ test_nitpicker.py # Nitpicker unit + semantic tests
โ โโโ test_enterprise.py # Audit, rate-limit, hardening tests
โ โโโ test_hardening.py # Sandbox, grey-zone, rotation tests
โ โโโ test_proximity/ # Consciousness proximity module (84 tests)
โโโ simulations/ # Adversarial simulation scripts
โโโ main.py # CLI entry point
โโโ AGENTS.md # Agent communication protocol
โโโ requirements.txt
Production Usage
Configuration
All settings are configurable via environment variables (prefixed ALETHEIA_) or config.yaml:
| Setting | Env Var | Default | Description |
|---|---|---|---|
intent_threshold |
ALETHEIA_INTENT_THRESHOLD |
0.55 |
Cosine similarity threshold for semantic veto |
grey_zone_lower |
ALETHEIA_GREY_ZONE_LOWER |
0.40 |
Lower bound of the grey-zone escalation band |
rate_limit_per_second |
ALETHEIA_RATE_LIMIT_PER_SECOND |
10 |
Max requests per second per IP |
mode |
ALETHEIA_MODE |
active |
Defense mode: active, shadow, or monitor |
log_level |
ALETHEIA_LOG_LEVEL |
INFO |
Logging verbosity |
audit_log_path |
ALETHEIA_AUDIT_LOG_PATH |
audit.log |
Path to the structured audit log |
Known Limitations
- Rate limiter is in-memory. State resets on process restart and does not synchronize across workers. Use Redis or an external store for horizontal scaling.
- Embedding model requires ~500 MB on disk. The
all-MiniLM-L6-v2model is downloaded on first use. Pre-pull in your Docker image build step. - Static alias bank. While daily rotation mitigates probing, a determined adversary with prolonged access could enumerate patterns. Consider supplementing with an LLM-based classifier for high-sensitivity deployments.
- No runtime syscall interception. The action sandbox validates declared intents, not runtime behavior. Pair with OS-level sandboxing (seccomp, AppArmor) for defense in depth.
Support
If this project is useful to your organization, consider supporting its development:
Environment Variables
| Variable | Required | Description |
|---|---|---|
ALETHEIA_API_KEYS |
Production | Comma-separated API keys for X-API-Key auth. Unset = open mode. |
ALETHEIA_RECEIPT_SECRET |
Production | HMAC secret for audit receipts. Unset = UNSIGNED_DEV_MODE. |
ALETHEIA_MODE |
No | active (default), shadow, or monitor |
ALETHEIA_LOG_LEVEL |
No | INFO (default), DEBUG, WARNING |
ALETHEIA_RATE_LIMIT_PER_SECOND |
No | Requests per IP per second. Default: 10 |
CONSCIOUSNESS_PROXIMITY_ENABLED |
No | Enable proximity module. Default: false |
Contributing
See CONTRIBUTING.md for guidelines on submitting issues and pull requests.
Security
To report a vulnerability, see SECURITY.md.
License
MIT โ Copyright (c) 2026 Ashura Joseph Holeyfield โ Aletheia Sovereign Systems
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file aletheia_cyber_core-1.4.0.tar.gz.
File metadata
- Download URL: aletheia_cyber_core-1.4.0.tar.gz
- Upload date:
- Size: 61.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
340f6f7678c4b8961a91546d4ef2379a0e880731faf533f8b5b89954f918790e
|
|
| MD5 |
0cbb13bbe63a41811c3a62f8477bb366
|
|
| BLAKE2b-256 |
50e4ed8f3a63e4c977c1a04e7a1dd2e269b16b9f15760f7295506b10a8acc5b0
|
Provenance
The following attestation bundles were made for aletheia_cyber_core-1.4.0.tar.gz:
Publisher:
release-version-sync.yml on holeyfield33-art/aletheia-core
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
aletheia_cyber_core-1.4.0.tar.gz -
Subject digest:
340f6f7678c4b8961a91546d4ef2379a0e880731faf533f8b5b89954f918790e - Sigstore transparency entry: 1236305637
- Sigstore integration time:
-
Permalink:
holeyfield33-art/aletheia-core@307aef0f2cbf7c3f36096dd36955f0250d3522b8 -
Branch / Tag:
refs/tags/v1.4.0 - Owner: https://github.com/holeyfield33-art
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-version-sync.yml@307aef0f2cbf7c3f36096dd36955f0250d3522b8 -
Trigger Event:
push
-
Statement type:
File details
Details for the file aletheia_cyber_core-1.4.0-py3-none-any.whl.
File metadata
- Download URL: aletheia_cyber_core-1.4.0-py3-none-any.whl
- Upload date:
- Size: 48.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1e303c9abaf62e9f25204dc0f3061796bc7f281cfa0455e404337c410aa8093a
|
|
| MD5 |
ad9884d5efba4e1d300aaf076aefd8fd
|
|
| BLAKE2b-256 |
97ad294bc880eabe8ea06398d79d3305755a0583bdb25babb6e5807b89ccbebc
|
Provenance
The following attestation bundles were made for aletheia_cyber_core-1.4.0-py3-none-any.whl:
Publisher:
release-version-sync.yml on holeyfield33-art/aletheia-core
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
aletheia_cyber_core-1.4.0-py3-none-any.whl -
Subject digest:
1e303c9abaf62e9f25204dc0f3061796bc7f281cfa0455e404337c410aa8093a - Sigstore transparency entry: 1236305785
- Sigstore integration time:
-
Permalink:
holeyfield33-art/aletheia-core@307aef0f2cbf7c3f36096dd36955f0250d3522b8 -
Branch / Tag:
refs/tags/v1.4.0 - Owner: https://github.com/holeyfield33-art
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-version-sync.yml@307aef0f2cbf7c3f36096dd36955f0250d3522b8 -
Trigger Event:
push
-
Statement type: