aletheia-cyber-core

Enterprise-grade System 2 security layer for autonomous AI agents. Protects against instruction smuggling, semantic camouflage, and supply-chain attacks.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Alethia

These details have not been verified by PyPI

Project links

Project description

Aletheia Core

Enterprise-Grade System 2 Security for AI Agents

Version Python License Tests Security Audit Status

📖 Documentation • GitHub •

The Problem

Autonomous AI agents increasingly manage CI/CD pipelines, financial transactions, and critical infrastructure. The LiteLLM supply-chain attack demonstrated that a single compromised dependency can silently exfiltrate credentials from thousands of production environments. Existing guardrails operate at the token level — they cannot detect semantically camouflaged instructions or verify policy integrity at runtime.

Aletheia provides a System 2 reasoning layer that interposes between AI agents and the actions they request. Every action is verified against a cryptographically signed policy manifest, analyzed for semantic similarity to known attack patterns, and logged with a tamper-evident audit receipt — before it is allowed to execute.

Security Guarantees

The following properties are cryptographically or architecturally enforced:

#	Guarantee	Mechanism
1	Tamper-proof policy manifest	Ed25519 detached signature verified before every policy load. Invalid or missing signature causes a hard crash (`ManifestTamperedError`).
2	Semantic intent veto	SentenceTransformer (`all-MiniLM-L6-v2`) cosine similarity against 50+ camouflage phrases. Configurable threshold (default 0.55).
3	Grey-zone escalation	Payloads in the ambiguous similarity band (0.40–0.55) are second-pass classified via keyword heuristics. Two or more high-risk keyword hits trigger a veto.
4	Action sandbox	Regex-based pattern scanner blocks subprocess exec, raw socket, `eval`, filesystem destruction, and privilege-escalation patterns before dispatch.
5	Daily alias rotation	Semantic alias phrase order is deterministically shuffled daily (HMAC-SHA256 seed from date + manifest hash + `ALETHEIA_ALIAS_SALT`) to prevent reverse-engineering via probing.
6	Embedding pre-warming	Model loaded eagerly at FastAPI startup to eliminate cold-start latency on the first request.
7	Audit trail integrity	Every decision produces a structured JSON log line and an HMAC-signed TMR receipt (decision + policy hash + payload_sha256 + action + origin + signature).
8	Input hardening	NFKC homoglyph collapse, zero-width character strip, recursive Base64 decode with 10x size bomb protection, and URL percent-encoding decode — all applied before any agent sees the payload.
9	Rate limiting	Sliding-window per-IP rate limiter. Distributed via
Upstash Redis when `UPSTASH_REDIS_REST_URL` is configured (survives restarts,
synchronizes across workers). Falls back to in-memory for single-node deployments.
10	No stack-trace leakage	Global FastAPI exception handler returns an opaque error in production mode. Version and mode never exposed to unauthenticated /health callers.
11	Config-driven defense modes	`active` / `shadow` / `monitor` — switchable via environment variable or `config.yaml` without code changes.
12	Receipt replay resistance	HMAC signature includes payload_sha256, action, and origin to prevent reuse across contexts.

Additional guarantees:

API Key Authentication — X-API-Key header required when ALETHEIA_API_KEYS is configured
Real Client IP — rate limiting derived from network layer, never from request body
Payload Privacy — audit logs store SHA-256 hash + length only; no plaintext content in active mode
Receipt Signing — HMAC receipts use ALETHEIA_RECEIPT_SECRET; must be set for active mode
Shadow Verdict Blocking — In shadow mode, blocks are logged but not exposed to clients

Key Features

Cryptographic Policy Integrity — Ed25519-signed security manifest; tamper triggers an instant hard veto
Semantic Intent Analysis — Cosine similarity replaces string matching; catches camouflaged fund transfers, privilege escalation, and data exfiltration
Grey-Zone Second-Pass Classifier — Keyword heuristics catch creative paraphrases that fall below the primary threshold
Action Sandbox — Pattern-based scanner blocks subprocess, eval, raw socket, and filesystem-destruction payloads
Polymorphic Defense — Config-driven deterministic rotation across LINEAGE, INTENT, and SKEPTIC modes
Structured Audit Trail — JSON-line logging with HMAC-signed TMR receipts on every decision
Rate Limiting — Sliding-window limiter (10 req/s per IP, configurable)
Input Hardening — Homoglyph normalization, Base64 and URL-encoding recursive decode, control-character strip
Daily Alias Rotation — Alias bank order shuffled deterministically per day to resist probing
Swarm-Resistant Triage — Scout agent clusters diversionary noise and prioritizes high-blast-radius threats

Quick Start

Install

pip install aletheia-cyber-core

Optional Consciousness Proximity Module

To enable the optional proximity feature set:

pip install -r requirements-proximity.txt
export CONSCIOUSNESS_PROXIMITY_ENABLED=true

The proximity module is gated behind CONSCIOUSNESS_PROXIMITY_ENABLED=true and includes optional runtime dependencies for governance monitoring and relay scoring.

Sign the manifest (required before first run)

python main.py sign-manifest

Run a local audit

python main.py

Start the API server

uvicorn bridge.fastapi_wrapper:app --host 0.0.0.0 --port 8000

Run the test suite

pytest tests/ -v --ignore=tests/test_api.py

Architecture

Aletheia operates via a tri-agent consensus model:

Incoming Request
│
├─ Input Hardening (NFKC, Base64, URL decode)
│
▼
┌─────────────────┐
│      Scout      │  Threat intelligence, swarm detection, IP scoring
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│    Nitpicker    │  Polymorphic intent analysis, lineage tracing,
│                 │  semantic blocked-pattern detection
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│      Judge      │  Manifest signature verification, policy veto,
│                 │  semantic alias veto, grey-zone escalation,
│                 │  action sandbox check
└────────┬────────┘
         │
    PROCEED / DENY
         │
         ▼
   Audit Log + TMR Receipt

API Reference

POST /v1/audit

Request:

{
  "payload": "string (max 10,000 chars)",
  "origin": "string (max 128 chars)",
  "action": "string — pattern: ^[A-Za-z0-9_\\-:.]+$",
  "client_ip_claim": "string (optional, audit/debug only — never used for enforcement)"
}

Response:

{
  "decision": "PROCEED | DENIED | RATE_LIMITED | SANDBOX_BLOCKED",
  "metadata": {
    "threat_level": "LOW | MEDIUM | HIGH | CRITICAL",
    "latency_ms": 14.0,
    "request_id": "a1b2c3d4e5f6g7h8",
    "client_id": "ALETHEIA_ENTERPRISE"
  },
  "receipt": {
    "decision": "PROCEED",
    "policy_hash": "sha256...",
    "payload_sha256": "sha256...",
    "action": "Read_Report",
    "origin": "trusted_admin",
    "signature": "hmac-sha256...",
    "issued_at": "ISO-8601"
  }
}

Note: shadow_verdict and redacted_payload are never returned to clients. client_ip_claim, if provided, is stored in the audit log for debugging only and is never used for enforcement.

GET /health

No auth required. Used by load balancers and uptime monitors.

{
  "status": "ok",
  "manifest_signature": "VALID",
  "uptime_seconds": 3600.0
}

status is "degraded" if manifest signature verification fails. version and mode are intentionally omitted to reduce information leakage.

Project Structure

aletheia-cyber-core/
├── agents/
│   ├── scout_v2.py          # Threat intelligence + swarm detection
│   ├── nitpicker_v2.py      # Polymorphic intent sanitization + embeddings
│   └── judge_v1.py          # Policy enforcement + semantic veto
├── bridge/
│   ├── fastapi_wrapper.py   # Production REST API (rate-limited, audited)
│   ├── config.py            # Legacy config shim
│   └── utils.py             # Input hardening (homoglyphs, Base64, URL)
├── core/
│   ├── config.py            # Centralized settings (env / yaml / defaults)
│   ├── embeddings.py        # Shared SentenceTransformer service
│   ├── audit.py             # Structured JSON logging + TMR receipts
│   ├── rate_limit.py        # Sliding-window rate limiter
│   └── sandbox.py           # Action sandbox pattern scanner
├── manifest/
│   ├── security_policy.json        # Ground truth veto rules
│   ├── security_policy.json.sig    # Ed25519 detached signature
│   ├── security_policy.ed25519.pub # Public verification key
│   └── signing.py           # Manifest signing and verification
├── tests/
│   ├── test_core.py         # Integration tests
│   ├── test_judge.py        # Judge unit + adversarial tests
│   ├── test_nitpicker.py    # Nitpicker unit + semantic tests
│   ├── test_enterprise.py   # Audit, rate-limit, hardening tests
│   ├── test_hardening.py    # Sandbox, grey-zone, rotation tests
│   └── test_proximity/      # Consciousness proximity module (84 tests)
├── simulations/             # Adversarial simulation scripts
├── main.py                  # CLI entry point
├── AGENTS.md                # Agent communication protocol
└── requirements.txt

Production Usage

Configuration

All settings are configurable via environment variables (prefixed ALETHEIA_) or config.yaml:

Setting	Env Var	Default	Description
`intent_threshold`	`ALETHEIA_INTENT_THRESHOLD`	`0.55`	Cosine similarity threshold for semantic veto
`grey_zone_lower`	`ALETHEIA_GREY_ZONE_LOWER`	`0.40`	Lower bound of the grey-zone escalation band
`rate_limit_per_second`	`ALETHEIA_RATE_LIMIT_PER_SECOND`	`10`	Max requests per second per IP
`mode`	`ALETHEIA_MODE`	`active`	Defense mode: `active`, `shadow`, or `monitor`
`log_level`	`ALETHEIA_LOG_LEVEL`	`INFO`	Logging verbosity
`audit_log_path`	`ALETHEIA_AUDIT_LOG_PATH`	`audit.log`	Path to the structured audit log

Known Limitations

Rate limiting: When UPSTASH_REDIS_REST_URL is configured, rate limiting is distributed across all workers and instances via Upstash Redis (sliding window, sorted set per IP). Without Upstash credentials, falls back to an in-memory limiter that resets on restart and does not synchronize across workers. Set UPSTASH_REDIS_REST_URL and UPSTASH_REDIS_REST_TOKEN for production deployments behind multiple workers.
Embedding model requires ~500 MB on disk. The all-MiniLM-L6-v2 model is downloaded on first use. Pre-pull in your Docker image build step.
Static alias bank. While daily rotation mitigates probing, a determined adversary with prolonged access could enumerate patterns. Consider supplementing with an LLM-based classifier for high-sensitivity deployments.
No runtime syscall interception. The action sandbox validates declared intents, not runtime behavior. Pair with OS-level sandboxing (seccomp, AppArmor) for defense in depth.

Support

If this project is useful to your organization, consider reaching out about our managed services and enterprise plans.

Environment Variables

Variable	Required	Description
`ALETHEIA_API_KEYS`	Production	Comma-separated API keys for `X-API-Key` auth. Unset = open mode.
`ALETHEIA_RECEIPT_SECRET`	YES (production)	HMAC secret for audit receipts. Service will NOT boot in active mode without this. Generate via `openssl rand -hex 32`.
`ALETHEIA_ALIAS_SALT`	RECOMMENDED	Salt for daily alias rotation. Prevents enumeration attacks. Generate via `openssl rand -hex 32`.
`ALETHEIA_MODE`	No	`active` (default), `shadow`, or `monitor`
`ALETHEIA_LOG_LEVEL`	No	`INFO` (default), `DEBUG`, `WARNING`
`ALETHEIA_RATE_LIMIT_PER_SECOND`	No	Requests per IP per second. Default: `10`
`UPSTASH_REDIS_REST_URL`	Recommended	Upstash Redis REST endpoint for distributed rate limiting. Falls back to in-memory if absent.
`UPSTASH_REDIS_REST_TOKEN`	Recommended	Upstash Redis REST token. Required when URL is set. Rotate immediately if exposed.
`CONSCIOUSNESS_PROXIMITY_ENABLED`	No	Enable proximity module. Default: `false`
`ALETHEIA_TRUSTED_PROXY_DEPTH`	No	Number of trusted reverse proxies in front of the service. Default: 1 (Render/Vercel). Set to 0 for direct.
`ALETHEIA_CORS_ORIGINS`	No	Comma-separated allowed CORS origins. Default: app.aletheia-core.com and aletheia-core.com

Pre-Launch Verification

Before starting the service in production, complete the following checklist:

1. Verify required secrets are set

# ALETHEIA_RECEIPT_SECRET is mandatory for active mode
if [ -z "$ALETHEIA_RECEIPT_SECRET" ]; then
  echo "ERROR: ALETHEIA_RECEIPT_SECRET not set"
  exit 1
fi

# ALETHEIA_ALIAS_SALT is recommended
if [ -z "$ALETHEIA_ALIAS_SALT" ]; then
  echo "WARNING: ALETHEIA_ALIAS_SALT not set — alias rotation is predictable"
fi

# ALETHEIA_API_KEYS must be set in active mode
if [ -z "$ALETHEIA_API_KEYS" ]; then
  echo "ERROR: ALETHEIA_API_KEYS not set — service will refuse to start in active mode"
  exit 1
fi
echo "ALETHEIA_API_KEYS length: ${#ALETHEIA_API_KEYS}"

2. Test the health endpoint

curl http://localhost:8000/health
# Expected response (v1.4.7+):
# {
#   "status": "ok",
#   "uptime_seconds": 3600,
#   "manifest_signature": "VALID"
# }
# Note: version and mode information removed to prevent information leakage

3. Verify receipt signing works

curl -X POST http://localhost:8000/v1/audit \
  -H "Content-Type: application/json" \
  -d '{
    "payload": "test payload",
    "origin": "admin",
    "action": "Read_Report"
  }'
# Response must include "signature" field with non-empty HMAC-SHA256 hex string
# DO NOT use UNSIGNED_DEV_MODE in production

4. Confirm shadow mode does not leak verdicts

ALETHEIA_MODE=shadow uvicorn bridge.fastapi_wrapper:app --port 8000 &
sleep 2

curl -X POST http://localhost:8000/v1/audit \
  -H "Content-Type: application/json" \
  -d '{"payload": "transfer funds", "origin": "user", "action": "Transfer_Funds"}'
# Response MUST NOT contain "shadow_verdict" field (even though action is blocked internally)

Production Launch Command

# Generate secure secrets
ALETHEIA_RECEIPT_SECRET=$(openssl rand -hex 32)
ALETHEIA_ALIAS_SALT=$(openssl rand -hex 32)

# Start in active mode
ALETHEIA_MODE=active \
ALETHEIA_RECEIPT_SECRET="$ALETHEIA_RECEIPT_SECRET" \
ALETHEIA_ALIAS_SALT="$ALETHEIA_ALIAS_SALT" \
uvicorn bridge.fastapi_wrapper:app --host 0.0.0.0 --port 8000

Deployment Checklist

Before going live in active mode, verify all of the following:

#	Check	Command
1	Manifest is signed	`python main.py sign-manifest`
2	`ALETHEIA_RECEIPT_SECRET` is set (≥ 32 chars)	`echo ${#ALETHEIA_RECEIPT_SECRET}`
3	`ALETHEIA_ALIAS_SALT` is set	`echo ${#ALETHEIA_ALIAS_SALT}`
4	Health endpoint returns `"status":"ok"`	`curl http://localhost:8000/health`
5	Receipt signature is not `UNSIGNED_DEV_MODE`	Inspect `signature` field in `/v1/audit` response
6	Tests pass	`pytest tests/ --ignore=tests/test_api.py -q`
7	Private key is NOT in Docker image	`docker run --rm <image> ls /app/manifest/*.key` — must error

Required environment variables:

Variable	Required	Min Length	Notes
`ALETHEIA_RECEIPT_SECRET`	YES (active mode)	32 chars	Generate: `openssl rand -hex 32`
`ALETHEIA_ALIAS_SALT`	RECOMMENDED	32 chars	Generate: `openssl rand -hex 32`
`ALETHEIA_API_KEYS`	RECOMMENDED	—	Comma-separated. Unset = open mode.
`ALETHEIA_MODE`	No	—	`active` (default), `shadow`, `monitor`
`ALETHEIA_RATE_LIMIT_PER_SECOND`	No	—	Default: `10`
`ALETHEIA_LOG_LEVEL`	No	—	Default: `INFO`
`ALETHEIA_AUDIT_LOG_PATH`	No	—	Default: `audit.log`

Architecture Decision Records

ADR-001: In-memory rate limiting only Rate limiting is intentionally in-memory and per-process. Adding Redis would introduce a hard runtime dependency that breaks single-binary deployments, adds operational complexity, and is unnecessary for the target deployment model (single-host, proxy-fronted). For horizontal scaling, place a rate-limiting proxy (nginx, Cloudflare, Traefik) in front.

ADR-002: Threat score discretisation Raw cosine-similarity floats and threat scores are never returned to clients. Returning exact values would let an attacker black-box calibrate their payload against the exact veto threshold. Only discretised bands (LOW / MEDIUM / HIGH / CRITICAL) are exposed.

ADR-003: Startup rejection without RECEIPT_SECRET The service sys.exit(1) in active mode without ALETHEIA_RECEIPT_SECRET. An unsigned receipt (UNSIGNED_DEV_MODE) in production would allow an attacker to forge receipts, breaking audit trail integrity. Hard refusal is preferable to degraded operation.

ADR-004: Ed25519 for manifest signing Ed25519 was chosen over RSA for manifest signing: smaller keys, faster verification, no padding oracle attacks, and deterministic signatures. The public key ships with the package; the private key never leaves the operator's control.

Performance Characteristics

Measured on a 2-core VM with the all-MiniLM-L6-v2 model pre-warmed:

Metric	Value	Notes
Cold-start (model load)	~3–8 s	Model downloaded on first use if not cached
Warm-start (subsequent requests)	~12–40 ms p99	Embedding encode dominates
Sandbox check only	< 1 ms	Pure regex, no model
Memory footprint	~500 MB	Dominated by PyTorch + model weights
Rate limit overhead	< 0.1 ms	In-memory list operations with threading.Lock

The embedding model is loaded eagerly at startup (warm_up()) to eliminate cold-start latency on the first production request.

Contributing

See CONTRIBUTING.md for guidelines on submitting issues and pull requests.

Security

To report a vulnerability, see SECURITY.md.

License

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Alethia

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.9.3

May 15, 2026

1.9.2

Apr 25, 2026

1.6.2

Apr 10, 2026

1.6.1

Apr 9, 2026

1.6.0

Apr 8, 2026

1.5.3

Apr 8, 2026

1.5.2

Apr 7, 2026

1.5.1

Apr 7, 2026

This version

1.5.0

Apr 7, 2026

1.4.2

Apr 4, 2026

1.4.1

Apr 4, 2026

1.4.0

Apr 4, 2026

1.3.0

Apr 4, 2026

1.2.1

Mar 24, 2026

1.0.0

Mar 17, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aletheia_cyber_core-1.5.0.tar.gz (95.0 kB view details)

Uploaded Apr 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

aletheia_cyber_core-1.5.0-py3-none-any.whl (56.3 kB view details)

Uploaded Apr 7, 2026 Python 3

File details

Details for the file aletheia_cyber_core-1.5.0.tar.gz.

File metadata

Download URL: aletheia_cyber_core-1.5.0.tar.gz
Upload date: Apr 7, 2026
Size: 95.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aletheia_cyber_core-1.5.0.tar.gz
Algorithm	Hash digest
SHA256	`a80d198e9882f7f96fa04f97f73cbbce422d6fe00cb777eff7e8642a9509ce4e`
MD5	`ccdc71580d7b7c00db8644707fdefd36`
BLAKE2b-256	`60c882330430892f34bd0e4df45dbe225998a48772b0f85eb38b61336e24be26`

See more details on using hashes here.

Provenance

The following attestation bundles were made for aletheia_cyber_core-1.5.0.tar.gz:

Publisher: release-version-sync.yml on holeyfield33-art/aletheia-core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: aletheia_cyber_core-1.5.0.tar.gz
- Subject digest: a80d198e9882f7f96fa04f97f73cbbce422d6fe00cb777eff7e8642a9509ce4e
- Sigstore transparency entry: 1247748848
- Sigstore integration time: Apr 7, 2026
Source repository:
- Permalink: holeyfield33-art/aletheia-core@11a52a8f427925f9584994d215bd41dc0f91bb36
- Branch / Tag: refs/tags/v1.5.0
- Owner: https://github.com/holeyfield33-art
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release-version-sync.yml@11a52a8f427925f9584994d215bd41dc0f91bb36
- Trigger Event: push

File details

Details for the file aletheia_cyber_core-1.5.0-py3-none-any.whl.

File metadata

Download URL: aletheia_cyber_core-1.5.0-py3-none-any.whl
Upload date: Apr 7, 2026
Size: 56.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aletheia_cyber_core-1.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f870cd17398880b01d142fbc674037b26f803ed4a52f9696a7a290813e266da7`
MD5	`77868fb53a908f87a3b90ca7f09159ff`
BLAKE2b-256	`48fd412a2dbd89d5e0985387c098b50712254e7866e628306ae1dd092f718152`

See more details on using hashes here.

Provenance

The following attestation bundles were made for aletheia_cyber_core-1.5.0-py3-none-any.whl:

Publisher: release-version-sync.yml on holeyfield33-art/aletheia-core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: aletheia_cyber_core-1.5.0-py3-none-any.whl
- Subject digest: f870cd17398880b01d142fbc674037b26f803ed4a52f9696a7a290813e266da7
- Sigstore transparency entry: 1247748885
- Sigstore integration time: Apr 7, 2026
Source repository:
- Permalink: holeyfield33-art/aletheia-core@11a52a8f427925f9584994d215bd41dc0f91bb36
- Branch / Tag: refs/tags/v1.5.0
- Owner: https://github.com/holeyfield33-art
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release-version-sync.yml@11a52a8f427925f9584994d215bd41dc0f91bb36
- Trigger Event: push

aletheia-cyber-core 1.5.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Aletheia Core

The Problem

Security Guarantees

Key Features

Quick Start

Install

Optional Consciousness Proximity Module

Sign the manifest (required before first run)

Run a local audit

Start the API server

Run the test suite

Architecture

API Reference

Project Structure

Production Usage

Configuration

Known Limitations

Support

Environment Variables

Pre-Launch Verification

1. Verify required secrets are set

2. Test the health endpoint

3. Verify receipt signing works

4. Confirm shadow mode does not leak verdicts

Production Launch Command

Deployment Checklist

Architecture Decision Records

Performance Characteristics

Contributing

Security

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance