
Aegis ATV — Agent Telemetry Vector. Action firewall + cryptographic audit chain + ContextMemory analytics for AI agents.


Aegis ATV — Agent Telemetry Vector


Every Claude Code tool call gets a cryptographic audit line. SHA3-chained, Ed25519-signed, on local disk only. Tamper-evident with one CLI command. Plus a 16-step ATV-2080-v1 firewall that BLOCKs / requires approval / ALLOWs before the tool runs. 0 cloud calls by default, ~5-minute install.

Three release tracks

Aegis ATV ships in three release tracks. Pick the one that matches your environment:

Track Status Best for Install
🟢 Claude Code GA (v0.1.0) Solo developer using Claude Code daily aegis install --target claude-code
🟢 OpenClaw + Cloud LLM GA (v0.3.0) Multi-channel agent ops (Telegram/Discord/Slack), provider flexibility aegis install --target openclaw-cloud + npm install @happyikas/openclaw-plugin-aegis
🟢 OpenClaw + Local OSS LLM GA (v0.3.0) Air-gapped, data-residency-critical, regulated industries aegis install --target openclaw-local + npm install @happyikas/openclaw-plugin-aegis

Detailed track docs (Korean): docs/releases/ — 1-page decision matrix + per-track manual.

The npm package is published as @happyikas/openclaw-plugin-aegis@0.3.0 and installs via the default latest dist-tag (no @preview tag needed).

Five named features (work across all tracks)

Aegis is packaged around five named features built on the same firewall + audit chain:

Feature What it does Run
🏋️ ATV Coach Learns your environment's normal/anomaly distribution (5-layer × 4-phase burn-in) and feeds it into the sLLM judge + RAG step340 aegis coach burnin retrain, aegis coach case-memory build
📊 ATV Live Real-time agent monitoring across cost / performance / security aegis live, aegis report, aegis cost summary
🔧 ATV Doctor Diagnoses + advises + rolls back when the agent is in trouble (or about to be) aegis doctor, aegis forensic last, aegis advise, aegis rollback <trace>
🧠 ATV Memory Reads recent BLOCK / advise events from ContextMemory and proposes concrete CLAUDE.md edits aegis memory claude-md, aegis memory show
🛡️ ATV Guard Hookify-style natural-language rules → regex auto-proposal + markdown storage aegis guard add, aegis guard test, aegis guard import

v0.5+ uses operator-vocabulary canonical names (live / coach / guard / memory / doctor). Older command names (dashboard / burnin / rule / case-memory / advisor-calibration / fleet-monitor) keep working as aliases — see aegis --help for the full index.

User manuals (Korean is the canonical version): docs/USER_GUIDE.ko.md (integrated guide for non-experts) · docs/manuals/ (deep per-feature reference)

Why this exists

Claude Code's built-in --allowedTools / --dangerously-skip-permissions are binary toggles with no audit trail. Aegis adds three things they don't:

  1. Cryptographic audit chain — every decision is appended to ~/.aegis/audit.jsonl with an Ed25519 signature and SHA3-256 prev/this hash. aegis verify-audit walks the chain in one command — detects edits, deletes, re-orderings.
  2. Per-call structured risk scoring — 31 detection rules + 6 incident playbooks running over a 30-subfield ATV-2080-v1 vector representation, with sLLM judge for grey-zone calls (Solo Free: dummy/local; opt-in: Anthropic Haiku).
  3. Cost / loop / instruction-drift gates — catches runaway agents, redundant retry loops, and tampered CLAUDE.md / .mcp.json baseline before the tool runs, not after the bill.
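The hash chain in point 1 is easy to picture. A minimal sketch of the prev_hash/this_hash idea (the field names appear in the comparison table further down; the record layout and canonical-JSON choice here are illustrative assumptions, not the shipped format):

```python
import hashlib
import json

GENESIS = "0" * 64

def record_hash(record: dict) -> str:
    # Hash everything except the record's own this_hash (canonical JSON assumed).
    body = {k: v for k, v in record.items() if k != "this_hash"}
    return hashlib.sha3_256(json.dumps(body, sort_keys=True).encode()).hexdigest()

def append(chain: list, payload: dict) -> None:
    rec = dict(payload, prev_hash=chain[-1]["this_hash"] if chain else GENESIS)
    rec["this_hash"] = record_hash(rec)
    chain.append(rec)

def verify(chain: list) -> bool:
    prev = GENESIS
    for rec in chain:
        if rec["prev_hash"] != prev or rec["this_hash"] != record_hash(rec):
            return False  # an edit, delete, or re-order breaks the link here
        prev = rec["this_hash"]
    return True
```

Because each record commits to its predecessor, silently editing or deleting a line invalidates every later link — which is why a single forward walk suffices for tamper detection.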

For regulated industries (finance, healthcare, government) where AI coding assistants are blocked by audit requirements, the cryptographic decision log turns "we have logs of what the AI did" into "we have signed proof that the AI did exactly this and nothing more."

Side-by-side: Aegis vs Claude Code's built-in flags

Capability Claude Code built-in Aegis
Tool allow / deny --allowedTools / --disallowedTools — static list, set per session 16-step pipeline + 31 detection rules + sLLM judge for grey-zone calls — dynamic, per-call
Bypass switch --dangerously-skip-permissions — binary on/off Profile-aware (--profile {free,pro,cloud}) — different intelligence tier per use case
Audit log Session log (plain text, mutable) ~/.aegis/audit.jsonl with SHA3-256 prev_hash/this_hash chain + opt-in Ed25519 signatures
Tamper detection None — text log can be edited or deleted silently aegis verify-audit — one command, walks chain, exits non-zero on any mutation
Forensic timeline Manual grep over session transcripts aegis forensic <session_id> — chronological per-call timeline with step trace + advisor signals
Cost / runaway gate None step335 cost gate (AEGIS_TOKEN_BUDGET) + step336 loop detector (3× same call → REQUIRE_APPROVAL)
Instruction-drift detection None step309 — SHA3 baseline of CLAUDE.md / AGENTS.md / .mcp.json / plugin manifests; any drift BLOCKs every call until reattested
Per-call risk scoring None 30-subfield ATV-2080-v1 vector + M13 attribution head — top-3 contributing signals stamped into every audit record
Live recommendations None aegis advise — cost / performance / security advisor surface (--profile pro/cloud activates the 8-advisor pipeline)
Compliance-grade evidence "Logs exist somewhere" "Signed cryptographic proof, exportable, externally verifiable"
External reach when running None (Claude API only) None by default (Solo Free contract); opt-in Anthropic Haiku via --profile cloud
Install / uninstall Native, instant aegis install --mode local (~5 min, idempotent) + aegis uninstall (preserves user-owned hooks) + --rescue
Open source Closed (Anthropic) Apache-2.0, full source github.com/happyikas/Aegis-ATV

The built-in flags solve "is this tool allowed to run?" Aegis solves "what did the agent actually do, can we prove it later, and how do we catch the bad cases the allowlist missed?"

Both are useful — Aegis is layered on top, not a replacement. The two work together: keep --allowedTools for coarse permissions, add Aegis for the cryptographic audit + structured per-call scoring + cost / drift / loop gates that the binary flags can't express.

Verify integrity (the differentiating feature)

Two layers, both runnable any time without network:

# 1) Hash chain — default, no key required.
#    Detects any post-write mutation of ~/.aegis/audit.jsonl.
uv run aegis verify-audit
#   ✓ verify-audit (local chain) — 5,583 records intact
#   signing pubkey: not configured

# 2) Optional Ed25519 signing — opt-in, one-shot setup.
#    Without the private key, the chain cannot be re-computed forward
#    from a tampered point.  Recommended for any audit log you intend
#    to share, archive, or use as compliance evidence.
uv run aegis audit-key init      # generate ~/.aegis/keys/audit.ed25519{,.pub}
uv run aegis verify-audit
#   ✓ verify-audit (local chain) — 6 records intact
#   signing pubkey: loaded — signed records were also Ed25519-verified

# 3) Share the public fingerprint so others can verify without your machine:
uv run aegis audit-key show
#   fingerprint: f2a17931406e4f56
#   pub:         ~/.aegis/keys/audit.ed25519.pub

Real-session output is checked in under docs/launch/dogfooding/ — captured against an actual 5,583-record ~/.aegis/audit.jsonl, not synthetic.

What you get in 5 minutes

Pick whichever fits your setup — all three end at the same place:

# Option A: source clone (full dev environment)
git clone https://github.com/happyikas/Aegis-ATV.git && cd Aegis-ATV
uv sync                              # ~30s
uv run aegis install --mode local    # patches ~/.claude/settings.json

# Option B: one-liner installer (no manual clone)
curl -LsSf https://raw.githubusercontent.com/happyikas/Aegis-ATV/main/scripts/install.sh | bash

# Option C: Homebrew tap (macOS / Linuxbrew)
brew tap happyikas/aegis https://github.com/happyikas/Aegis-ATV.git
brew install happyikas/aegis/aegis
aegis install --mode local

# Then restart Claude Code. Done.

Now every tool call Claude Code makes goes through Aegis first. Try a destructive command in your next session — it gets BLOCKed, with a signed audit line:

⛔ BLOCK  Bash  trace=ebf0c92d  (165 ms)
   reason: dangerous pattern: <step310 regex>
   advise: [HIGH] security-reviewer — Block until reviewer ACKs

5-minute walkthrough: docs/PERSONAL_QUICKSTART.md.

What gets caught (highlights)

  • Filesystem destructive — recursive purge against system paths (/var, /home, /).
  • VCS destructive — force-push to main/master/prod/release, force-delete protected branches.
  • Cloud destructive — Kubernetes delete, Terraform destroy, Helm uninstall, AWS IAM/EC2/S3 mutation, GCP/Azure resource removal.
  • SQL destructive — drop-table on production, unbounded delete (no WHERE).
  • Sandbox escape — privileged Docker, capability adds, nsenter, chroot, mount --bind.
  • Prompt injection — "ignore previous instructions", [INST] system, MCP-injection patterns.
  • Sensitive paths — cloud credentials (~/.aws/credentials), SSH private keys, system password files, .env.
  • Loop / runaway cost — same call 3× → REQUIRE_APPROVAL.
  • Instruction drift — CLAUDE.md / .mcp.json / plugin manifest tampering.

→ Full catalog: policies/rag_corpus/rules.jsonl (31 rules + 6 incident playbooks).
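The loop / runaway-cost gate in particular is simple to picture. A toy version of the 3×-same-call rule (the call-identity key and threshold handling here are assumptions; the real detector is the step336 loop gate):

```python
from collections import Counter

def loop_gate(history, tool, args, threshold=3):
    """Return REQUIRE_APPROVAL once the identical call would be seen `threshold` times."""
    key = (tool, tuple(sorted(args.items())))
    seen = Counter((t, tuple(sorted(a.items()))) for t, a in history)
    return "REQUIRE_APPROVAL" if seen[key] + 1 >= threshold else "ALLOW"
```

The point of the gate is that a retry loop stops costing tokens on its third identical attempt rather than after the bill arrives.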

Solo Free contract — no data leaves your laptop

Aspect Default install Optional opt-in
Where Aegis runs 100% your laptop same
Tool inputs / files / commands never leave the machine same
sLLM judge local rules (dummy) --judge haiku calls Anthropic API
Embeddings SHA3 (dummy) --embedding bge-local (one-time GGUF download, then local)
Audit log ~/.aegis/audit.jsonl same

→ The default install makes 0 outbound network requests. Verify it yourself with tcpdump / Little Snitch while Claude Code runs.

After it's installed

uv run aegis report                # 5-line risk summary of recent activity
uv run aegis verify-audit          # cryptographic chain check (detects any tampering)
uv run aegis forensic last         # postmortem timeline of the most-recent session
uv run aegis advise                # live cost / performance / security advice (--profile pro/cloud)
uv run aegis policy diff --since 7d  # what rules / playbooks / baselines changed
uv run aegis pull-model --recommend  # upgrade path to Phi-3.5-mini / Haiku
uv run python -m demo.macmini all  # 100-case self-validation

Modes

uv run aegis install --mode local      # Solo Free in-process hook (no service)
uv run aegis install --mode sidecar    # multi-tenant FastAPI + Postgres + Redis (Enterprise)

Local-mode intelligence tiers (--profile)

# free   (default): dummy embedding + dummy judge, advisor OFF, 0 cloud, 0 model files
uv run aegis install --mode local --profile free

# pro    : bge-local embedding + hybrid (M13+local-phi) judge + advisor ON; 0 cloud,
#          ~700 MB GGUF auto-downloaded on install
uv run aegis install --mode local --profile pro

# cloud  : pro stack + Anthropic Haiku judge for grey-zone calls;
#          requires ANTHROPIC_API_KEY in shell profile
uv run aegis install --mode local --profile cloud

Explicit --judge / --embedding override the profile baseline — useful for pinning a specific tier (e.g., --profile pro --judge dummy keeps the pro embedding + advisor but uses keyword-only judge).

  • Safe Auto-Run — known-safe ops (Read/Grep/Glob, ls, pytest, ruff, git status) skip the sLLM judge — <5 ms median.
  • 12/12 known incident classes BLOCK, and cloud destructive patterns (kubectl delete / terraform destroy / aws iam / unbounded DELETE) are caught at step311.
  • Loop & Redundant Call Saver — same call 3× → REQUIRE_APPROVAL; read-only repeats deduped and surfaced in aegis report.
  • Poisoned Instruction Detector — CLAUDE.md / AGENTS.md / .mcp.json / plugin & skill manifest hashes baselined; any drift BLOCKs every subsequent PreToolUse until reviewed.
  • Local-mode signed audit chain — SHA3-256 prev_hash / this_hash per line; aegis verify-audit catches mutations and re-orderings.
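The Poisoned Instruction Detector's core is just a hash baseline over governed files. A self-contained sketch (the file set and in-memory storage are illustrative; the real baseline covers CLAUDE.md / AGENTS.md / .mcp.json and plugin manifests and persists across sessions):

```python
import hashlib
from pathlib import Path

def baseline(paths):
    # Record a SHA3-256 fingerprint per governed file.
    return {str(p): hashlib.sha3_256(Path(p).read_bytes()).hexdigest() for p in paths}

def drifted(paths, base):
    # Any file whose current hash differs from the baseline would trigger a BLOCK.
    return [str(p) for p in paths
            if hashlib.sha3_256(Path(p).read_bytes()).hexdigest() != base.get(str(p))]
```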

  • 📖 v2.2 user manual (Korean) — installation, CLI reference, scenarios, troubleshooting; 14 sections.
  • 🧪 Mac mini 90-case validation manual (Korean) — deterministic validation of 8 advisors / 11 verbs via python -m demo.macmini.
  • 🗺️ ATV-2080-v1 structure diagram — 30 subfields × 2,080 float32 + the 16-step firewall pipeline. Source: docs/diagrams/draw_atv_2080_v1.py.
  • 🎯 10-minute live demo: docs/RUNBOOK.md.
  • 📋 Changelog: CHANGELOG.md.


What's in the box

# Milestone Module Patent
M1–M7 Original MVP aegis.firewall.step{310,320,330,335,340} · aegis.judge.haiku · aegis.sign.ed25519 · aegis.audit.{sqlite,jsonl}_store · aegis.attest.code_attestation Claims 1, 2 (partial), 17, 23
M8 ATV-2080-v1 30-subfield schema aegis.schema (30 named slices, CostEfficiencyMetrics 16 slots) · aegis.atv.builder (19 SW encoders + HW band zero-fill) Appendix A, Claims 6, 7, 9, 24
M9 Firewall split 350/360/370 aegis.firewall.step350_approval · step360_audit · step370_exec Claims 2, 16
M10 ATMU + Write-Ahead Intent Log + 2PC aegis.atmu.{state_machine, intent_log, checkpoint, compensating} · POST /tool-outcome Claims 2, 15
M11 5-layer Burn-in × 4-phase graduation aegis.burnin.{phases, controller} · GET /burnin-status Claims 4, 13, 14, 19, 20
M12 Cost Attestation Ledger (separate key) + 3 divergence aegis.cost.{model_flops, divergence, escalation, ledger} · GET /cost-attestation/{aid} Claims 3, 26, 27, 30, 33, 34
M13 sLLM attribution head aegis.judge.haiku (30-subfield contribution scores) · step340 trace shows top-3 Claims 8, 11
M14 AID auth + per-AID circuit breaker aegis.firewall.{step315_aid_auth, circuit_breaker} · policies/aid_region.json · aegis.api.admin_aid Claim 5B (¶[0063L]–[0063M])
M15 AES-256-GCM encrypted journal + forensic replay aegis.audit.{encrypted_journal, replay} · aegis.api.replay (GET /forensic/replay) §13B, ¶[0102G-1]
M16 Hierarchical Agent Memory L3+L4 aegis.ham.store · aegis.api.ham (7 endpoints) §13A, ¶[0102C]

The hardware band (200-D, indices 1880..2079) is intentionally zero-filled in T2 per patent ¶[0042] — that's the T3 work.
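The slice arithmetic above can be checked directly. A sketch of the T2 layout (only the band boundaries come from this README — 1880..2079 here, 2044..2059 and 2060..2079 in the T3 section; the SW-feature packing is a placeholder, not the real encoder order):

```python
ATV_DIM = 2080
HW_BAND = slice(1880, 2080)              # 200-D hardware band, zero-filled in T2
HW_COST_ATTESTATION = slice(2044, 2060)  # subfield range 2044..2059 (inclusive)
LINKAGE_CONSISTENCY = slice(2060, 2080)  # subfield range 2060..2079 (inclusive)

def build_atv_stub(sw_features):
    assert len(sw_features) <= HW_BAND.start, "SW encoders must not spill into the HW band"
    vec = [0.0] * ATV_DIM
    vec[: len(sw_features)] = sw_features  # placeholder packing for the SW bands
    return vec                              # HW band stays zero per the T2 contract
```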

Endpoints

Method Path Returns Milestone
GET /healthz {ok, version, burn_in_id} M1
POST /evaluate Verdict (decision, reason, atv_id, signature, step_traces) M1–M14
POST /approve {ok, atv_id, head} M1
POST /tool-outcome {ok, record_id, current_state, tool_outcome} M10
GET /audit/{aid} {aid, head, length, chain_valid, chain} M1
GET /attestation code-attestation L3/L4/L5 + Ed25519 signature M7
GET /burnin-status per-layer phase + samples + TPR/FPR/precision M11
POST /burnin/graduate {ok, layer_key, reason} (409 if gates fail) M11
POST /burnin/label {ok} M11
GET /cost-attestation/{aid} per-AID Cost Attestation Records (separately signed) M12
GET /cost-attestation/by-tenant/{tenant_id} tenant-scoped ledger view M12
GET /admin/aid quarantined AIDs list M14
GET /admin/aid/{aid} full violation history for one AID M14
POST /admin/aid/release manual release (requires X-Aegis-Admin-Token header) M14
GET /forensic/replay walk encrypted journal, decrypt, per-AID chain validity M15
POST /ham/memory store an AID-bound encrypted item M16
POST /ham/recall retrieve by aid + tenant + tag filter M16
POST /ham/context assemble bundle of N most-recent items M16
POST /ham/forget tombstone an object (idempotent) M16
POST /ham/summarize counts + tag histogram M16
POST /ham/ground bind a claim to N memory references (returns SHA3 claim_hash) M16
GET /ham/stats total/live/tombstoned counts M16
GET / web dashboard (single-page)
GET /theater ATV Theater (band visualizer)
GET /source dashboard "Source-code paths" panel

Quick start

# 1. Install deps (downloads Python 3.11+ if missing)
uv sync

# 2. Run the test suite (326 tests)
uv run pytest -q

# 3. Lint + typecheck
uv run ruff check . && uv run mypy src

# 4. Boot the service
uv run uvicorn aegis.main:app --reload --port 8000

# 5. In a second shell — run the full demo
uv run python -m demo.agent_demo

The demo runs the original 5-call scenario (ALLOW/BLOCK/APPROVAL mix) followed by three extension scenarios that exercise every M8–M16 endpoint:

=== M14: AID circuit breaker (aid=breaker-demo-…, role=read-only-role) ===
  violation 1/3 -> BLOCK: ... write_file; violations=1/3
  violation 2/3 -> BLOCK: ... write_file; violations=2/3
  violation 3/3 -> BLOCK: ... write_file; violations=3/3
  post-quarantine read_file -> BLOCK: AID … is quarantined — admin release required
  /admin/aid lists 1 quarantined AID(s).
  /admin/aid/release ok -> status=normal
  post-release read_file -> ALLOW: all firewall steps passed

=== M16: Hierarchical Agent Memory (aid=ham-demo-…) ===
  memory  -> object_id=…  seq=1
  memory  -> object_id=…  seq=2
  memory  -> object_id=…  seq=3
  recall(tags=['report']) -> 2 items
  context -> bundle of 3 items
  ground  -> bound=2 missing=1 claim_hash=630b056b3d6defc2…
  forget  -> ok=True
  summarize -> live=2 tag_hist={'calendar': 1, 'report': 1, 'customer': 1}

=== M15: Forensic replay (/forensic/replay) ===
  decrypted     = 10
  tampered      = 0
  aids touched  = 2
  chains valid  = 2/2

Set AEGIS_DEMO_SKIP_EXTRAS=1 to run only the original 5-call scenario.

One-shot helper

./demo/run_scenario.sh

Brings the service up via docker compose if available, otherwise via uv run uvicorn, waits for /healthz, then runs the demo.

Docker

docker compose up --build

The compose file provisions persistent volumes for the audit DB, ATMU intent log, encrypted journal, HAM store, and signing keys (distinct keys for telemetry vs. cost attestation per Claim 34). Verified end-to-end with OrbStack on macOS.

Use as a Claude Code firewall

tools/aegis_hook.py is a PreToolUse hook that fires before every tool call inside Claude Code, asks the running Aegis service for a verdict, and short-circuits the tool with stderr if blocked. See tools/README.md for install + tool mapping.
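The control flow of such a hook is small. A sketch only — the stdin field names and the /evaluate request shape are assumptions here; the shipped hook is tools/aegis_hook.py:

```python
import json
import sys
import urllib.request

def decide(verdict: dict) -> int:
    # Non-zero exit short-circuits the tool; stderr carries the reason.
    return 2 if verdict.get("decision") == "BLOCK" else 0

def main() -> int:
    call = json.load(sys.stdin)                  # tool call handed over by Claude Code
    body = json.dumps({"tool": call.get("tool_name"),
                       "args": call.get("tool_input", {})}).encode()
    req = urllib.request.Request("http://localhost:8000/evaluate", data=body,
                                 headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:    # ask the running Aegis service
        verdict = json.load(resp)
    if decide(verdict):
        print(f"Aegis BLOCK: {verdict.get('reason')}", file=sys.stderr)
    return decide(verdict)
```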

Compose with an LLM gateway (OpenRouter)

If you route LLM calls through OpenRouter, Aegis composes naturally — they sit at different layers:

  • OpenRouter: 300+ models × 60+ providers behind one API. Decides which model handles this prompt.
  • Aegis: 16-step firewall + cryptographic audit chain on every tool call. Decides: is this resulting action safe?

The aegis.integrations.openrouter Python helper stamps the served-provider name (from OpenRouter's provider_responses[]) into Aegis's provider field — so aegis report --by-provider cross-groups by actual provider (openrouter:anthropic-claude-sonnet-4 vs openrouter:openai-gpt-4o), and the provider-drift advisor can flag "this provider's BLOCK rate diverges 3× from the cross-provider median" — a check that's only meaningful when multiple providers serve the same prompt.
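For illustration, the provider stamp can be derived like this (the provider_responses[] entry shape is an assumption about OpenRouter's response metadata, not something verified here):

```python
def provider_tag(provider_responses):
    """Collapse OpenRouter's served-provider info into Aegis's provider field."""
    last = provider_responses[-1]  # assumed keys: 'provider' and 'model'
    return f"openrouter:{last['provider']}-{last['model']}".lower().replace("/", "-")
```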

See docs/integrations/openrouter.md for the 3-layer stack figure, code snippets, and honest scope.


Documentation

Roadmap

ROADMAP.md tracks what's in flight, what's next, and what's recently shipped — mirrored from the GitHub issues + PRs surface so a public reader has a stable URL to watch.

Getting started

Doc What's in it
docs/QUICKSTART.md 60-second path: install → boot → first verdict → first chain
docs/ARCHITECTURE.md Per-milestone surface tour with file pointers, data flow diagrams, and patent-claim cross-references
docs/OPERATIONS.md Production runbook: env vars, key rotation, AID admin, journal forensics, backup/restore
docs/T3_BOUNDARY.md T2 → T3 substitution boundary — exactly what changes (additive only) when implementing the hardware tier
docs/DOGFOOD.md Dogfood report Phase A — Aegis hook installed against an actual Claude Code session, 28 calls, 5 BLOCKs, 20 REQUIRE_APPROVALs, with TP/FP/FN taxonomy and 5 concrete code-change recommendations
docs/DOGFOOD_PHASE_B.md Dogfood report Phase B — same 10-call battery rerun against the post-Recommendations firewall. 4 stricter, 1 softer, 0 regressions; 71% noise floor eliminated; all 3 false negatives closed
PLAN_v2.md T2 patent-aligned re-plan (M8–M16) + claim coverage matrix
PLAN_v3.md T3 hardware tier design (M17–M26) — TEE attestation, ML-DSA dual-sign, FPGA judge, CSD integration
SESSION_HANDOFF.md ★ State snapshot for a new chat window / new contributor — milestones, directories, commands, tricks, and options all in one file. Reading this file + CLAUDE.md + the README is enough to start a new session
WHITEPAPER.md (Korean) Technical whitepaper — markdown source — 11 sections, ~1,300 lines (market / incidents / technology / 7 incident-response scenarios / MVP / POC / demo / pitch / GTM / C-level)
docs/build/WHITEPAPER.pdf (Korean) Technical whitepaper — PDF (49 pages, ~2.1 MB) — A4 portrait, designed cover + 11 sections + 4 appendices (including Appendix D — actual execution results of the 7 incident scenarios, 9 pages). Regenerate with bash tools/whitepaper/build_pdf.sh
docs/build/PITCH_DECK.pdf (Korean) Investor pitch deck — PDF (13 slides, A4 landscape, ~900 KB) — Cover · Problem · Why now · Incident cases · Solution · Differentiation · Proof · Market · Business model · GTM · Roadmap · Team · Closing. Regenerate with bash tools/deck/build_pdf.sh
demo/scenarios/ 7 incident-response scenarios, runnable — reproduces scenarios A–G from whitepaper §5 as bash scripts with automatic PASS/FAIL verification. bash demo/scenarios/run_all.sh
SETUP_MACMINI.md Mac mini bootstrap for 24/7 Claude Code firewall use
tools/README.md Claude Code hook install + 10-case smoke test

Recording / launch kit

Doc What's in it
demo/recording/README.md Pre-rendered media kit — GIF, asciinema cast, 9 dashboard screenshots, two TTS voiceover tracks
docs/DEMO.md Click-by-click playbook — 90-second elevator + 5-minute deep-dive
docs/RECORDING_KIT.md Live-recording prep — three teleprompter scripts (60s / 90s / 5min), OBS scene setup, recording-day checklist, YouTube/Loom/LinkedIn metadata
LAUNCH.md Long-form launch blog post with embedded GIF
SHOW_HN.md Hacker News submission copy + comment-thread playbook
TWITTER_THREAD.md Three X/Twitter thread variants with timing + hashtag guidance

Pricing & business model

Doc What's in it
PRICING.md ★ Pricing & tiers — Solo Free (free forever), Solo Pro $19/mo, Team $39/seat/mo, Enterprise custom. Free / paid boundary, what's never gated, FAQ
docs/LICENSE_KEY.md License-key validation design — Ed25519 JWS, offline verification, revocation strategy, feature manifest, CLI surface

Release & distribution

Doc What's in it
docs/RELEASE_PIPELINE.md Release runbook — PyPI trusted publisher + GHCR multi-arch image. Tag-triggered, no API tokens. Dry-run paths, rollback, sdist size budget
docs/SOAK_TEST.md Production sign-off load-test runbook — aegis soak (24h) / aegis bench (5min CI). Pass/fail thresholds, hardware sizing, common failure modes, sign-off template

Security

Doc What's in it
SECURITY.md Vulnerability reporting policy + in-scope / out-of-scope summary
docs/THREAT_MODEL.md ★ Full threat model — roles + trust boundaries, STRIDE walk with PR-referenced mitigations, cross-cutting design properties, cryptographic primitives + assumptions, residual gaps acknowledged, §8 third-party auditor checklist

Enterprise sales / design partner

Doc What's in it
docs/DESIGN_PARTNER_PROGRAM.md ★ Public-facing design partner landing — Coding AI vertical, 30-day free pilot, self-assessment 5 questions
docs/DESIGN_PARTNER_PLAYBOOK.md Internal sales playbook — 7 sales artifacts (outreach templates, pilot LOI, discovery questions, KPIs, case study template)
docs/TARGET_CUSTOMERS.md Target customer matrix (Korea Tier 1 / Global Tier 2 / Vendor Tier 3) + 8-week action plan
docs/DECK_INDEX.md 3-deck navigation (A: Continuous Compliance / B: Agent Transaction Safety / C: Regulated AI Operating Model)
docs/DECK_A_CONTINUOUS_COMPLIANCE.md CISO / CFO / Compliance officer — 30 min × 15 slides
docs/DECK_B_AGENT_TRANSACTION_SAFETY.md Platform engineering / SRE / Eng VP — 30 min × 15 slides
docs/DECK_C_REGULATED_AI_OPERATING_MODEL.md Chief AI Officer / Regulated industry — 45 min × 20 slides

Configuration

Copy .env.example to .env and fill in API keys when ready.

The defaults are deliberately offline-friendly:

Setting Default Switch to real backend
AEGIS_EMBEDDING_PROVIDER dummy openai (needs OPENAI_API_KEY)
AEGIS_JUDGE_PROVIDER dummy haiku (needs ANTHROPIC_API_KEY)
AEGIS_SAFETY_PROVIDER dummy openai (Moderations) or haiku
AEGIS_ADMIN_TOKEN dev-admin-token any random string for production AID release

Storage paths (auto-created on first run):

Setting Default
AEGIS_AUDIT_DB ./data/audit.sqlite
AEGIS_AUDIT_JSONL ./data/audit.jsonl
AEGIS_INTENT_LOG_DB ./data/intent_log.sqlite
AEGIS_COST_LEDGER_DB ./data/cost_attestation.sqlite
AEGIS_COST_LEDGER_JSONL ./data/cost_attestation.jsonl
AEGIS_JOURNAL_PATH ./data/journal.bin
AEGIS_JOURNAL_DATA_KEY_PATH ./keys/journal_data.key
AEGIS_HAM_DB ./data/ham.sqlite
AEGIS_HAM_DATA_KEY_PATH ./keys/ham_data.key
AEGIS_SIGNING_KEY_PATH / _PUBLIC_KEY_PATH ./keys/ed25519.{pem,pub}
AEGIS_COST_SIGNING_KEY_PATH / _PUBLIC_KEY_PATH ./keys/ed25519_cost.{pem,pub} (Claim 34: distinct from telemetry key)
AEGIS_POLICY_DIR ./policies/

If you set the provider to openai / haiku but the corresponding key is missing, the code automatically falls back to the dummy implementation so nothing breaks.
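That fallback rule is roughly the following (a sketch; the real logic lives in the provider factories, and the env-var names come from the tables above):

```python
import os

REQUIRED_KEY = {"haiku": "ANTHROPIC_API_KEY", "openai": "OPENAI_API_KEY"}

def effective_provider(requested: str) -> str:
    # Fall back to the offline dummy when the backing API key is absent,
    # so a misconfigured .env degrades gracefully instead of crashing.
    key = REQUIRED_KEY.get(requested)
    if key and not os.environ.get(key):
        return "dummy"
    return requested
```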


Web dashboard

Open http://localhost:8000 for the live single-page dashboard. It surfaces every M8–M16 panel:

  • Craft a tool call — preset buttons + sliders for prompt_injection, pii_exposure, the 16-slot cost metrics, and a free-form JSON args field
  • Action Firewall pipeline — animated row-by-row trace through steps 310 / 315 / 320 / 330 / 335 / 340 / 350 / 360 / 370
  • Verdict — decision badge, reason, ATV id, Ed25519 signature
  • ATV-2080-v1 bands — color-coded strip with deterministic intensity per band derived from atv_id
  • Audit chain — per-AID Merkle chain with prev_hash → this_hash visualization and live chain_valid flag
  • Burn-in baseline — per-layer phase + samples + TPR/FPR/precision
  • Burn-in attestation — L1–L5 code/config/key hashes + browser-side Ed25519 signature verification
  • AID circuit breaker (M14) — live quarantine list, admin release form
  • Forensic replay (M15) — decrypted / tampered / aids-seen tiles + per-AID chain head listing
  • Hierarchical Agent Memory (M16) — three-column store / recall / summarize+ground+forget interface; checkboxes auto-fill ground refs

/theater shows the ATV vector itself with a band-by-band breakdown.


Tests

uv run pytest --cov=aegis
  • 326 tests across unit + integration + e2e
  • mypy --strict over 61 source files
  • ruff clean
  • Concurrency: 100-record SQLite audit chain, 200-line JSONL, 100-intent ATMU WAL, and per-AID circuit-breaker counters all pass under thread contention
  • No network in tests: respx mocks api.anthropic.com; OpenAI is unused under dummy provider

Per-milestone test files:

tests/unit/
├── test_step310_args.py … test_step370_exec.py        firewall
├── test_step315_aid_auth.py · test_circuit_breaker.py   M14
├── test_atmu_state_machine.py · test_intent_log.py     M10
├── test_burnin_*.py                                     M11
├── test_cost_*.py                                       M12
├── test_judge_haiku.py (attribution head)               M13
├── test_encrypted_journal.py · test_replay.py          M15
└── test_ham.py                                          M16

tests/integration/
├── test_evaluate_e2e.py · test_audit_chain_e2e.py      M1
├── test_tool_outcome_e2e.py                             M10
├── test_burnin_e2e.py                                   M11
├── test_cost_attestation_e2e.py                         M12
├── test_admin_aid_e2e.py                                M14
├── test_replay_e2e.py                                   M15
└── test_ham_e2e.py                                      M16

Where to look

src/aegis/
├── schema.py                ATV slice constants + 30-subfield Pydantic models
├── config.py                pydantic-settings (.env loader)
├── main.py                  FastAPI factory + `app`
├── atv/
│   ├── embeddings.py        EmbeddingProvider abstraction
│   └── builder.py           build_atv() — 19 SW encoders + HW zero-fill
├── firewall/
│   ├── core.py              FirewallContext + run_firewall orchestrator
│   ├── circuit_breaker.py   M14 — per-AID violation counter + quarantine
│   ├── step310_args.py      pattern blocklist + injection threshold
│   ├── step315_aid_auth.py  M14 — AID-region authorization
│   ├── step320_blast.py     tool blast-radius lookup
│   ├── step330_human.py     high-blast → REQUIRE_APPROVAL
│   ├── step335_cost.py      M8 — forecast-gating with 16-slot metrics
│   ├── step340_policy.py    policy match + sLLM judge fallback
│   ├── step350_approval.py  M9 — approval dispatch (channels)
│   ├── step360_audit.py     M9 — sign + append + cost_attestation_hint
│   └── step370_exec.py      M9 — exec annotation (PROCEED/SUPPRESS/DEFER)
├── atmu/                    M10 — Agent Telemetry Management Unit
│   ├── state_machine.py     7-state machine + legal transitions
│   ├── intent_log.py        SQLite-backed Write-Ahead Intent Log
│   ├── checkpoint.py        blast≥7 checkpoint manifests
│   └── compensating.py      DEFAULT_COMPENSATION_STRATEGIES
├── burnin/                  M11 — 5-layer × 4-phase
│   ├── phases.py            Phase StrEnum + PhaseMetrics + can_graduate
│   └── controller.py        BurnInController (observe / record_label / try_graduate)
├── cost/                    M12 — Cost Attestation Ledger
│   ├── model_flops.py       FLOPS_PER_TOKEN per model
│   ├── divergence.py        token-to-FLOPs / memory-cost / dollar-cost metrics
│   ├── escalation.py        Claim 27 — independent of sLLM verdict
│   └── ledger.py            separate Ed25519 key + per-aid Merkle chain
├── judge/
│   ├── base.py              Judge ABC + JudgeVerdict (with `subfield_attribution`)
│   ├── haiku.py             M13 — Claude Haiku 4.5 + attribution head
│   └── dummy.py             offline stub
├── sign/
│   ├── ed25519.py           keypair management + sign/verify
│   └── merkle.py            chain hashing + verify_chain
├── audit/
│   ├── sqlite_store.py      indexed records + chain head (transactional)
│   ├── jsonl_store.py       append-only raw record dump
│   ├── encrypted_journal.py M15 — AES-256-GCM with AAD-bound header
│   └── replay.py            M15 — ReplayReport + per-AID chain rebuild
├── ham/                     M16 — Hierarchical Agent Memory L3+L4
│   └── store.py             encrypted SQLite + L1 OrderedDict cache
├── attest/
│   ├── code_attestation.py  L3/L4/L5 hashes signed at startup
│   └── burn_in.py           BurnInMeasurement assembly
└── api/
    ├── evaluate.py          POST /evaluate
    ├── approve.py           POST /approve
    ├── audit_query.py       GET  /audit/{aid}
    ├── attestation.py       GET  /attestation
    ├── tool_outcome.py      POST /tool-outcome (M10)
    ├── burnin_status.py     M11 endpoints
    ├── cost_attestation.py  M12 endpoints
    ├── admin_aid.py         M14 endpoints
    ├── replay.py            M15 endpoint
    ├── ham.py               M16 endpoints
    └── source.py            dashboard source-peek

policies/
├── default.json             deny + allow rules (PLAN 6.9)
└── aid_region.json          M14 — per-AID role policy

demo/
├── agent_demo.py            5-call scenario + M14/M15/M16 extensions
├── tools.py                 Anthropic tool catalog
└── run_scenario.sh          bring service up + run demo

tools/
├── aegis_hook.py            Claude Code PreToolUse hook
├── aegis_safety.py          PRE-LLM safety classifier (regex / OpenAI / Haiku)
└── test_hook.sh             10-case smoke test

T3 — patent-future work

The T3 (hardware) tier is fully specified — see PLAN_v3.md for the 10-milestone breakdown (M17–M26) and docs/T3_BOUNDARY.md for the T2 → T3 substitution boundary.

Quick summary of what T3 adds:

Phase Milestones Adds
A TEE software path (cloud-available) M17–M19 Real Intel TDX / AMD SEV-SNP attestation · ML-DSA post-quantum dual-signing · HW perf-counter cost attestation
B In-storage / accelerator M20–M22 FPGA/AIE bit-exact deterministic sLLM judge · HW tag comparator at memory controller · NVMe-CSD integration with in-storage similarity
C Cross-cutting hardening M23–M26 Per-AID HW resource counters · TEE-sealed key storage · Linkage-consistency vector (SW↔HW drift detection) · ZK range proof for cost dimensions (stretch)

The external contract doesn't move. T2 clients talk to T3 servers without code changes — schema, endpoints, and JSON shapes stay identical. The only visible difference is that some response fields stop being zero.

T3 hardware claims (CSD, FPGA, TEE) have schema placeholders already in place — tier_profile, cost_attestation_profile, the 200-D HW band, the hw_cost_attestation subfield range 2044..2059, the linkage_consistency_features 2060..2079. T3 fills these placeholders without breaking the external contract.
