Forensic-grade tamper-evident audit chain for LLM applications. HMAC-SHA256 chain, content-addressable storage, pre-call policy gate, regression detection.
Project description
BIJOTEL
Forensic-grade tamper-evident audit chain for LLM applications.
BIJOTEL turns the spans your OpenTelemetry GenAI instrumentation already
emits into a HMAC-sealed chain on disk, content-addressable storage with
semantic dedup, and a pre-call policy gate that audits before it blocks.
It's a plug-in to whatever tracer you have (OpenLLMetry,
AnthropicInstrumentor, custom wrappers) — it does not replace your
tracer; it extends it.
Status: v2.0.1 on PyPI; GENA production runs v1.9.1 (v2.0.x is a
docs-only PyPI release on top of the same v1.9.1 code, no behavior
change). Production-validated through 14 consecutive days on the GENA
agent ecosystem: 5,497 chain entries, 11 wheel deploys
(v0.5.0 → v1.9.1), 0 chain breaks, 2 LLM providers in the same chain
(Anthropic + xAI; the OpenAI SDK adapter is shipped, no
api.openai.com calls in production). All 14 bijuterii layers
active at the default bijotel serve engine.
Multi-provider chain (v2.0.0)
The HMAC-sealed chain handles any LLM provider that emits OTel GenAI
spans, in the same table, with the same HMAC secret, the same JCS
canonical body format. Anthropic spans (via
opentelemetry-instrumentation-anthropic) and OpenAI / xAI spans (via
bijotel.wrap()) land side-by-side. bijotel verify walks the whole
chain without distinguishing — the HMAC linkage holds regardless of
who emitted each span.
chain rows on GENA (excerpt, 2026-05-24):
seq 5490 openai.chat provider=xai grok-3-mini (gen4 verifier)
seq 5489 anthropic.chat provider=anthropic claude-haiku-4-5 (gen4 extractor)
seq 5488 openai.chat provider=xai grok-3-mini
seq 5487 anthropic.chat provider=anthropic claude-haiku-4-5
...
$ bijotel verify --db chain.db
Chain VALID (5490 entries).
That's N-version programming in production: one provider extracts, another verifies; the chain records both and verifies cleanly across both.
Install
pip install bijotel
Optional extras:
pip install "bijotel[anthropic]" # Anthropic SDK + instrumentation
pip install "bijotel[openai]" # OpenAI SDK
pip install "bijotel[api]" # FastAPI + uvicorn → `bijotel serve`
pip install "bijotel[fingerprint]" # sentence-transformers (semantic dedup)
pip install "bijotel[ast]" # tree-sitter (Bash AST code safety)
pip install "bijotel[all]" # everything above
Quickstart
import os
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.instrumentation.anthropic import AnthropicInstrumentor
from bijotel.processors import HmacChainSpanProcessor, CasSpanProcessor
provider = TracerProvider()
provider.add_span_processor(
HmacChainSpanProcessor(
secret_key=bytes.fromhex(os.environ["BIJOTEL_HMAC_SECRET"]),
db_path="chain.db",
)
)
provider.add_span_processor(CasSpanProcessor(db_path="chain.db"))
trace.set_tracer_provider(provider)
AnthropicInstrumentor().instrument()
# Now every anthropic.chat call is sealed in the chain with full canonical
# body, prev_hash linkage, HMAC, and CAS-deduped body storage.
Generate a fresh secret:
export BIJOTEL_HMAC_SECRET=$(python -c "import secrets; print(secrets.token_hex(32))")
Verify integrity later:
bijotel verify --db chain.db
CLI
After install, the bijotel command exposes 8 subcommands:
bijotel verify --db chain.db # full HMAC re-verification
bijotel inspect --db chain.db 4952 # one entry's canonical body
bijotel stats --db chain.db # chain + CAS + policy stats
bijotel list --db chain.db --since 2026-05-20 # filterable browsing
bijotel export --db chain.db --output out.json # signed portable JSON
bijotel verify-export out.json # auditor-side verification
bijotel regression --db chain.db --window 100 # z-score + IQR drift
bijotel serve --port 8080 --db chain.db # REST API only (Swagger at /docs)
bijotel serve --port 8080 --db chain.db --dashboard # API at /api/* + React dashboard at /
--since uses UTC calendar dates (YYYY-MM-DD, lower bound inclusive
at 00:00:00Z), consistent across all subcommands.
REST API (bijotel serve)
bijotel serve exposes 18 endpoints. Full OpenAPI 3.1 spec at
/openapi.json, interactive Swagger UI at /docs.
| Method | Path | Description |
|---|---|---|
| GET | /health |
Liveness + version + db_exists |
| GET | /version |
Package version metadata |
| GET | /docs |
Swagger UI |
| GET | /redoc |
ReDoc UI |
| GET | /openapi.json |
OpenAPI 3.1 spec |
| GET | /chain |
Paginated chain rows, since/until filters |
| GET | /chain/stats |
Aggregate counters (entries/CAS/dedup/age) |
| GET | /chain/{seq} |
One entry with full canonical body |
| POST | /chain/verify |
Smoke (default) or full=true canonical |
| GET | /policy/rules |
Active rules + closure introspection |
| POST | /policy/evaluate |
Dry-run a request through the engine |
| GET | /layers |
14-layer bijuterii manifest |
| GET | /regression/latest |
Most recent persisted run |
| GET | /regression/history |
Paginated timeline |
| POST | /regression/run |
Execute fresh run (optionally persist) |
| POST | /export |
Download a signed JSON snapshot |
| POST | /export/verify |
Upload + return validity + reason |
Optional Bearer auth
Set BIJOTEL_API_KEY on the serve process and all endpoints (except
/health, /version, /docs, /redoc, /openapi.json) require
Authorization: Bearer <key>. Constant-time comparison
(hmac.compare_digest). Empty / unset env = no auth (dev mode).
Dashboard
A React/Vite dashboard ships in src/bijotel/dashboard/:
| Page | URL | What it does |
|---|---|---|
| Chain Explorer | /chain |
4 stats cards + paginated table + click-row → side panel with full canonical body; Verify + Export buttons |
| Policy Decisions | /policy |
Active-rules grid + live Evaluate form (dry-run a prompt) + layers grid |
| Regression Monitor | /regression |
Status cards + recharts timeline (24h/7d/30d/all) + per-dimension breakdown + Run-Now panel |
| System Status | /system |
Full 14-layer manifest table |
Bundle stays under 100 KB gzip on initial load thanks to per-route code
splitting; the heavy recharts chunk lazy-loads only when
/regression is visited.
Dev:
cd src/bijotel/dashboard
npm install
npm run dev # http://localhost:5173 with /api proxied to :8080
Production build → dashboard_dist/ at project root. Day 12 polish
wires bijotel serve --dashboard to mount it as static.
14 AI safety bijuterii — all active in v2.0.0
Each layer maps to a catalog pattern. status reflects the live
GET /layers response on a healthy production install. There are
three states the endpoint can report:
- active — layer is wired in this process right now and has runtime evidence (chain rows, rule in PolicyEngine, tracker on app state, sibling JSON on disk, etc.).
- available — code ships and is importable, but nothing on this server currently uses it; the host can opt in via config.
- planned — no code yet. In v2.0.0 there are zero
plannedlayers — the catalog is whole.
On a fresh pip install with the v2.0.0 default engine
(prompt_pattern_deny + pii_detection + output_length_limit + ast_safety_check + routing_recommendation), the layers below
report active immediately once their evidence trigger is met
(see column "active when…"). The empty-chain edge case is the only
one where forensic_chain/regression start as
available — they flip to active after the first sealed span.
| # | Bijuterie | Layer | Active when… | v2.0.0 |
|---|---|---|---|---|
| 11 | Forensic-First | HMAC-SHA256 chain (HmacChainSpanProcessor) |
chain table has ≥1 row | ✅ active |
| 2 | Content-Addressable Storage | CAS unique-body table | cas table has ≥1 row | ✅ active |
| 2 | Merkle DAG | dag_nodes + dag_refs reference graph |
dag_nodes has ≥1 row | ✅ active (since v1.5.3) |
| 10 | Compliance-as-Code | PolicyEngine + 11 rule factories | engine on app state | ✅ active |
| 5 | AST-First Code Safety | tree-sitter bash + stdlib Python ast | ast_safety_check rule in engine |
✅ active (since v1.9.1) |
| 15 | Inference Routing | Pareto cost/quality/latency + per-agent budget | routing_recommendation rule in engine |
✅ active (since v1.6.0) |
| D | Containment (Combo D) | Permitted + Safe + Sealed orchestrator | ContainmentGuard on app state |
✅ active (since v1.7.0) |
| 9 | Consensus Voting | Multi-model agreement, N-version pattern | consensus_provider on app state |
✅ active (since v1.8.0) |
| 3 | Energy Accounting | per-call Wh + grams CO2 + region grid intensity | EnergyTracker attached or energy_log rows |
✅ active (since v1.9.0) |
| 16 | Regression Detection | z-score + IQR over input_tokens/output_tokens/cost | chain has ≥5 rows | ✅ active |
| 7 | Deterministic + Semantic Fingerprinting | SHA-256 + sentence-transformers | bijotel_fingerprints.db has ≥1 row |
✅ active (since v1.6.0) |
| 18 | Misalignment Probes | 29 probes across 8 attack categories | misalignment_probes_*.json sibling exists |
✅ active (since v1.9.1) |
| 19 | OTel GenAI Semconv | Compatible with OpenLLMetry, Anthropic/OpenAI instrumentors | always (semantic conventions used throughout) | ✅ active |
| 7 | Provider Protocol | AnthropicAdapter, OpenAIAdapter (xAI via OpenAI-compatible) |
always | ✅ active |
Why no more planned
Up through v1.8.0 the table carried planned for Energy
Accounting and planned for Consensus Voting — both
shipped in v1.8.0 / v1.9.0 with full tests and production proof:
- v1.8.0 — Consensus: real Haiku vs Sonnet votes recorded (factual agreement 1.00, creative agreement 0.15).
- v1.9.0 — Energy: 14-day GENA backfill produced 19.95 Wh / 7.58 g CO2; Haiku-migration savings ≈ 8× captured ex post facto.
v2.0.0 is the tag for the moment the column emptied.
What makes BIJOTEL different
- HMAC-SHA256 tamper-evident chain. Each span carries
prev_hash || canonical_hashre-hashed with a server secret. Any mutation — even reordering — breaks verification. Thebijotel-chain-v1export schema lets external auditors verify with the secret alone, no SQLite access. - Content-addressable storage with semantic dedup. Identical
request bodies share storage; the dedup factor surfaces as a metric
(
/chain/statsfield). The Merkle DAG layer (#2) enables reference-graph queries. - Pre-call policy gate with audit trail. Eight rule factories
(
prompt_pattern_deny,pii_detection,output_length_limit,model_allowlist,model_version_pin,cost_per_call_max,daily_token_budget,rate_limit_calls_per_minute) compose into aPolicyEngine. Decisions:allow/warn/deny. Warnings attach to the span viabijotel.policy.warning. Denies emit a synthetic chain entry withbijotel.blocked=true. - Statistical regression detection on the chain itself. No
separate metrics pipeline.
RegressionDetectorreads fromchain.db, computes baseline + flags drift on input_tokens / output_tokens / cost using z-score AND IQR (defaultBOTHmode minimizes false positives). - Composable with upstream OTel instrumentors. BIJOTEL adds
SpanProcessors on top of your existingopentelemetry-instrumentation-anthropic/opentelemetry-instrumentation-openaichain. It never wraps the SDK call itself, so there's no provider-specific glue to maintain.
Production validated (v1.9.1 deploy, 2026-05-24)
GENA's production agent ecosystem (Aisophical) has been the rolling integration test since v0.5.0:
- 14 days continuous operation (2026-05-10 → 2026-05-24), 11 wheel deploys on GENA: v0.5.0 → v0.6.0 → v0.6.1 → v1.1.0 → v1.4.0 → v1.5.2 → v1.5.3 → v1.7.0 → v1.8.0 → v1.9.0 → v1.9.1. (v2.0.0 and v2.0.1 are docs-only releases on PyPI; the production code is v1.9.1.)
- 5,497 chain entries,
bijotel verify --db chain.dbreturnsChain VALIDend-to-end — cross-version and cross-provider HMAC continuity. - 0 chain breaks across the 11-deploy window; the chain
processor's
BEGIN IMMEDIATEcritical section + WAL mode survived every concurrent-writer scenario including the gen4 add-on (today) writing into the same DB from a separate process. - 2 LLM providers in the chain as of 2026-05-24:
Anthropic (claude-haiku-4-5 + claude-sonnet-4) emitted via
AnthropicInstrumentor, and xAI (grok-3-mini) emitted viabijotel.wrap()on an OpenAI-SDK client pointing athttps://api.x.ai/v1. Both providers verify under the same HMAC. - All 14 layers
activein/api/layerson the defaultbijotel serveengine. - 14-day energy footprint (chain backfill, 2026-05-24): 19.95 Wh / 7.58 g CO2 across 5,459 LLM calls. Haiku migration on 2026-05-21 cut daily CO2 ≈ 8× — captured retroactively, not designed in.
- Cross-provider consensus sample (2026-05-24): Haiku vs Sonnet on a factual prompt scored 1.00 agreement (same answer); on a creative prompt scored 0.15 (genuine disagreement → flag).
Full reports in
AUDIT_2026_05_23.md and
TIER_MIGRATION_2026_05_23.md;
the v1.1.0-era integration test in
INTEGRATION_TEST_20260523.md.
Known issues
- Dashboard
bijotel serve --dashboardnot wired yet (planned v1.3+). Today's pattern: runbijotel serveon the backend, run Vite dev server (or any static host) on the dashboard side, point the proxy at:8080. - Vite dev server binds IPv6-only on some Windows installs.
curl 127.0.0.1:5173returns nothing; usecurl localhost:5173(DNS resolves to ::1) orcurl '[::1]:5173'. Browsers are fine. - GENA-style deploys that install the wheel without extras must
add
python-multipartto theirrequirements.txtif they wantPOST /export/verifyto register (FastAPI'sUploadFilerequires it). The[api]extra carries it transitively. - GitHub source repository remains private during the v1.x.x development window — the URLs in PyPI metadata (Documentation, Issues, Changelog, Source) currently 404 for external visitors. This is intentional and will flip when the repo goes public.
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bijotel-2.0.3-py3-none-any.whl.
File metadata
- Download URL: bijotel-2.0.3-py3-none-any.whl
- Upload date:
- Size: 388.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
de03508777ee18d4b54ea25d30bdbf4440faf92b2ffa90d124e44801db66dee9
|
|
| MD5 |
88135b20d73850b907d7db11e153c8a0
|
|
| BLAKE2b-256 |
9bad86659cc18653d6c84dc78992d30eec8c7715e92aad225ff61fe762f7e99b
|