Tamper-evident HMAC audit chain for LLM applications. HMAC-SHA256 chain, content-addressable storage, pre-call policy gate, regression detection.

These details have not been verified by PyPI

Project links

Project description

BIJOTEL

Tamper-evident HMAC audit chain for LLM applications.

BIJOTEL turns the spans your OpenTelemetry GenAI instrumentation already emits into a HMAC-sealed chain on disk, content-addressable storage with semantic dedup, and a pre-call policy gate that audits before it blocks. It's a plug-in to whatever tracer you have (OpenLLMetry, AnthropicInstrumentor, custom wrappers) — it does not replace your tracer; it extends it.

Status: v2.0.5 on PyPI; GENA production runs v2.0.5 (deployed 2026-05-25 via the F11 pattern-expansion release). Now also running on a second independent system (ARA, ai-research-agency on aarch64) since 2026-05-25. Production-validated through 15 consecutive days on GENA: 5,889 chain entries, 14 wheel deploys (v0.5.0 → v2.0.5), 0 chain breaks, 2 LLM providers in the same chain (Anthropic + xAI; the OpenAI SDK adapter is shipped, no api.openai.com calls in production). All 14 bijuterii layers active at the default bijotel serve engine.

Multi-provider chain (v2.0.0)

The HMAC-sealed chain handles any LLM provider that emits OTel GenAI spans, in the same table, with the same HMAC secret, the same JCS canonical body format. Anthropic spans (via opentelemetry-instrumentation-anthropic) and OpenAI / xAI spans (via bijotel.wrap()) land side-by-side. bijotel verify walks the whole chain without distinguishing — the HMAC linkage holds regardless of who emitted each span.

chain rows on GENA (excerpt, 2026-05-24):
  seq 5490  openai.chat     provider=xai        grok-3-mini      (gen4 verifier)
  seq 5489  anthropic.chat  provider=anthropic  claude-haiku-4-5 (gen4 extractor)
  seq 5488  openai.chat     provider=xai        grok-3-mini
  seq 5487  anthropic.chat  provider=anthropic  claude-haiku-4-5
  ...
  $ bijotel verify --db chain.db
  Chain VALID (5490 entries).

That's N-version programming in production: one provider extracts, another verifies; the chain records both and verifies cleanly across both.

Install

pip install bijotel

Optional extras:

pip install "bijotel[anthropic]"     # Anthropic SDK + instrumentation
pip install "bijotel[openai]"        # OpenAI SDK
pip install "bijotel[api]"           # FastAPI + uvicorn → `bijotel serve`
pip install "bijotel[fingerprint]"   # sentence-transformers (semantic dedup)
pip install "bijotel[ast]"           # tree-sitter (Bash AST code safety)
pip install "bijotel[all]"           # everything above

Quickstart

import os
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.instrumentation.anthropic import AnthropicInstrumentor

from bijotel.processors import HmacChainSpanProcessor, CasSpanProcessor

provider = TracerProvider()
provider.add_span_processor(
    HmacChainSpanProcessor(
        secret_key=bytes.fromhex(os.environ["BIJOTEL_HMAC_SECRET"]),
        db_path="chain.db",
    )
)
provider.add_span_processor(CasSpanProcessor(db_path="chain.db"))
trace.set_tracer_provider(provider)
AnthropicInstrumentor().instrument()

# Now every anthropic.chat call is sealed in the chain with full canonical
# body, prev_hash linkage, HMAC, and CAS-deduped body storage.

Generate a fresh secret:

export BIJOTEL_HMAC_SECRET=$(python -c "import secrets; print(secrets.token_hex(32))")

Verify integrity later:

bijotel verify --db chain.db

CLI

After install, the bijotel command exposes 8 subcommands:

bijotel verify --db chain.db                          # full HMAC re-verification
bijotel inspect --db chain.db 4952                     # one entry's canonical body
bijotel stats --db chain.db                           # chain + CAS + policy stats
bijotel list --db chain.db --since 2026-05-20         # filterable browsing
bijotel export --db chain.db --output out.json        # signed portable JSON
bijotel verify-export out.json                         # auditor-side verification
bijotel regression --db chain.db --window 100         # z-score + IQR drift
bijotel serve --port 8080 --db chain.db               # REST API only (Swagger at /docs)
bijotel serve --port 8080 --db chain.db --dashboard   # API at /api/* + React dashboard at /

--since uses UTC calendar dates (YYYY-MM-DD, lower bound inclusive at 00:00:00Z), consistent across all subcommands.

REST API (`bijotel serve`)

bijotel serve exposes 18 endpoints. Full OpenAPI 3.1 spec at /openapi.json, interactive Swagger UI at /docs.

Method	Path	Description
GET	`/health`	Liveness + version + db_exists
GET	`/version`	Package version metadata
GET	`/docs`	Swagger UI
GET	`/redoc`	ReDoc UI
GET	`/openapi.json`	OpenAPI 3.1 spec
GET	`/chain`	Paginated chain rows, since/until filters
GET	`/chain/stats`	Aggregate counters (entries/CAS/dedup/age)
GET	`/chain/{seq}`	One entry with full canonical body
POST	`/chain/verify`	Smoke (default) or `full=true` canonical
GET	`/policy/rules`	Active rules + closure introspection
POST	`/policy/evaluate`	Dry-run a request through the engine
GET	`/layers`	14-layer bijuterii manifest
GET	`/regression/latest`	Most recent persisted run
GET	`/regression/history`	Paginated timeline
POST	`/regression/run`	Execute fresh run (optionally persist)
POST	`/export`	Download a signed JSON snapshot
POST	`/export/verify`	Upload + return validity + reason

Optional Bearer auth

Set BIJOTEL_API_KEY on the serve process and all endpoints (except /health, /version, /docs, /redoc, /openapi.json) require Authorization: Bearer <key>. Constant-time comparison (hmac.compare_digest). Empty / unset env = no auth (dev mode).

Dashboard

A React/Vite dashboard ships in src/bijotel/dashboard/:

Page	URL	What it does
Chain Explorer	`/chain`	4 stats cards + paginated table + click-row → side panel with full canonical body; Verify + Export buttons
Policy Decisions	`/policy`	Active-rules grid + live Evaluate form (dry-run a prompt) + layers grid
Regression Monitor	`/regression`	Status cards + recharts timeline (24h/7d/30d/all) + per-dimension breakdown + Run-Now panel
System Status	`/system`	Full 14-layer manifest table

Bundle stays under 100 KB gzip on initial load thanks to per-route code splitting; the heavy recharts chunk lazy-loads only when /regression is visited.

Dev:

cd src/bijotel/dashboard
npm install
npm run dev   # http://localhost:5173 with /api proxied to :8080

Production build → dashboard_dist/ at project root. Day 12 polish wires bijotel serve --dashboard to mount it as static.

14 AI safety bijuterii — all active in v2.0.0

Each layer maps to a catalog pattern. status reflects the live GET /layers response on a healthy production install. There are three states the endpoint can report:

active — layer is wired in this process right now and has runtime evidence (chain rows, rule in PolicyEngine, tracker on app state, sibling JSON on disk, etc.).
available — code ships and is importable, but nothing on this server currently uses it; the host can opt in via config.
planned — no code yet. In v2.0.0 there are zero planned layers — the catalog is whole.

On a fresh pip install with the v2.0.0 default engine (prompt_pattern_deny + pii_detection + output_length_limit + ast_safety_check + routing_recommendation), the layers below report active immediately once their evidence trigger is met (see column "active when…"). The empty-chain edge case is the only one where forensic_chain/regression start as available — they flip to active after the first sealed span.

#	Bijuterie	Layer	Active when…	v2.0.0
11	Forensic-First	HMAC-SHA256 chain (`HmacChainSpanProcessor`)	chain table has ≥1 row	✅ active
2	Content-Addressable Storage	CAS unique-body table	cas table has ≥1 row	✅ active
2	Merkle DAG	`dag_nodes` + `dag_refs` reference graph	dag_nodes has ≥1 row	✅ active (since v1.5.3)
10	Compliance-as-Code	PolicyEngine + 11 rule factories	engine on app state	✅ active
5	AST-First Code Safety	tree-sitter bash + stdlib Python ast	`ast_safety_check` rule in engine	✅ active (since v1.9.1)
15	Inference Routing	Pareto cost/quality/latency + per-agent budget	`routing_recommendation` rule in engine	✅ active (since v1.6.0)
D	Containment (Combo D)	Permitted + Safe + Sealed orchestrator	`ContainmentGuard` on app state	✅ active (since v1.7.0)
9	Consensus Voting	Multi-model agreement, N-version pattern	`consensus_provider` on app state	✅ active (since v1.8.0)
3	Energy Accounting	per-call Wh + grams CO2 + region grid intensity	`EnergyTracker` attached or `energy_log` rows	✅ active (since v1.9.0)
16	Regression Detection	z-score + IQR over input_tokens/output_tokens/cost	chain has ≥5 rows	✅ active
7	Deterministic + Semantic Fingerprinting	SHA-256 + sentence-transformers	`bijotel_fingerprints.db` has ≥1 row	✅ active (since v1.6.0)
18	Misalignment Probes	29 probes across 8 attack categories	`misalignment_probes_*.json` sibling exists	✅ active (since v1.9.1)
19	OTel GenAI Semconv	Compatible with OpenLLMetry, Anthropic/OpenAI instrumentors	always (semantic conventions used throughout)	✅ active
7	Provider Protocol	`AnthropicAdapter`, `OpenAIAdapter` (xAI via OpenAI-compatible)	always	✅ active

Why no more `planned`

Up through v1.8.0 the table carried planned for Energy Accounting and planned for Consensus Voting — both shipped in v1.8.0 / v1.9.0 with full tests and production proof:

v1.8.0 — Consensus: real Haiku vs Sonnet votes recorded (factual agreement 1.00, creative agreement 0.15).
v1.9.0 — Energy: 14-day GENA backfill produced 19.95 Wh / 7.58 g CO2; Haiku-migration savings ≈ 8× captured ex post facto.

v2.0.0 is the tag for the moment the column emptied.

What makes BIJOTEL different

HMAC-SHA256 tamper-evident chain. Each span carries prev_hash || canonical_hash re-hashed with a server secret. Any mutation — even reordering — breaks verification. The bijotel-chain-v1 export schema lets external auditors verify with the secret alone, no SQLite access.
Content-addressable storage with semantic dedup. Identical request bodies share storage; the dedup factor surfaces as a metric (/chain/stats field). The Merkle DAG layer (#2) enables reference-graph queries.
Pre-call policy gate with audit trail. Eight rule factories (prompt_pattern_deny, pii_detection, output_length_limit, model_allowlist, model_version_pin, cost_per_call_max, daily_token_budget, rate_limit_calls_per_minute) compose into a PolicyEngine. Decisions: allow / warn / deny. Warnings attach to the span via bijotel.policy.warning. Denies emit a synthetic chain entry with bijotel.blocked=true.
Statistical regression detection on the chain itself. No separate metrics pipeline. RegressionDetector reads from chain.db, computes baseline + flags drift on input_tokens / output_tokens / cost using z-score AND IQR (default BOTH mode minimizes false positives).
Composable with upstream OTel instrumentors. BIJOTEL adds SpanProcessors on top of your existing opentelemetry-instrumentation-anthropic / opentelemetry-instrumentation-openai chain. It never wraps the SDK call itself, so there's no provider-specific glue to maintain.

Production validated (v2.0.5 deploy, 2026-05-25)

GENA's production agent ecosystem (Aisophical) has been the rolling integration test since v0.5.0:

15 days continuous operation (2026-05-10 → 2026-05-25), 14 wheel deploys on GENA: v0.5.0 → v0.6.0 → v0.6.1 → v1.1.0 → v1.4.0 → v1.5.2 → v1.5.3 → v1.7.0 → v1.8.0 → v1.9.0 → v1.9.1 → v2.0.3 → v2.0.4 → v2.0.5.
5,889 chain entries, bijotel verify --db chain.db returns Chain VALID end-to-end — cross-version and cross-provider HMAC continuity.
0 chain breaks across the 14-deploy window; the chain processor's BEGIN IMMEDIATE critical section + WAL mode survived every concurrent-writer scenario including the gen4 add-on (today) writing into the same DB from a separate process.
2 LLM providers in the chain as of 2026-05-24: Anthropic (claude-haiku-4-5 + claude-sonnet-4) emitted via AnthropicInstrumentor, and xAI (grok-3-mini) emitted via bijotel.wrap() on an OpenAI-SDK client pointing at https://api.x.ai/v1. Both providers verify under the same HMAC.
All 14 layers active in /api/layers on the default bijotel serve engine.
14-day energy footprint (chain backfill, 2026-05-24): 19.95 Wh / 7.58 g CO2 across 5,459 LLM calls. Haiku migration on 2026-05-21 cut daily CO2 ≈ 8× — captured retroactively, not designed in.
Cross-provider consensus sample (2026-05-24): Haiku vs Sonnet on a factual prompt scored 1.00 agreement (same answer); on a creative prompt scored 0.15 (genuine disagreement → flag).

For deep production validation across Rounds 1–3 (46 tests, 0 partial/fail), see release notes in CHANGELOG.md and the live demo with a verify-yourself chain at https://bijotel.whiteandpoint.com.

Known issues

Vite dev server binds IPv6-only on some Windows installs. curl 127.0.0.1:5173 returns nothing; use curl localhost:5173 (DNS resolves to ::1) or curl '[::1]:5173'. Browsers are fine. Only affects the dashboard dev workflow; the built bundle shipped in the wheel is unaffected.
F11 pattern coverage improved to 100% on the R1 production probe corpus in v2.0.5 (23/23 attack vectors caught with 0 false positives on the benign control set). Coverage on broader, unseen attack corpora is necessarily lower; pattern expansion is iterative, not "done". See the v2.0.5 entry in CHANGELOG.md for the methodology.
bijotel verify signals failure via stderr + exit code 3, not stdout. A surfaced R3 finding; if you script the CLI, branch on exit code, not stdout substring match.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.0.6

May 25, 2026

2.0.5

May 24, 2026

2.0.4

May 24, 2026

2.0.3

May 24, 2026

2.0.2

May 24, 2026

2.0.1

May 24, 2026

2.0.0

May 24, 2026

1.9.1

May 24, 2026

1.9.0

May 24, 2026

1.8.0

May 24, 2026

1.7.0

May 24, 2026

1.6.0

May 24, 2026

1.5.3

May 23, 2026

1.5.2

May 23, 2026

1.5.1

May 23, 2026

1.5.0

May 23, 2026

1.4.0

May 22, 2026

1.1.0

May 22, 2026

1.0.0

May 22, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bijotel-2.0.6-py3-none-any.whl (392.0 kB view details)

Uploaded May 25, 2026 Python 3

File details

Details for the file bijotel-2.0.6-py3-none-any.whl.

File metadata

Download URL: bijotel-2.0.6-py3-none-any.whl
Upload date: May 25, 2026
Size: 392.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for bijotel-2.0.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d8ec1668d8967a0ca1f3f46edf56a81a2137ad908983e0d37f60a4ae9f87566d`
MD5	`71cb9cecdbae15a8f76ae1d04fbca99b`
BLAKE2b-256	`3d4ebcee2c32bb004fc01470d890031313890ac647dc9d22b4ebb1aa4409d013`

See more details on using hashes here.

bijotel 2.0.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BIJOTEL

Multi-provider chain (v2.0.0)

Install

Quickstart

CLI

REST API (`bijotel serve`)

Optional Bearer auth

Dashboard

14 AI safety bijuterii — all active in v2.0.0

Why no more `planned`

What makes BIJOTEL different

Production validated (v2.0.5 deploy, 2026-05-25)

Known issues

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

bijotel 2.0.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BIJOTEL

Multi-provider chain (v2.0.0)

Install

Quickstart

CLI

REST API (bijotel serve)

Optional Bearer auth

Dashboard

14 AI safety bijuterii — all active in v2.0.0

Why no more planned

What makes BIJOTEL different

Production validated (v2.0.5 deploy, 2026-05-25)

Known issues

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

REST API (`bijotel serve`)

Why no more `planned`