Credence — epistemic enforcement layer that prevents LLMs from forgetting what they didn't know
Credence
AI doesn't remember what it wasn't sure about. Credence does.
pip install "credence-guard[mcp]"
credence demo # 30-second smoke test, no API key required
[mcp] adds the FastMCP server for Claude Code. Core package has zero hard dependencies.
The problem
You say: "The rate limit is probably around 50 — I haven't confirmed it yet."
Fifteen turns later, Claude writes:
RATE_LIMIT = 50 # no warning. no flag. shipped.
The API rejects every request at 2am. The real limit was 10. Claude forgot you weren't sure.
This isn't hallucination. The model reproduced exactly what it read. What it read had the qualifier stripped — by context compression, fifteen turns back.
What Credence does
Tracks uncertain values the moment you state them. Blocks writes that embed those values until you confirm them.
you say "rate limit is probably 50"
→ observer registers it (before Claude responds)
→ Claude writes: RATE_LIMIT = 50 # ⚠ CREDENCE[unverified]
→ write blocked until you confirm
Every other tool warns. Credence enforces.
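A minimal sketch of the first step in that flow, spotting a hedged value in a user message. The phrase list and the `find_uncertain_values` helper are hypothetical illustrations, not Credence's actual detection logic, which this README does not describe.

```python
import re

# Hypothetical hedge vocabulary; the real observer's phrase list is not documented here.
HEDGES = re.compile(
    r"\b(probably|i think|i believe|maybe|around|roughly|not sure|haven't confirmed)\b",
    re.IGNORECASE,
)
NUMBER = re.compile(r"\b\d+(?:\.\d+)?\b")


def find_uncertain_values(message: str) -> list[str]:
    """Return numeric literals from sentences that contain a hedge phrase."""
    values = []
    # Split on sentence-ending punctuation so a hedge only taints its own sentence.
    for sentence in re.split(r"(?<=[.!?])\s+", message):
        if HEDGES.search(sentence):
            values.extend(NUMBER.findall(sentence))
    return values


print(find_uncertain_values("The rate limit is probably around 50. Retries are 3."))
# → ['50']
```

Only the hedged sentence contributes a value, so the confident "Retries are 3" is left alone.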
What it looks like
# Claude generates this. Credence intercepts before it ships.
class StripeClient:
    API_VERSION = "2023-10-16"  # ⚠⚠ CREDENCE[stale]: API date versions change on release — verify before shipping
    RATE_LIMIT = 100            # ⚠ CREDENCE[unverified]: I think Stripe rate limit is around 100 req/min
    TOKEN_EXPIRY = 3600         # ⚠⚠ CREDENCE[stale]: Token/session lifetime values are set by the vendor — verify
    MAX_RETRIES = 3
    TIMEOUT_MS = 5000
credence: blocked Edit — 2 unverified value(s)
→ I think Stripe rate limit is around 100 req/min | TOKEN_EXPIRY = 3600
Verify first, then retry. Use credence_constraints to see all pending.
After you confirm: "Confirmed — rate limit is 100 req/min per stripe.com/docs" → gate clears.
Setup
1. Add to .mcp.json:
{ "mcpServers": { "credence": { "command": "credence-server" } } }
2. Add to .claude/settings.json:
{
  "hooks": {
    "UserPromptSubmit": [
      { "hooks": [{ "type": "command", "command": "python3 -m credence.observer" }] }
    ],
    "PreToolUse": [
      {
        "matcher": "Write|Edit|Bash|NotebookEdit",
        "hooks": [{ "type": "command", "command": "python3 -m credence.hooks" }]
      }
    ]
  }
}
Done. No API key required.
Registry: Credence creates epistemic_registry.db in your working directory. Add *.db to your .gitignore, or set CREDENCE_DB=~/.credence/registry.db to keep it global.
Session tracking: Set CREDENCE_SESSION_ID=my-project to keep constraints stable across directory changes and terminal restarts.
Event log: The gate writes block/allow events to ~/.credence/events.jsonl (local only, never sent anywhere). Set CREDENCE_NO_LOG=1 to disable.
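The three environment variables above could be resolved roughly as follows. This is a sketch under assumptions: `resolve_settings` is a hypothetical helper, and how Credence actually parses these variables (for example, which values of CREDENCE_NO_LOG count as "set") is not specified here.

```python
import os
from pathlib import Path


def resolve_settings(env: dict[str, str], cwd: Path) -> dict[str, object]:
    """Map the documented environment variables to effective settings."""
    db = env.get("CREDENCE_DB")
    # Default per the README: a registry file in the working directory.
    db_path = Path(db).expanduser() if db else cwd / "epistemic_registry.db"
    return {
        "db_path": db_path,
        # None means "not pinned"; the fallback behavior is not documented here.
        "session_id": env.get("CREDENCE_SESSION_ID"),
        # Assumption: only the literal value "1" disables the event log.
        "log_events": env.get("CREDENCE_NO_LOG") != "1",
    }


print(resolve_settings(dict(os.environ), Path.cwd()))
```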
How it works
Two layers, neither requires model cooperation:
| Layer | Hook | Role |
|---|---|---|
| Observer | UserPromptSubmit | Passive listener: registers uncertain values before Claude generates anything |
| Gate | PreToolUse | Blocks writes that embed unverified values |
The observer fires before the model processes your message. If you say "I think the rate limit is 50", the registry has that entry before Claude generates a single token.
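To make the ordering concrete, here is a rough sketch of an observer-style hook that stores a hedged prompt in SQLite before any generation happens. It assumes the hook payload is a JSON object with a `prompt` field, as Claude Code passes to UserPromptSubmit hooks. The table schema, the `open_registry` and `observe` names, and the two-phrase hedge check are all hypothetical, not Credence's real internals.

```python
import json
import sqlite3


def open_registry(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a constraint store. The schema is illustrative only."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS constraints ("
        "id INTEGER PRIMARY KEY, claim TEXT NOT NULL, "
        "status TEXT NOT NULL DEFAULT 'unverified')"
    )
    return conn


def observe(conn: sqlite3.Connection, hook_payload: str) -> int:
    """Register the prompt if it sounds uncertain; return the number of rows added."""
    prompt = json.loads(hook_payload).get("prompt", "")
    # Deliberately tiny hedge check, standing in for real detection.
    if not any(h in prompt.lower() for h in ("i think", "probably")):
        return 0
    with conn:  # commits on success
        conn.execute("INSERT INTO constraints (claim) VALUES (?)", (prompt,))
    return 1


conn = open_registry()
observe(conn, json.dumps({"prompt": "I think the rate limit is 50"}))
print(conn.execute("SELECT claim, status FROM constraints").fetchall())
# → [('I think the rate limit is 50', 'unverified')]
```

Because the row is written by the hook process itself, nothing depends on the model choosing to remember the qualifier.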
What gets blocked
credence: blocked Edit — 2 unverified value(s)
→ rate limit is probably 50 req/min | token expires in 3600s
Verify first, then retry. Use credence_constraints to see all pending.
Once verified, the gate clears.
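The blocking decision can be sketched as a pure function over the hook payload and the pending constraints. This is an illustration, not the shipped gate: the `gate` function, the shape of `pending`, and the message format are hypothetical. It assumes the Claude Code convention that a PreToolUse hook exiting with code 2 blocks the tool call and returns stderr to the model.

```python
import json
import re


def gate(payload: dict, pending: list[dict]) -> tuple[int, str]:
    """Decide whether a tool call may proceed.

    Returns (exit_code, message). Exit code 2 is the Claude Code hook
    convention for "block this call and show stderr to the model".
    """
    # Serialize the whole tool input so Write, Edit, and Bash are all covered.
    body = json.dumps(payload.get("tool_input", {}))
    hits = [
        c for c in pending
        # Digit lookarounds keep "50" from matching inside "500".
        if re.search(rf"(?<!\d){re.escape(c['value'])}(?!\d)", body)
    ]
    if not hits:
        return 0, ""
    claims = " | ".join(c["claim"] for c in hits)
    tool = payload.get("tool_name", "tool")
    return 2, f"blocked {tool}: {len(hits)} unverified value(s)\n  -> {claims}"


payload = {
    "tool_name": "Write",
    "tool_input": {"file_path": "config.py", "content": "RATE_LIMIT = 50\n"},
}
pending = [{"claim": "rate limit is probably 50", "value": "50"}]
code, message = gate(payload, pending)
print(code)     # → 2
print(message)
```

A real hook would read the payload from stdin, print the message to stderr, and exit with the returned code. Once a constraint is verified it drops out of `pending`, and the same call returns 0.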
What Credence does NOT do
- Does not verify facts — it cannot tell you if a value is correct
- Does not catch uncertainty that was never stated
- Does not block the model from saying a wrong value in prose — only from writing it to a file or command
Measured results
46% of uncertainty qualifiers are stripped by Claude Haiku during context compression. Credence blocks 100% of those writes (n=50, bootstrap CI: [0%–0%]).
Validated across 7 open-weight models (Qwen, Mistral, Llama, Phi, Gemma) from 5 organizations: same failure mode, same block rate.
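A note on reading a bootstrap interval of [0%–0%]: if the interval refers to the miss rate (an assumption; the summary above does not say which quantity it bounds), a zero-width interval is what a percentile bootstrap gives when every trial had the same outcome. The sketch below shows that mechanically. It is not the evaluation code from evals/.

```python
import random


def bootstrap_ci(outcomes, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean of 0/1 outcomes."""
    rng = random.Random(seed)
    n = len(outcomes)
    # Resample with replacement and record the mean of each resample.
    means = sorted(sum(rng.choices(outcomes, k=n)) / n for _ in range(n_resamples))
    lo = means[int((alpha / 2) * n_resamples)]
    hi = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lo, hi


# 50 trials with zero misses: every resample is all zeros, so the interval collapses.
misses = [0] * 50
print(bootstrap_ci(misses))
# → (0.0, 0.0)
```

A degenerate interval like this reflects the absence of observed misses in the sample. It does not by itself bound the true miss rate at zero.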
credence demo # smoke test, no API key
python3 -m pytest tests/ -q # 829 tests
python3 -m evals.latency_report # P50/P95/P99
Full methodology: docs/TECHNICAL_REPORT.md
Project layout
credence/            pip-installable package
  observer.py        passive UserPromptSubmit hook
  hooks.py           PreToolUse enforcement gate
  mcp_server.py      17-tool MCP server
  registry.py        SQLite constraint store
  memory.py          cross-session persistence
tests/               829 tests
evals/               validation studies + multi-model benchmarks
docs/                technical report, architecture, ETP spec
credence_gate/       Rust gate (alternative to Python hooks.py)
experimental/        Phase 2 work, not yet shipped
paper/               Research paper draft + figures
Built by
Lakshmi Chakradhar Vijayarao — GitHub · LinkedIn · X
MIT License