CLI for pre-diff risk classification with schema validation and disk-verified citations.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

hibou

These details have not been verified by PyPI

Project links

Methodology

Project description

antemortem — does your AI agent actually read your code, or just sound like it?

Your coding agent wrote a plan and swears it's safe against your repo. Prove it. antemortem makes the AI cite a real file:line for every claim, then machine-checks each citation against the bytes on your disk — offline. A fabricated citation makes the check fail. No prose to trust. A deterministic PASS / FAIL, with a hard fabrication rate number you can gate CI on.

AI code review · LLM hallucination check · verify AI citations · pre-merge AI plan review · AI agent guardrails · Claude Code / Cursor / Copilot hook · MCP server · GitHub Action

README family: English · 한국어 · Easy start · 쉬운 한국어 · Deep docs: docs/ · Claim ledger (한국어)

pip install antemortem

Quick start

pip install antemortem
export ANTHROPIC_API_KEY=...   # or OPENAI_API_KEY / GEMINI_API_KEY; Ollama needs no key

antemortem init   my-feature                          # scaffold the recon doc
antemortem doctor my-feature.md --repo .              # preflight: no API call
antemortem run    my-feature.md --repo .              # one call → classifications + citations
antemortem lint   my-feature.md --repo .              # re-verify every citation offline
antemortem gate   my-feature.md --repo .              # enforce the decision policy in CI

No key to try it? Jump to Try it in 30 seconds for the offline demo.

The problem, in one screen
The headline: a hard fabrication-rate number
The hero: your agent runs it on itself (MCP + CI)
Try it in 30 seconds — no API key
Why this works — the two ideas
The full loop (with an API key)
Multi-provider — including local, keyless Ollama
When NOT to use it

The problem, in one screen

In 2026 your agent — Claude Code, Cursor, Copilot, Aider — writes the plan and the patch, then tells you, confidently, that it's safe against your existing code. You have no fast way to check whether it actually read your repo or just hallucinated a reassuring answer.

LLMs are fluent. Fluency is not evidence. An agent will cite auth/middleware.py:48 for a claim that line 48 doesn't support — or for a line that doesn't exist. You find out at runtime.

antemortem is the gate that catches that. It forces the model to ground every verdict in a real file:line, then re-reads each cited line offline against the disk. The output isn't prose you have to believe — it's a machine-checkable PASS / FAIL plus a fabrication rate:

trap t1: "refresh path leaves the old session cookie live"
  model says: REAL   cite auth/middleware.py:45-52
  antemortem lint → lines 45-52 exist, evidence hash matches disk   ✓ VERIFIED

trap t2: "race on concurrent refresh"
  model says: GHOST  cite auth/token.py:72
  antemortem lint → file has 60 lines; line 72 does not exist       ✗ FABRICATED → exit 1

If the agent invented a citation, the second case is exactly what you see. The lie does not merge.

The headline: a hard fabrication-rate number

antemortem metrics answers exactly one question — is the model citing real evidence? — and prints the number that proves it. Point it at a run artifact; it reports verified vs fabricated vs unresolved citations and a fabrication rate, then fails CI when that rate is too high:

# How often did the model hallucinate its own evidence?
antemortem metrics antemortem/feat.json --repo .
#   Citations: verified=7, fabricated=1, unresolved=2, cited=8, total=10
#   Fabrication rate: 12.5% of cited
#   Status: FAIL (fabricated citations present)

# Zero tolerance in CI — any fabricated citation fails the job (exit 4):
antemortem metrics antemortem/feat.json --repo . --fail-over 0 --format json

--format json emits a stable antemortem-citation-metrics-v1 summary; --fail-over <rate> exits 4 (policy gate) when fabricated / cited exceeds your threshold. This is the LLM-hallucination check, reduced to one auditable percentage.

Need a shareable artifact instead of a console line? antemortem report renders the same run into a single-file Markdown or HTML scorecard — decision verdict, per-trap table, citation-verification status — self-contained (HTML inlines its own CSS) so you can attach it to a PR or publish it as a CI artifact:

antemortem report antemortem/feat.json --repo . --format html --out scorecard.html

1. The hero: your agent runs it on itself (MCP + CI)

Wire it into your coding agent (MCP server)

antemortem-mcp is a Model Context Protocol server. Plug it into Claude Code (or any MCP client) and your agent gains three tools it can call on its own output before asking you to merge:

MCP tool	What the agent does with it
`scaffold`	Opens a recon doc for the change it's about to make.
`run`	Classifies each risk against the actual repo files as `REAL` / `GHOST` / `NEW` / `UNRESOLVED`, every non-`UNRESOLVED` verdict carrying a `file:line` citation.
`lint`	Re-verifies those citations offline against the disk. Zero LLM calls. Catches the model hallucinating its own evidence.

The MCP server exposes exactly these three tools (scaffold, run, lint). The CI-facing surface — gating, fabrication metrics, scorecards — runs as the CLI in your pipeline, not as MCP tools. One paste into .mcp.json (or claude_desktop_config.json for the desktop client):

{
  "mcpServers": {
    "antemortem": {
      "command": "python",
      "args": ["-m", "antemortem.mcp"],
      "env": { "ANTHROPIC_API_KEY": "sk-ant-..." }
    }
  }
}

pip install "antemortem[mcp]"

The server speaks stdio by default — what Claude Code expects. Need network transport? python -m antemortem.mcp --http. Confine the agent's filesystem reach with ANTEMORTEM_WORKSPACE_ROOT, so every path it passes must resolve under one root. Now the agent can't just say "I checked the repo" — it produces an artifact whose every claim points at a line on disk, and lint decides whether that line backs the claim. Self-review you can audit, not self-review you have to trust. Full setup in docs/MCP.md.

Block the PR in CI — one `uses:` step

Because antemortem ships a composite GitHub Action (action.yml at the repo root), the whole gate is one step — no glue, no install script. It installs antemortem from PyPI, runs the offline lint + decision gate, and fails the job on a blocked decision or a fabricated citation:

# .github/workflows/antemortem.yml
name: antemortem gate
on: [pull_request]
jobs:
  recon-gate:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: hibou04-ops/antemortem-cli@v0.11.1
        with:
          document: antemortem/my-feature.md
          repo: .
          allow: SAFE_TO_PROCEED,PROCEED_WITH_GUARDS

The action's JSON summary (schema: antemortem-gate-v1) carries the verdict, decision, and fabricated-citation metrics, exposed as a step summary output for downstream steps. Prefer the raw CLI? The same gate is one shell line:

      - run: pip install antemortem
      # Re-verify every citation against this checkout, then enforce the policy.
      - run: antemortem gate antemortem/my-feature.md --repo .

antemortem gate runs the offline lint first (citations + evidence hashes against disk), then enforces a decision allowlist. The four labels are deterministic — same artifact in, same verdict out, no model call:

Decision	Meaning
`SAFE_TO_PROCEED`	No real risks remain.
`PROCEED_WITH_GUARDS`	Real risks exist, each has a remediation.
`NEEDS_MORE_EVIDENCE`	Too much unresolved, or citations don't hold.
`DO_NOT_PROCEED`	A high-severity risk with no mitigation.

Default allowlist is SAFE_TO_PROCEED,PROCEED_WITH_GUARDS. Exit codes are stable: 0 pass · 1 validation/citation failure · 2 usage error · 3 provider failure · 4 policy gate blocked. CI branches on the exit code — it never reads prose. Full reference in docs/GITHUB_ACTION.md.

Gate the patch the agent actually wrote

Don't trust a hand-listed file scope — derive it from the diff. antemortem run --diff <ref> reads the changed files straight from a git diff and audits exactly that patch:

antemortem run antemortem/my-feature.md --repo . --diff origin/main
antemortem gate antemortem/my-feature.md --repo . --format json

--diff accepts staged, working, or any git ref/range (HEAD~1, origin/main, a..b); the changed files are merged into the recon scope. So the gate covers the agent's real changes, not just the ones it chose to mention.

"Just ask the agent to review it" — and why that loses

The free competitor is zero-friction: tell your agent "review this plan against the repo." It will. It'll sound thorough. An AI grading its own homework with no answer key writes whatever sounds right. Here's the gap antemortem closes — and how it differs from a post-diff PR-review bot (CodeRabbit, Copilot review, etc.):

	Ask the agent to review it	PR-review bots (CodeRabbit, etc.)	antemortem
When it runs	chat, ad-hoc	after the diff exists	before the diff — on the plan
Who frames the risk list	the agent (rubber-stamps)	the bot	you do, before it sees code
Every claim cites `file:line`	no	sometimes	yes (schema-enforced)
Citations re-verified on disk	no	no	yes (`lint`, offline, deterministic)
Catches a fabricated citation	no	no	yes (the run fails)
Hard fabrication-rate number	no	no	yes (`metrics`, gate on it)
Machine pass/fail a PR can gate on	no	partial	yes (stable exit codes)
Persistent, re-checkable artifact	no (a chat message)	review comments	yes (markdown + JSON + scorecard)

That answer key is antemortem — checked by a program, not by the agent's own confidence.

Try it in 30 seconds — no API key

The bundled demo replays a real recon from stored output, so no key and no network are needed. The lint at the end is the live offline check:

git clone https://github.com/hibou04-ops/antemortem-cli.git
cd antemortem-cli && pip install -e ".[mcp]"

# 4 traps → REAL / GHOST / NEW / UNRESOLVED → a decision (pre-recorded, offline)
PYTHONIOENCODING=utf-8 python examples/demo_replay.py

# now machine-verify every file:line and evidence hash against disk
antemortem lint examples/demo_antemortem.md --repo .

# ...and reduce it to one fabrication-rate number
antemortem metrics examples/demo_antemortem.json --repo .

lint exits 0 if every citation checks out on disk, 1 if any is fabricated or stale. That single exit code is the whole product: a deterministic, offline answer to "did the AI lie about the codebase?"

Why this works — the two ideas (this is the word "antemortem")

A post-mortem asks why something already failed. An antemortem runs the autopsy before the patient — your change — is even born: it interrogates an AI's plan against the real code before the first keystroke. That word is the whole method, and two mechanisms make the review impossible to rubber-stamp:

Anchoring defense. You enumerate the risks ("traps") before the model sees the code. The model never frames your risk list, so it can't quietly agree with its own framing and call it review. It has to take a position on your hypotheses, against your file scope.
Hallucination-proof review. Every non-UNRESOLVED verdict is a machine-verifiable disk citation, and a deterministic offline lint fails the run if a citation is fabricated. When the artifact carries an evidence_hash or snippet, lint also confirms the cited text hasn't drifted. The model's confidence is irrelevant — only the disk decides.

Label	Meaning	Evidence required
`REAL`	The code confirms the risk.	`file:line` where it surfaces
`GHOST`	The code disproves it (already handled).	`file:line` that contradicts it
`NEW`	A risk the model found that you missed.	`file:line` of the raising code
`UNRESOLVED`	No evidence either way. Honest, not a failure.	none (explanation required)

No mainstream tool turns LLM reconnaissance into a lint-able, gate-able CI artifact like this. The full trust model — what it verifies and what it deliberately does not — is in docs/trust_model.md (한국어).

The full loop (with an API key)

export ANTHROPIC_API_KEY=...   # or OPENAI_API_KEY / GEMINI_API_KEY / GOOGLE_API_KEY; Ollama needs no key

antemortem init    my-feature                          # scaffold the recon doc
#   edit antemortem/my-feature.md:
#     § Spec   — the change   § Traps — YOUR risk hypotheses   § Files — the scope
antemortem doctor  antemortem/my-feature.md --repo .   # preflight: no API call
antemortem run     antemortem/my-feature.md --repo .   # one call → classifications + citations
antemortem lint    antemortem/my-feature.md --repo .   # re-verify every citation offline
antemortem gate    antemortem/my-feature.md --repo .   # enforce the decision policy
antemortem metrics antemortem/my-feature.json --repo . # fabrication-rate number
antemortem report  antemortem/my-feature.json --repo . --format html --out scorecard.html

That is the full command surface — 9 commands: init, doctor, run, lint, evidence, gate, eval, metrics, report. evidence fills/checks evidence hashes (no provider call), eval scores offline golden cases, and --strict-evidence on lint requires the cited text, not just the line range, to be unchanged.

Multi-provider — including local, keyless Ollama

The discipline is vendor-neutral; the LLM is the only pluggable seam. First-class adapters ship for anthropic, openai, gemini, and ollama (local, keyless inference via a running daemon), plus any OpenAI-compatible endpoint (Azure, Groq, Together, OpenRouter) via --provider openai --base-url <url>:

antemortem run antemortem/feat.md --repo . --provider ollama          # local, no API key
antemortem run antemortem/feat.md --repo . --provider openai --base-url https://...  # compatible endpoint

Every adapter uses the provider's native structured-output path and validates the result against the same Pydantic schema — no client-side JSON regex anywhere. The CLI stays model-agnostic; pass --model to pin any model. Local and partially-compatible endpoints vary in structured-output fidelity, so lint stays mandatory there — which is the point: the disk check doesn't trust the model, regardless of vendor. The full capability matrix is in docs/provider_compatibility.md.

When NOT to use it

Skip antemortem for trivial changes (typo, version bump), exploratory spikes with no spec yet, and hot-fixes where speed beats discipline. It validates your plan against existing code — it doesn't catch runtime bugs that live outside the files. It's a screening gate that runs before code review, tests, and design review, where changing direction is cheapest — not a replacement for them. Where antemortem sits relative to the rest of the toolkit is mapped in docs/toolkit_positioning.md (한국어).

Built on real engineering: an offline test suite (python -m pytest -q, zero network in normal CI), an MCP server, four first-class providers, a deterministic decision gate, and a composite GitHub Action. The public claim surface is generated from source and self-checked — see docs/generated/claims.md (한국어), validated by python scripts/check_repo_consistency.py.

pip install "antemortem[mcp]"      # MCP server + CLI
pip install antemortem             # CLI only
pip install "antemortem[ollama]"   # add local Ollama support

Deep dives in docs/: CLI reference & exit codes · trust model · GitHub Action · MCP setup · schema (src/antemortem/schema.py) · decision rules (src/antemortem/decision.py). Methodology origin: Antemortem.

Same idea, other surfaces (the Hibou04 toolkit — a verification gate where AI workflows skip one): omegaprompt gates an overfit prompt · omega-lock gates an optimizer's winning score on held-out data.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

hibou

These details have not been verified by PyPI

Project links

Methodology

Release history Release notifications | RSS feed

This version

0.11.1

Jun 12, 2026

0.11.0

Jun 12, 2026

0.10.6

Jun 7, 2026

0.10.5

Jun 7, 2026

0.10.2

May 22, 2026

0.10.0

May 21, 2026

0.9.4

May 7, 2026

0.9.3

May 7, 2026

0.9.0

May 7, 2026

0.8.0

May 6, 2026

0.7.0

May 2, 2026

0.6.0

Apr 29, 2026

0.5.0

Apr 28, 2026

0.4.0 yanked

Apr 21, 2026

0.3.0 yanked

Apr 21, 2026

0.2.0 yanked

Apr 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

antemortem-0.11.1.tar.gz (124.7 kB view details)

Uploaded Jun 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

antemortem-0.11.1-py3-none-any.whl (126.3 kB view details)

Uploaded Jun 12, 2026 Python 3

File details

Details for the file antemortem-0.11.1.tar.gz.

File metadata

Download URL: antemortem-0.11.1.tar.gz
Upload date: Jun 12, 2026
Size: 124.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for antemortem-0.11.1.tar.gz
Algorithm	Hash digest
SHA256	`36da5287a22e2cea5accd4fef9b63bd383606d07abb83817753203df40129153`
MD5	`6b8cec8a03138365e1ea1056ed81f072`
BLAKE2b-256	`65ad34a24e6f8e254d90acee992c9c27ce75b049ba551ec4ca41ec7c4b391904`

See more details on using hashes here.

Provenance

The following attestation bundles were made for antemortem-0.11.1.tar.gz:

Publisher: publish.yml on hibou04-ops/antemortem-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: antemortem-0.11.1.tar.gz
- Subject digest: 36da5287a22e2cea5accd4fef9b63bd383606d07abb83817753203df40129153
- Sigstore transparency entry: 1802873379
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: hibou04-ops/antemortem-cli@c02c0dd1a6cc7136512dea6196fb42ac67fc4249
- Branch / Tag: refs/tags/v0.11.1
- Owner: https://github.com/hibou04-ops
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@c02c0dd1a6cc7136512dea6196fb42ac67fc4249
- Trigger Event: release

File details

Details for the file antemortem-0.11.1-py3-none-any.whl.

File metadata

Download URL: antemortem-0.11.1-py3-none-any.whl
Upload date: Jun 12, 2026
Size: 126.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for antemortem-0.11.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`99a6602aefaeda37027771ea83ad7c3bfa851bc18f9009764345257bc98efc7b`
MD5	`b0ac3e0e58386fd5990766c0e66f6458`
BLAKE2b-256	`8776e046aeba46a9730e0b0c77270f3ce187cb1a1f79ceec40e9752eb748be81`

See more details on using hashes here.

Provenance

The following attestation bundles were made for antemortem-0.11.1-py3-none-any.whl:

Publisher: publish.yml on hibou04-ops/antemortem-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: antemortem-0.11.1-py3-none-any.whl
- Subject digest: 99a6602aefaeda37027771ea83ad7c3bfa851bc18f9009764345257bc98efc7b
- Sigstore transparency entry: 1802873579
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: hibou04-ops/antemortem-cli@c02c0dd1a6cc7136512dea6196fb42ac67fc4249
- Branch / Tag: refs/tags/v0.11.1
- Owner: https://github.com/hibou04-ops
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@c02c0dd1a6cc7136512dea6196fb42ac67fc4249
- Trigger Event: release

antemortem 0.11.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

antemortem — does your AI agent actually read your code, or just sound like it?

Quick start

Table of Contents

The problem, in one screen

The headline: a hard fabrication-rate number

1. The hero: your agent runs it on itself (MCP + CI)

Wire it into your coding agent (MCP server)

Block the PR in CI — one uses: step

Gate the patch the agent actually wrote

"Just ask the agent to review it" — and why that loses

Try it in 30 seconds — no API key

Why this works — the two ideas (this is the word "antemortem")

The full loop (with an API key)

Multi-provider — including local, keyless Ollama

When NOT to use it

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Block the PR in CI — one `uses:` step