papercheck

A reproducible audit harness for mathematical LaTeX papers.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

CGarryZA

These details have not been verified by PyPI

Project description

A reproducible audit harness for mathematical LaTeX papers.

papercheck extracts a paper's structure, records findings in schema-validated ledgers that are checked against the exact source text, and runs a final gate that reports whether the paper is ready. It runs as a command-line tool, or as an MCP server that an agent (Claude Code, Codex, Cursor, or similar) drives while you supply the mathematical judgement.

The point is discipline. Instead of asking one model to "review the paper," papercheck segments the manuscript, inventories the claims, runs narrow hostile audits, verifies each finding against the source, adjudicates, patches only what was accepted, and gates. The gates are enforced in code: an agent cannot patch before adjudication, and a finding whose quote does not appear in the source never enters the ledger.

The staged pipeline — segment, budget, specialist review, synthesize, adjudicate — follows the approach described in Google Research's Paper Assistant Tool (PAT) [1], adapted into a small, open, model-agnostic harness that runs no models of its own.

What it is

A deterministic Python core with two frontends — a CLI and an MCP server. papercheck itself never calls a model; the reasoning lives in the agent or in your hands at the CLI. What the core provides:

LaTeX-AST structure extraction (theorems, labels, references, citations, equations, draft markers) into structure.json.
Schema-validated issue ledgers, each finding tied to a file and line.
Quote verification at intake: if a finding's quoted text is not in the source, it is rejected as REJECTED_SOURCE_TARGET_INVALID before it reaches the proposed ledger.
A stage-gated state machine (INIT → SCANNED → … → ADJUDICATED → … → GATED) that refuses out-of-order operations.
A CLI, an MCP server (29 tools), and a local web UI.

Why

A single "review my paper" prompt tends to fail in three ways. papercheck handles each in code rather than in prompting:

Failure mode	What usually happens	How papercheck handles it
Hallucinated findings	the model invents a problem that isn't in the text	`submit_issue` matches the quoted text against the source; no match means the finding is rejected before it enters the ledger
Patch before proof	the model rewrites before anyone knows what is real	patches are refused unless the state is at `ADJUDICATED` and the issue is `ACCEPTED`
Skipped gate	"looks good to me" with no audit trail	the final gate is computed from mechanical signals and accepted blockers, and returns one of a fixed set of verdicts

Quickstart

pipx install papercheck        # or: pip install papercheck

papercheck scan     path/to/paper     # extract structure.json
papercheck segments path/to/paper     # propose audit segments and budgets
papercheck gate     path/to/paper --mechanical-only
papercheck report   path/to/paper     # self-contained HTML report
papercheck serve    path/to/paper     # local web UI

Run it on the bundled example (prints ==== READY ====):

papercheck gate examples/toy_clean_paper --mechanical-only

Driving it from an agent (MCP)

papercheck ships a FastMCP server exposing 29 tools plus the audit prompt pack. Register it once:

claude mcp add papercheck -- papercheck-mcp

Then ask the agent to audit a paper. It walks the workflow through the MCP tools — init_audit, run_scan, propose_segments, submit_issue, adjudicate_issue, run_gate — under the same code-enforced gates as the CLI.

Generating a paper-specific domain pack

papercheck does not call a model, so tailoring the audit to a paper's field is a two-step split: the agent reads the paper and drafts the pack, papercheck validates and stores it.

papercheck packs scaffold --paper-root ./paper            # deterministic draft from the scan
papercheck packs create draft.json --paper-root ./paper   # validated -> Paper_Audit/domain_pack.json

The same is available through the scaffold_domain_pack and create_domain_pack MCP tools. Generic packs ship for stochastic analysis, PDE, numerical analysis, optimization, machine-learning theory, and a general fallback.

How it works

flowchart LR
    A[INIT] --> B[SCANNED] --> C[SEGMENTED] --> D[INVENTORIED] --> E[AUDITING]
    E --> F[SYNTHESIZED] --> G[ADJUDICATED] --> H[PATCH_PLANNED]
    H --> I[PATCHING] --> J[REGRESSED] --> K[GATED]

    E -. submit_issue .-> V{quote and label verified?}
    V -- no --> X[REJECTED_SOURCE_TARGET_INVALID]
    V -- yes --> L[PROPOSED ledger]
    style X fill:#7f1d1d,stroke:#ef4444,color:#fff
    style K fill:#14532d,stroke:#22c55e,color:#fff

One deterministic core, two frontends, no LLM calls inside the harness. papercheck stays strictly an MCP server and CLI — no provider adapters, no self-orchestration — so it works with any model and makes no network calls of its own.

Commands

Command	What it does
`papercheck init`	create the `Paper_Audit/` workspace and state file
`papercheck scan`	LaTeX-AST structure extraction into `structure.json`
`papercheck segments`	heuristic segment and budget proposal
`papercheck gate [--mechanical-only]`	compute the final verdict (exit 0 = READY)
`papercheck verify-quote`	check a quote against a source file
`papercheck report`	self-contained HTML audit report
`papercheck serve`	local web UI (filter issues, click through to source)
`papercheck compare old/ new/`	structural diff of two paper versions
`papercheck profile list\|show`	advisory audit pipelines (`quick`, `arxiv`, `full`, …)
`papercheck packs …`	list, scaffold, or create domain packs
`papercheck prompts list\|show`	the vendored audit prompt pack
`papercheck mcp`	run the MCP server (stdio)

For a paper repository, docs/ci.md describes a mechanical, LLM-free GitHub Action that runs scan and gate --mechanical-only on every pull request and sends nothing to any model provider.

Limitations

See docs/limitations.md for the full version.

papercheck is not a theorem prover and not a replacement for peer review.
Catching a semantic error depends entirely on the driving model. papercheck keeps the process disciplined and traceable, but AI findings must be checked independently.
papercheck makes no network calls. The agent you drive it with may transmit your manuscript to its model provider, so review the relevant data terms before auditing unpublished work (docs/privacy.md).

About

Built by Christian Garry.

Contributing

Contributions welcome — see CONTRIBUTING.md. The architecture is fixed for the 0.x line: strictly MCP and CLI, JSON as the source of truth, gates enforced in code. Prompt changes are guarded by the eval fixtures described in docs/agent_eval.md.

References

Rajesh Jayaram, Drew Tyler, David Woodruff, Corinna Cortes, Yossi Matias, Vahab Mirrokni, and Vincent Cohen-Addad. Towards Automating Scientific Review with Google's Paper Assistant Tool. arXiv:2606.28277, 2026. https://arxiv.org/abs/2606.28277

BibTeX for the above is in docs/references.bib.

License

MIT — see LICENSE. If papercheck is useful in your work, cite it via CITATION.cff.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

CGarryZA

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.4

Jul 3, 2026

0.3.3

Jul 3, 2026

0.3.2

Jul 2, 2026

0.3.1

Jul 2, 2026

0.3.0

Jul 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

papercheck-0.3.4.tar.gz (920.9 kB view details)

Uploaded Jul 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

papercheck-0.3.4-py3-none-any.whl (90.0 kB view details)

Uploaded Jul 3, 2026 Python 3

File details

Details for the file papercheck-0.3.4.tar.gz.

File metadata

Download URL: papercheck-0.3.4.tar.gz
Upload date: Jul 3, 2026
Size: 920.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for papercheck-0.3.4.tar.gz
Algorithm	Hash digest
SHA256	`961d59b70bd71bdb48ba40621331d2dadda864dd4688688d221c20330400230c`
MD5	`89073b3889984c8f7a5cc6acbab32427`
BLAKE2b-256	`566a2cfa083210d79b5f6284ea8caa77cf7af75d2c838612fd9743ae6d01bcec`

See more details on using hashes here.

Provenance

The following attestation bundles were made for papercheck-0.3.4.tar.gz:

Publisher: publish.yml on cgarryZA/papercheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: papercheck-0.3.4.tar.gz
- Subject digest: 961d59b70bd71bdb48ba40621331d2dadda864dd4688688d221c20330400230c
- Sigstore transparency entry: 2057447926
- Sigstore integration time: Jul 3, 2026
Source repository:
- Permalink: cgarryZA/papercheck@8e66cc957d829847ceea1d086a61b2fd9c403f1f
- Branch / Tag: refs/tags/v0.3.4
- Owner: https://github.com/cgarryZA
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@8e66cc957d829847ceea1d086a61b2fd9c403f1f
- Trigger Event: release

File details

Details for the file papercheck-0.3.4-py3-none-any.whl.

File metadata

Download URL: papercheck-0.3.4-py3-none-any.whl
Upload date: Jul 3, 2026
Size: 90.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for papercheck-0.3.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`46fc8912b4412200f70ceafec041397422876a201e3bcc285767c503222e7bb1`
MD5	`30cf69ec4a3cd46ee413c30787df0a62`
BLAKE2b-256	`6beb5a7310614b76bc184dcdf1241539d7a963fd9fb825c112abdd7674a2819e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for papercheck-0.3.4-py3-none-any.whl:

Publisher: publish.yml on cgarryZA/papercheck

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: papercheck-0.3.4-py3-none-any.whl
- Subject digest: 46fc8912b4412200f70ceafec041397422876a201e3bcc285767c503222e7bb1
- Sigstore transparency entry: 2057448064
- Sigstore integration time: Jul 3, 2026
Source repository:
- Permalink: cgarryZA/papercheck@8e66cc957d829847ceea1d086a61b2fd9c403f1f
- Branch / Tag: refs/tags/v0.3.4
- Owner: https://github.com/cgarryZA
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@8e66cc957d829847ceea1d086a61b2fd9c403f1f
- Trigger Event: release

papercheck 0.3.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

What it is

Why

Quickstart

Driving it from an agent (MCP)

How it works

Commands

Limitations

About

Contributing

References

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance