lyingdocs

Autonomous documentation-code misalignment detection using LLM agents

LyingDocs

A trust layer for your repository.

Detect when your docs, code, configs, and examples stop agreeing with each other.

Modern repositories are read by more than humans.

They are read by teammates, new contributors, users, reviewers, downstream integrators — and increasingly by AI agents.

That only works if the repository can be trusted.

But trust quietly erodes over time:

- documentation describes features that were never shipped
- code behavior drifts away from the spec
- examples stop matching reality
- values claimed to be configurable are hardcoded deep in the codebase
- papers and implementation tell different stories

LyingDocs is a trust layer for your repository.
It audits the gap between what your repo says and what your code actually does — before your users, contributors, or agents learn the wrong thing.

Why LyingDocs exists

Every codebase accumulates invisible trust debt.

In the age of fast iteration and LLM-assisted development, teams now ship code and documentation faster than ever — but not always in sync. A repo may still look polished while becoming progressively less reliable as a source of truth.

That is the problem LyingDocs is built to solve.

LyingDocs is not just a documentation checker. It is a system for surfacing repository misalignment:

- docs that overclaim
- code paths that are undocumented
- specs that no longer match implementation
- "configurable" behavior that is actually fixed
- claims in papers or READMEs that cannot be supported by the code

The goal is simple:

Keep your repository trustworthy for humans and machines.

What LyingDocs does

LyingDocs deploys two autonomous agents against your repository:

- Hermes reads your documentation, plans an audit strategy, and decides what needs to be verified
- Argus investigates the actual codebase and reports what the code really does

Hermes then reconciles the two and writes a structured report of the mismatches it finds.

This lets you catch cases where your repository is no longer telling the truth about itself.

How it works

1. Hermes reads what the repo claims

Hermes traverses your documentation and extracts claims, assumptions, and implementation promises from sources such as:

- docs/
- README files
- setup guides
- usage examples
- configuration references
- papers and research writeups

It then plans an audit by turning those claims into targeted investigation tasks.

2. Argus checks what the code actually does

Argus executes each task against your real codebase.

You can choose the backend that best fits your setup:

- codex: OpenAI Codex CLI subprocess
- claude_code: Claude Code CLI subprocess (claude -p)
- local: built-in minimal agent loop that uses filesystem tools and calls any OpenAI-compatible API directly

3. LyingDocs reports the trust gaps

Hermes reconciles documented claims with observed implementation behavior and outputs a report of misalignments.

These findings can then be reviewed by maintainers, turned into issues, and eventually enforced in CI.

Positioning

LyingDocs is best thought of as:

- a trust layer for your repo
- a docs-to-code alignment guard
- a pre-user warning system for misleading documentation
- a future CI / GitHub Action quality gate for repository truthfulness

It is not meant to be a tool you manually open every day.

It is meant to become something your repository runs automatically:

- on pull requests
- before releases
- during scheduled audits
- before docs deployment
- as part of your GitHub Actions workflow

Installation

pip install lyingdocs

Quick Start

export OPENAI_API_KEY="sk-..."

lyingdocs analyze --doc-path docs/ --code-path . -o output/audit

This performs a full audit of your repository and produces a report describing where documentation and implementation no longer align.

Example use cases

Use LyingDocs when you want to answer questions like:

- Does the README still reflect the real behavior of the project?
- Are our examples and quickstarts still valid?
- Did code change without the docs changing with it?
- Are we claiming configuration that does not actually exist?
- Does our paper describe behavior the implementation does not support?
- Can an AI agent trust this repository as a source of truth?

Misalignment categories

Category	Description
LogicMismatch	Code contradicts documentation
PhantomSpec	Documentation describes non-existent features
ShadowLogic	Important code behavior exists but is undocumented
HardcodedDrift	Supposedly configurable values are actually hardcoded

These categories represent different ways repository trust breaks down.
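
As a rough illustration of how the four categories could be handled downstream, the sketch below models them as a Python enum and tallies a few made-up findings. The `Finding` record, its field names, and the sample paths are hypothetical. They are not the tool's actual report schema, which this README does not specify.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class Category(Enum):
    """The four misalignment categories from the table above."""
    LOGIC_MISMATCH = "LogicMismatch"    # code contradicts documentation
    PHANTOM_SPEC = "PhantomSpec"        # docs describe non-existent features
    SHADOW_LOGIC = "ShadowLogic"        # important behavior is undocumented
    HARDCODED_DRIFT = "HardcodedDrift"  # "configurable" values are hardcoded


@dataclass(frozen=True)
class Finding:
    """Hypothetical record shape, used only for this illustration."""
    category: Category
    doc_ref: str
    code_ref: str


def tally(findings):
    """Count findings per category name."""
    return Counter(f.category.value for f in findings)


# Made-up sample data.
findings = [
    Finding(Category.PHANTOM_SPEC, "README.md", "(no matching code)"),
    Finding(Category.HARDCODED_DRIFT, "docs/config.md", "src/limits.py"),
    Finding(Category.PHANTOM_SPEC, "docs/usage.md", "(no matching code)"),
]
print(tally(findings))
```

A per-category tally like this is one way a maintainer might decide what to triage first.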

Configuration

LyingDocs loads configuration from multiple sources, with later sources overriding earlier ones:

1. Built-in defaults (OpenAI API, gpt-5.4)
2. Config file: lyingdocs.toml in the project root, or ~/.config/lyingdocs/config.toml
3. Environment variables / .env
4. CLI arguments
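
The override order above can be pictured as a left-to-right merge of layered dictionaries. This is a minimal sketch of that idea, not LyingDocs' actual loader: `merge_layers` is a hypothetical helper, and the mapping from `ARGUS_BACKEND` or `--hermes-model` to nested keys is an assumption made for illustration.

```python
def merge_layers(*layers):
    """Merge config dicts left to right; later layers override earlier ones.

    Handles one level of nesting ({section: {key: value}}), which is enough
    for this illustration. Input dicts are not modified.
    """
    merged = {}
    for layer in layers:
        for section, values in layer.items():
            merged.setdefault(section, {}).update(values)
    return merged


# 1. Built-in defaults (values taken from the README's examples).
defaults = {
    "hermes": {"model": "gpt-5.4", "base_url": "https://api.openai.com/v1"},
    "argus": {"model": "gpt-5.4"},
}
# 2. Config file.
file_cfg = {"argus": {"backend": "codex"}}
# 3. Environment, e.g. ARGUS_BACKEND=local (assumed mapping).
env_cfg = {"argus": {"backend": "local"}}
# 4. CLI, e.g. --hermes-model some-other-model (placeholder model name).
cli_cfg = {"hermes": {"model": "some-other-model"}}

config = merge_layers(defaults, file_cfg, env_cfg, cli_cfg)
print(config["argus"]["backend"])  # the environment value wins over the file
print(config["hermes"]["model"])   # the CLI value wins over the default
```

Keys that no later layer touches, such as the Hermes base URL here, keep their earlier value.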

Hermes and Argus are configured independently, so you can use:

- a cheaper planning model for Hermes
- a stronger coding / investigation model for Argus
- different providers or endpoints for each agent

Config file example

Example configs live in tests/configs.

[hermes]
model = "gpt-5.4"
base_url = "https://api.openai.com/v1"
# api_key_env = "OPENAI_API_KEY"  # optional — defaults to OPENAI_API_KEY

[argus]
backend = "local"           # "codex" | "claude_code" | "local"
model = "gpt-5.4"
base_url = "https://api.openai.com/v1"
# api_key_env = "OPENAI_API_KEY"

# Only read when argus.backend = "codex"
[argus.codex]
provider = "openai"
wire_api = "responses"
# path = "/usr/local/bin/codex"   # optional: explicit codex binary path

# Only read when argus.backend = "claude_code"
[argus.claude_code]
# path = "/usr/local/bin/claude"  # optional: explicit claude binary path

# Only read when argus.backend = "local"
[argus.local]
max_iterations = 25         # per-task agent loop cap
max_read_bytes = 200000     # per read_file call

[limits]
max_dispatches = 20         # max Argus dispatches per Hermes run
max_iterations = 50         # max Hermes loop iterations
argus_task_timeout = 1200   # seconds per Argus task (codex / claude_code backends)
token_budget = 524288       # Hermes context budget before compression

Environment variables

Variable	Description
`OPENAI_API_KEY`	Required unless overridden via `api_key_env`
`HERMES_MODEL`	Hermes model name
`HERMES_BASE_URL`	Hermes API base URL
`ARGUS_BACKEND`	`codex`, `claude_code`, or `local`
`ARGUS_MODEL`	Argus model name
`ARGUS_BASE_URL`	Argus API base URL
`ARGUS_CODEX_PROVIDER`	Codex backend provider
`ARGUS_CODEX_WIRE_API`	Codex backend wire API (`responses` or `chat`)
`ARGUS_CODEX_PATH`	Explicit path to `codex`
`ARGUS_CLAUDE_CODE_PATH`	Explicit path to `claude`
`ARGUS_TASK_TIMEOUT`	Timeout per Argus task in seconds
`TOKEN_BUDGET`	Hermes context budget before compression

Argus backends

Argus is the deep code analysis side of the system.

`local`

No external CLI required. Uses a built-in agent loop with filesystem tools and an OpenAI-compatible API.

Good default for getting started.

[argus]
backend = "local"
model = "gpt-5.4"
base_url = "https://api.openai.com/v1"

`codex`

Uses the OpenAI Codex CLI, which can be installed with npm:

npm install -g @openai/codex

[argus]
backend = "codex"

[argus.codex]
provider = "openai"
wire_api = "responses"

Resolution order:

1. explicit path from config
2. system PATH
3. local node_modules/.bin/codex
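
The resolution order above could look roughly like this in Python. `resolve_codex` is a hypothetical helper written for illustration. In particular, falling through to the next step when an explicit path is set but not executable is an assumption, since the README does not say how that case is handled.

```python
import os
import shutil
from pathlib import Path


def resolve_codex(explicit_path=None, project_root="."):
    """Return the first usable codex binary path, or None if none is found."""
    # 1. Explicit path from config.
    if explicit_path and os.path.isfile(explicit_path) and os.access(explicit_path, os.X_OK):
        return str(explicit_path)
    # 2. System PATH.
    found = shutil.which("codex")
    if found:
        return found
    # 3. Local node_modules/.bin/codex under the project root.
    local = Path(project_root) / "node_modules" / ".bin" / "codex"
    if local.is_file() and os.access(local, os.X_OK):
        return str(local)
    return None


# Prints a path if codex is installed, otherwise None.
print(resolve_codex(project_root="."))
```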

`claude_code`

Uses the Claude Code CLI.

[argus]
backend = "claude_code"
model = "claude-sonnet-4-6"

[argus.claude_code]
# path = "/usr/local/bin/claude"

Invoked as:

claude -p <prompt> --model <argus_model> --output-format text

with cwd set to your code root.
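
For readers who want to reproduce this invocation by hand, the following sketch builds the same argument list and runs it with `cwd` set to the code root. The function names are hypothetical, the sample prompt is made up, and the 1200-second default simply mirrors the `argus_task_timeout` value in the example config. It is not taken from the LyingDocs source.

```python
import subprocess


def build_claude_argv(prompt, model, binary="claude"):
    """Build the argv for the invocation shown above."""
    return [binary, "-p", prompt, "--model", model, "--output-format", "text"]


def run_claude(prompt, model, code_root, timeout=1200, binary="claude"):
    """Run the command with cwd set to the code root and return its stdout."""
    result = subprocess.run(
        build_claude_argv(prompt, model, binary),
        cwd=code_root,        # Claude Code sees the repository as its working dir
        capture_output=True,
        text=True,
        timeout=timeout,      # raises subprocess.TimeoutExpired when exceeded
        check=True,           # raises CalledProcessError on a non-zero exit
    )
    return result.stdout


# Building the argv does not require the claude binary to be installed.
argv = build_claude_argv("Is the retry limit configurable?", "claude-sonnet-4-6")
print(argv)
```

Passing the prompt as a single list element avoids shell quoting problems with long, multi-line prompts.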

CLI reference

# Full analysis
lyingdocs analyze --doc-path docs/ --code-path . -o output/audit

# Choose Argus backend
lyingdocs analyze --doc-path docs/ --code-path . --argus-backend=local

# Different models for Hermes and Argus
lyingdocs analyze --doc-path docs/ --code-path . \
  --hermes-model gpt-5.4 \
  --argus-model gpt-5.4

# Resume interrupted analysis
lyingdocs analyze --doc-path docs/ --code-path . --resume

# Use an explicit config file
lyingdocs analyze --doc-path docs/ --code-path . --config myconfig.toml

# Generate GitHub issue drafts
lyingdocs analyze --doc-path docs/ --code-path . --gen-issue

# Show version
lyingdocs version

Available flags, in addition to --doc-path, --code-path, and -o:

--hermes-model, --hermes-base-url, --argus-backend {codex,claude_code,local}, --argus-model, --argus-base-url, --argus-codex-provider, --argus-codex-wire-api, --max-dispatches, --max-iterations, --config, --resume, --gen-issue
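
When driving the CLI from a script, it can help to assemble the command as a list rather than a string. The helper below is a hypothetical sketch that uses only flags listed in this README. It builds the command but does not run it.

```python
import shlex

VALID_BACKENDS = {"codex", "claude_code", "local"}


def build_analyze_argv(doc_path, code_path, output=None, *, argus_backend=None,
                       config=None, resume=False, gen_issue=False):
    """Assemble a `lyingdocs analyze` command from the documented flags."""
    argv = ["lyingdocs", "analyze", "--doc-path", doc_path, "--code-path", code_path]
    if output:
        argv += ["-o", output]
    if argus_backend:
        if argus_backend not in VALID_BACKENDS:
            raise ValueError(f"unknown Argus backend: {argus_backend!r}")
        argv += ["--argus-backend", argus_backend]
    if config:
        argv += ["--config", config]
    if resume:
        argv.append("--resume")
    if gen_issue:
        argv.append("--gen-issue")
    return argv


argv = build_analyze_argv("docs/", ".", "output/audit", gen_issue=True)
print(shlex.join(argv))
# prints: lyingdocs analyze --doc-path docs/ --code-path . -o output/audit --gen-issue
```

The resulting list can be passed to `subprocess.run(argv, check=True)` once `lyingdocs` is installed.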

Generating GitHub issue drafts

Pass --gen-issue to automatically draft a GitHub issue after analysis:

lyingdocs analyze --doc-path docs/ --code-path . --gen-issue

LyingDocs uses Hermes to synthesize findings into a single, polite GitHub issue and saves it to issue.json in the output directory.

The file contains:

- title: a short issue title
- body: a GitHub-flavored Markdown issue body listing findings, code references, doc references, and a note acknowledging possible false positives

You can post it directly with the gh CLI:

gh issue create \
  --title "$(jq -r '.title' output/issue.json)" \
  --body  "$(jq -r '.body'  output/issue.json)"
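
If jq is not available, the same step can be sketched in Python. The helper below reads the two documented keys from issue.json and builds the equivalent `gh issue create` command. `gh_issue_argv` is a hypothetical name, and the check for missing keys is an added safeguard rather than something LyingDocs provides.

```python
import json
from pathlib import Path


def gh_issue_argv(issue_path):
    """Read title and body from issue.json and build a `gh issue create` argv."""
    issue = json.loads(Path(issue_path).read_text(encoding="utf-8"))
    missing = {"title", "body"} - issue.keys()
    if missing:
        raise ValueError(f"issue file is missing keys: {sorted(missing)}")
    return ["gh", "issue", "create", "--title", issue["title"], "--body", issue["body"]]


# Usage, once an audit has produced the file:
#   import subprocess
#   subprocess.run(gh_issue_argv("output/issue.json"), check=True)
```

Reviewing the draft before posting is still worthwhile, since the issue body itself notes that findings may include false positives.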

This makes LyingDocs useful not only as an audit tool, but as a bridge into repository maintenance workflows.

GitHub Actions direction

LyingDocs is moving toward a natural next step:

continuous trust enforcement inside GitHub Actions

The long-term shape of the project is not “run this manually forever.” The long-term shape is:

- run on pull requests
- comment on suspicious docs/code drift
- warn maintainers before release
- surface trust regressions early
- make repository truthfulness part of CI

That is where LyingDocs becomes most valuable: not only as an analyzer, but as infrastructure.

Roadmap

- Multi-harness support: Argus runs on Codex, Claude Code, or a built-in local agent
- Issue generation: --gen-issue drafts GitHub issues from findings
- GitHub Action integration: run LyingDocs automatically in PRs and CI to catch trust regressions as they are introduced
- One-session memory support: Argus backends retain state across tasks for deeper multi-step investigations
- Deeper analysis: multi-hop reasoning across doc hierarchies and version-aware diffing to detect when code changed but docs did not
- Paper mode: treat academic papers as documentation and detect paper-to-code misalignment
- Auto-fix mode: Hermes proposes doc patches for human review and application

For researchers

A paper is also documentation.

It is a human-language description of code, behavior, claims, and expected results — often written under deadline, and often drifting away from the implementation over time.

If you want to know whether:

- your repo matches your paper
- your claims are supported by the code
- another researcher can trust your implementation

then LyingDocs can help.

The problem is the same. A paper is documentation for code. LyingDocs is for papers too.

Why “trust layer”

Because the problem is bigger than stale docs.

A repository becomes untrustworthy whenever its outward description and inward behavior drift apart.

That harms:

- users trying to adopt the project
- contributors trying to extend it
- maintainers trying to review changes
- researchers trying to reproduce results
- AI agents trying to understand the repo

LyingDocs exists to make that gap visible.

Not after users complain. Before.

License

MIT

Release history

0.1.4

Apr 16, 2026

This version

0.1.3

Apr 12, 2026

0.1.2

Apr 12, 2026

0.1.1

Apr 12, 2026

0.1.0

Apr 11, 2026

Download files

Source Distribution

lyingdocs-0.1.3.tar.gz (31.7 kB view details)

Uploaded Apr 12, 2026 Source

Built Distribution

lyingdocs-0.1.3-py3-none-any.whl (41.3 kB view details)

Uploaded Apr 12, 2026 Python 3

File details

Details for the file lyingdocs-0.1.3.tar.gz.

File metadata

Download URL: lyingdocs-0.1.3.tar.gz
Upload date: Apr 12, 2026
Size: 31.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lyingdocs-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`c97bdc46faf17657b825ffc1103b7e42edafed57687df46270c88985f16c8cb6`
MD5	`ee46b3525e1f50b3c2309a2b61f96988`
BLAKE2b-256	`d69e703bac9522047dfdf2f54e52f5decddc30705d8fc15a86f696036b639d87`

Provenance

The following attestation bundles were made for lyingdocs-0.1.3.tar.gz:

Publisher: publish.yml on KMing-L/lying-docs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lyingdocs-0.1.3.tar.gz
- Subject digest: c97bdc46faf17657b825ffc1103b7e42edafed57687df46270c88985f16c8cb6
- Sigstore transparency entry: 1280585265
- Sigstore integration time: Apr 12, 2026
Source repository:
- Permalink: KMing-L/lying-docs@6ab4ca15c91b2e70fee57e460728a442a65e2e68
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/KMing-L
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6ab4ca15c91b2e70fee57e460728a442a65e2e68
- Trigger Event: push

File details

Details for the file lyingdocs-0.1.3-py3-none-any.whl.

File metadata

Download URL: lyingdocs-0.1.3-py3-none-any.whl
Upload date: Apr 12, 2026
Size: 41.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lyingdocs-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5d3a27d45e843d8583c8c80a7ed92488f08b4ead0781b5db42b7c6a92aa9c1fb`
MD5	`50efecdc528dd14d904fb34b732fe5f2`
BLAKE2b-256	`370407e37074e994f0c2e46f7a7a943ce019ddb4c894a7a4370e457ef5163569`

Provenance

The following attestation bundles were made for lyingdocs-0.1.3-py3-none-any.whl:

Publisher: publish.yml on KMing-L/lying-docs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: lyingdocs-0.1.3-py3-none-any.whl
- Subject digest: 5d3a27d45e843d8583c8c80a7ed92488f08b4ead0781b5db42b7c6a92aa9c1fb
- Sigstore transparency entry: 1280585273
- Sigstore integration time: Apr 12, 2026
Source repository:
- Permalink: KMing-L/lying-docs@6ab4ca15c91b2e70fee57e460728a442a65e2e68
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/KMing-L
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6ab4ca15c91b2e70fee57e460728a442a65e2e68
- Trigger Event: push
