Benchmark how agent-ready a code repository is for LLM coding agents.

These details have not been verified by PyPI

Project description

agent-readiness

A benchmark for AI agent readiness of a code repository.

You bought the seats. Your team is using Claude Code, Cursor, Copilot, Cline. And the agents keep going off the rails on your codebase.

The model is the variable you can't change. The repo is what you can.

agent-readiness scans a repository (or a whole workspace of repos) and scores how ready it is for AI coding agents — across cognitive load, feedback loops, flow, safety, and (for multi-repo workspaces) coordination — then hands you a prioritised punchlist of fixes. Like Lighthouse, but for AI agent readiness instead of page load.

There are two ways to use it:

As a skill in your coding agent — ask Claude / Cursor / Codex "score this repo for agent-readiness" and it scores + fixes via the bundled MCP server. Multi-repo workspaces auto-launch a live browser dashboard.
As a CLI in your shell or CI — agent-readiness scan . for the same scoring engine, with --fail-below for CI gates and --json for machine output.

Both surfaces wrap the same scoring engine. Pick whichever fits your workflow; you can mix them.

Install — as a skill (recommended)

The scanner ships as a Claude Code plugin and a portable Agent Skill (works in Cursor and any other harness that loads SKILL.md).

Claude Code

/plugin marketplace add harrydaihaolin/agent-readiness-skill
/plugin install agent-readiness@agent-readiness-skill

That's it — the plugin bundles the MCP server config, so there's no JSON to paste. The skill registers enumerate_workspace, scan_repo, check_workspace_readiness, scan_workspace_async (dashboard mode), get_scan_status, and apply_top_action automatically.

Prerequisite once per machine:

pip install agent-readiness-mcp

Cursor / Claude Desktop / other harnesses

Clone the skill repo and run its installer:

git clone https://github.com/harrydaihaolin/agent-readiness-skill.git
cd agent-readiness-skill
./scripts/install.sh          # auto-detect; or --target=cursor

See the skill repo for per-harness install details and the community marketplace status.

Use it

Just talk to your agent:

"Score this repo for agent-readiness."

"What should I add to AGENTS.md?"

"Score the whole workspace and open the dashboard."

For a single repo, the skill prints the score, the top friction, and offers to apply the highest-priority deterministic fix.

For a multi-repo workspace, the skill auto-launches dashboard mode — a live browser surface that streams per-repo progress, lets the user answer interactive prompts inline, and stays out of the chat instead of blocking it for minutes. The chat side stays hands-off and only checks status when the user types. See Dashboard mode below.

Install — as a CLI

pip install agent-readiness

Requires Python 3.11+. From source:

git clone https://github.com/harrydaihaolin/agent-readiness.git
cd agent-readiness
pip install -e ".[dev]"   # or: make dev

Use it

$ agent-readiness scan .

AI Readiness  62 / 100

  Cognitive load     70 / 100
  Feedback loops     40 / 100   ← biggest drag
  Flow & reliability 75 / 100
  Safety             OK

Top friction (fix these first):
  1. test_command.discoverable — no test invocation found in Makefile,
     package.json, or pyproject.toml
  2. agent_docs.present — no AGENTS.md / CLAUDE.md / .cursorrules at root
  3. headless.no_setup_prompts — README mentions "log in to the dashboard"
     during setup; agents can't traverse this

Every WARN/ERROR finding also carries a paste-ready fix_prompt block (one paragraph of agent-led prose + a verify command), so you can hand it straight to your coding agent. Pipe --json for the structured output the skill consumes internally.

Full CLI surface

# Static scan (no Docker needed)
agent-readiness scan .
agent-readiness scan . --json
agent-readiness scan . --fail-below 70        # exit 1 if score < 70 (CI gate)
agent-readiness scan . --only feedback        # filter to one pillar
agent-readiness scan . --baseline prev.json   # diff against a previous run
agent-readiness scan . --report report.html   # HTML report (requires jinja2)
agent-readiness scan . --badge badge.svg      # score badge SVG
agent-readiness scan . --sarif findings.sarif # SARIF for GitHub code scanning

# Runtime scan (executes tests inside Docker)
agent-readiness scan . --run

# Multi-repo (workspace) scans
agent-readiness scan-workspace .              # static, headless
agent-readiness scan-and-view . --children .  # local dashboard server

# Ontology bootstrap + inference (Bundle C)
agent-readiness ontology validate ontology/ --strict
agent-readiness ontology reason ontology/ --json

# Apply the top action
agent-readiness apply-top-action .

# Other commands
agent-readiness list-checks
agent-readiness explain manifest.detected
agent-readiness init                          # write .agent-readiness.toml

What gets measured

See docs/RUBRIC.md for the full definition. Short version:

Pillar	What it captures
Cognitive load	What the agent must absorb to make a correct change.
Feedback loops	How fast and clear is the signal after a change.
Flow / reliability	Headless walkability + how often friction outside the task blocks the agent.
Safety & trust	Secrets, destructive scripts, gitignore hygiene. (Cap, not weight.)
Coordination (workspace-only)	Whether agents can operate coherently across N repos. Root `AGENTS.md`, member repos declared, dep / change order documented.

A sixth namespace, ontology / inference, ships derived findings — violations the scanner infers from your declared ontology (Bundle C): e.g. a Library coupled with a Protocol whose consumers disagree on the major version, a Repo claiming Releasable without a release workflow, or an Action chain that crosses a tenant boundary without the required intent template.

Dashboard mode (multi-repo workspaces)

For workspaces of two or more repos, the skill auto-launches a live browser dashboard (agent-readiness-analytics-dashboard) and streams the scan over Server-Sent Events. The UX you'd otherwise miss in chat:

Per-repo grid loads progressively — each repo lights up as it starts, ticks, and completes. No 4-minute wait staring at a spinner.
Interactive prompts in-browser — the scanner occasionally needs a human decision (classify this directory as code/data/docs?; ratify this proposed ontology atom?; apply the top action?). Those land as cards in a Prompts queue in the dashboard and the user answers them with a click instead of having them interrupt the chat.
Exit dashboard mode anytime — click the "Exit dashboard" button in the browser or type /agent-readiness exit-dashboard in chat. The scan keeps running; only the surface changes.
Findings feed + pillar sidebar + log tail — the same prioritised punchlist the CLI prints, refreshed live.

Run it directly from the CLI:

agent-readiness scan-and-view . --children .
# opens http://localhost:8765/live/<scan_id>

Or, in the agent skill, just say "score the whole workspace" — it hands the URL back in chat and stays out of the way until the scan completes or you ask for status.

Design principles

Agents are headless. We assume the agent has stdin / stdout / files / git / HTTP and nothing else. No browser, no dashboard, no clickable button. If important state is reachable only through a UI, it's invisible to the agent — and the repo loses points wherever that's true.

This applies to our own tool, too. The scanner is fully headless: no required interactive prompts, stable JSON via --json, exit codes that mean things, machine-readable findings. The dashboard is opt-in icing on top, never a required path.

Code quality counts only where it predicts agent success. Mega-files, ambiguous names, dead code, missing types — those have direct lines to agent failure modes and get measured. We don't reproduce the full SonarQube taxonomy. Other tools do that well.

Run untrusted code in Docker, always. Any check that executes code from the target repo runs inside a sandboxed container. See docs/SANDBOX.md.

This repo's score

Dogfooding: agent-readiness scan . run against this repository itself.

╭─────────────────────────────╮
│  AI Readiness  100.0 / 100  │
╰─────────────────────────────╯
 Cognitive load      100.0  ████████████████████
 Feedback loops      100.0  ████████████████████
 Flow & reliability  100.0  ████████████████████
 Safety              100.0  ████████████████████

No findings. Looking good.

Score updated after each iteration as part of the development workflow.

Status

All phases implemented through v3.4.1 — 30+ checks across 5 pillars plus the ontology/inference namespace (Bundle C, 6 derived rules), Docker sandbox, HTML + SARIF renderers, full CLI surface, MCP server, Claude Code plugin / Cursor skill, and a live SSE dashboard for multi-repo workspaces (Bundle D). See docs/PLAN.md for the full roadmap and CHANGELOG.md for per-phase release notes.

License

MIT for the code; see LICENSE. The project name and logo are governed separately by TRADEMARK.md: forks are welcome, "agent-readiness" the brand is reserved for the canonical project.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

3.4.3

May 27, 2026

3.4.2

May 27, 2026

3.4.1

May 27, 2026

3.4.0

May 27, 2026

3.3.0

May 27, 2026

3.2.0

May 26, 2026

3.1.0

May 25, 2026

3.0.4

May 25, 2026

3.0.3

May 25, 2026

3.0.2

May 25, 2026

3.0.1

May 25, 2026

3.0.0

May 25, 2026

2.9.0

May 24, 2026

2.8.0

May 24, 2026

2.7.0

May 24, 2026

2.6.0

May 23, 2026

2.5.0

May 21, 2026

2.4.6

May 21, 2026

2.4.5

May 21, 2026

2.4.4

May 21, 2026

2.4.3

May 21, 2026

2.4.2

May 21, 2026

2.4.1

May 20, 2026

2.4.0

May 20, 2026

2.3.1

May 20, 2026

2.3.0

May 20, 2026

2.1.1

May 4, 2026

2.1.0

May 4, 2026

1.5.0

May 2, 2026

1.4.0

May 2, 2026

1.1.0

Apr 29, 2026

1.0.0

Apr 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_readiness-3.4.3.tar.gz (544.5 kB view details)

Uploaded May 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_readiness-3.4.3-py3-none-any.whl (496.3 kB view details)

Uploaded May 27, 2026 Python 3

File details

Details for the file agent_readiness-3.4.3.tar.gz.

File metadata

Download URL: agent_readiness-3.4.3.tar.gz
Upload date: May 27, 2026
Size: 544.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for agent_readiness-3.4.3.tar.gz
Algorithm	Hash digest
SHA256	`4638d29b158b6c42a5e813531b07d18cdd613b9b065001c1cb129e9a1b6b7206`
MD5	`ecf2d9dd6fda1511a1f305630173e647`
BLAKE2b-256	`3c7d43c089c5825e9bda811d5763a9dd911a3da26fc83950db4b65274d98de96`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_readiness-3.4.3.tar.gz:

Publisher: release.yml on harrydaihaolin/agent-readiness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agent_readiness-3.4.3.tar.gz
- Subject digest: 4638d29b158b6c42a5e813531b07d18cdd613b9b065001c1cb129e9a1b6b7206
- Sigstore transparency entry: 1645005982
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: harrydaihaolin/agent-readiness@a4b55f04b9cd242bbd50230a0744ad8ac0b89305
- Branch / Tag: refs/tags/v3.4.3
- Owner: https://github.com/harrydaihaolin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a4b55f04b9cd242bbd50230a0744ad8ac0b89305
- Trigger Event: push

File details

Details for the file agent_readiness-3.4.3-py3-none-any.whl.

File metadata

Download URL: agent_readiness-3.4.3-py3-none-any.whl
Upload date: May 27, 2026
Size: 496.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for agent_readiness-3.4.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`37baf0d89cc9af48a104e690fb6e314f6207bed48a6da94bfbf8b2899e3fe591`
MD5	`15b6d4a0c6ba9313efcda3292f26e624`
BLAKE2b-256	`4bdf5eadf3a390fe07bdbfbbc8e4c347e83efda3129fe1ca1d27a3f56c786dfc`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_readiness-3.4.3-py3-none-any.whl:

Publisher: release.yml on harrydaihaolin/agent-readiness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agent_readiness-3.4.3-py3-none-any.whl
- Subject digest: 37baf0d89cc9af48a104e690fb6e314f6207bed48a6da94bfbf8b2899e3fe591
- Sigstore transparency entry: 1645006115
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: harrydaihaolin/agent-readiness@a4b55f04b9cd242bbd50230a0744ad8ac0b89305
- Branch / Tag: refs/tags/v3.4.3
- Owner: https://github.com/harrydaihaolin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a4b55f04b9cd242bbd50230a0744ad8ac0b89305
- Trigger Event: push

agent-readiness 3.4.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

agent-readiness

Install — as a skill (recommended)

Claude Code

Cursor / Claude Desktop / other harnesses

Use it

Install — as a CLI

Use it

Full CLI surface

What gets measured

Dashboard mode (multi-repo workspaces)

Design principles

This repo's score

Status

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance