Unified CLI for the Agent Quality Toolkit (agentmd, coderace, agentlint, agentreflect)

These details have not been verified by PyPI

agentkit-cli

One install to rule them all.

pip install agentkit-cli

Get the full Agent Quality Toolkit pipeline in a single command — no more juggling four separate pip install steps.

What is it?

agentkit-cli is a unified meta-CLI that wraps the Agent Quality Toolkit quartet:

| Tool | Purpose |
|------|---------|
| agentmd | Generate CLAUDE.md context files |
| agentlint | Lint AI context files and git diffs |
| coderace | Benchmark AI coding performance |
| agentreflect | Generate reflection reports from failures |

Pipeline Overview

agentkit run
    │
    ├── 1. agentmd generate        → produces CLAUDE.md
    ├── 2. agentlint check-context → lints CLAUDE.md
    ├── 3. agentlint (diff)        → lints recent changes
    ├── 4. coderace benchmark      → (opt-in via --benchmark)
    └── 5. agentreflect generate   → reflection on failures

Zero to full pipeline in 30 seconds.

Installation

pip install agentkit-cli

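Install the quartet tools too. The workflow generated by agentkit ci installs them from PyPI as agentmd-gen, ai-agentlint, coderace, and ai-agentreflect, so use those names if the short ones below do not resolve: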

pip install agentmd agentlint coderace agentreflect

Usage

`agentkit report`

Generate a shareable HTML quality report for the current project in one command.

agentkit report                          # saves agentkit-report.html in current dir
agentkit report --output myreport.html   # custom output path
agentkit report --open                   # save and auto-open in browser
agentkit report --json                   # emit structured JSON to stdout
agentkit report --path /my/project       # specify project directory

Runs all installed toolkit tools (agentlint, agentmd, coderace, agentreflect) with 60-second timeouts, collects results, and assembles a self-contained HTML report — no CDN, no external fonts, suitable for sharing as a screenshot or file. Tools that aren't installed are shown as skipped; the command never crashes regardless of toolkit state.
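The "never crashes" behavior described above can be pictured as a small wrapper around each tool invocation. The following is a minimal sketch, not agentkit-cli's actual source: run_tool is a hypothetical helper, and the status strings are taken from the JSON sample and status table in this README.

```python
import shutil
import subprocess
import sys


def run_tool(name, args, timeout=60):
    """Run one toolkit command and always return a status dict (never raise)."""
    # A tool that is not on PATH is reported, not treated as an error.
    if shutil.which(name) is None:
        return {"tool": name, "installed": False, "status": "not_installed"}
    try:
        proc = subprocess.run(
            [name, *args], capture_output=True, text=True, timeout=timeout
        )
    except subprocess.TimeoutExpired:
        # The 60-second limit from the text maps to subprocess's timeout.
        return {"tool": name, "installed": True, "status": "failed"}
    except OSError:
        # For example: the file vanished or is not executable.
        return {"tool": name, "installed": True, "status": "failed"}
    status = "success" if proc.returncode == 0 else "failed"
    return {"tool": name, "installed": True, "status": status}


if __name__ == "__main__":
    # Uses the current Python interpreter as a stand-in for a real tool.
    print(run_tool(sys.executable, ["-c", "pass"]))
```

Collecting one such dict per tool yields the rows of the pipeline status table without any single failure aborting the report.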

Report sections:

Toolkit coverage score (% of tools that ran successfully)
Context quality score from agentlint (freshness, top issues)
Context docs score from agentmd (file sizes, tier structure)
Agent benchmark results from coderace (scores per agent, winner highlighted)
Reflection summary from agentreflect
Pipeline status table (installed / success / failed / not installed)

JSON output format:

{
  "project": "my-project",
  "version": "0.5.0",
  "coverage": 75,
  "tools": [
    {"tool": "agentlint", "installed": true, "status": "success"},
    {"tool": "agentmd", "installed": true, "status": "success"},
    {"tool": "coderace", "installed": false, "status": "not_installed"},
    {"tool": "agentreflect", "installed": true, "status": "success"}
  ],
  "agentlint": { ... },
  "agentmd": { ... },
  "coderace": null,
  "agentreflect": { ... }
}
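The sample above is consistent with coverage being the rounded percentage of tools whose status is success (3 of 4 gives 75). A minimal sketch under that assumption follows; coverage_percent is a hypothetical helper, and the payload is a trimmed copy of the sample with the elided sections left out.

```python
import json


def coverage_percent(tools):
    """Percentage of tools with status "success", rounded to an integer."""
    if not tools:
        return 0
    ok = sum(1 for tool in tools if tool.get("status") == "success")
    return round(100 * ok / len(tools))


# Trimmed copy of the README sample (per-tool detail sections omitted).
SAMPLE = """
{
  "project": "my-project",
  "tools": [
    {"tool": "agentlint", "installed": true, "status": "success"},
    {"tool": "agentmd", "installed": true, "status": "success"},
    {"tool": "coderace", "installed": false, "status": "not_installed"},
    {"tool": "agentreflect", "installed": true, "status": "success"}
  ]
}
"""

report = json.loads(SAMPLE)
missing = [t["tool"] for t in report["tools"] if not t["installed"]]

if __name__ == "__main__":
    print(coverage_percent(report["tools"]), missing)
```

In a script, the same parsing can be applied to the stdout of agentkit report --json.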

Sharing Results

Publish a report to here.now and get a shareable link in seconds.

# Publish the most recently generated report
agentkit publish

# Publish a specific report file
agentkit publish path/to/report.html

# Generate and publish in one command
agentkit report --publish

# Run the full pipeline and publish at the end
agentkit run --publish

# Get just the URL (useful in scripts)
agentkit publish --quiet

# Get JSON output with URL and expiry info
agentkit publish --json

Authentication: Anonymous publishes expire after 24 hours. For persistent links, set the HERENOW_API_KEY environment variable:

export HERENOW_API_KEY=your-key-here
agentkit publish   # link never expires

GitHub Action

Add automated agent quality checks to every pull request with the mikiships/agentkit-cli GitHub Action.

What it checks:

Context Lint Score — runs agentlint check-context and scores your AGENTS.md / CLAUDE.md (0–100). Fails the action if the score falls below min-lint-score.
Context Drift — runs agentmd drift to detect whether your context file has drifted from the codebase.
Code Review — runs coderace review on the PR diff with parallel review lanes.

Add it with a short workflow file:

# .github/workflows/agent-quality.yml
name: Agent Quality
on: [pull_request]
jobs:
  quality:
    runs-on: ubuntu-latest
    permissions:
      pull-requests: write
    steps:
      - uses: actions/checkout@v4
      - uses: mikiships/agentkit-cli@v0.7.0
        with:
          github-token: ${{ secrets.GITHUB_TOKEN }}
          min-lint-score: '70'

PR comment format:

## 🔬 Agent Quality Report

| Check | Result |
|-------|--------|
| Context Lint Score | 85/100 |
| Context Drift | ✅ Fresh |
| Code Review | Review complete |

[Details in workflow logs]

Inputs:

| Input | Default | Description |
|-------|---------|-------------|
| `github-token` | (none) | Required. Token for posting PR comments. |
| `min-lint-score` | `70` | Minimum lint score. Action fails if below this. |
| `post-comment` | `true` | Whether to post a PR comment. |
| `python-version` | `3.11` | Python version to use. |

Outputs: lint-score, drift-status, review-summary

See examples/agentkit-quality.yml for a complete ready-to-use workflow.
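For readers wiring similar checks into their own steps, the score gate and the step outputs can be sketched as follows. This is an illustration, not the action's actual source: lint_gate and write_outputs are hypothetical names. The only platform behavior relied on is that GitHub Actions reads name=value lines appended to the file named by the GITHUB_OUTPUT environment variable.

```python
import os


def lint_gate(score, min_score=70):
    """Return a process exit code: 0 if the score passes, 1 otherwise."""
    return 0 if score >= min_score else 1


def write_outputs(outputs, path=None):
    """Append name=value lines to the GitHub Actions output file.

    Returns False when no output file is configured (e.g. a local run).
    Only single-line values are handled in this sketch.
    """
    path = path or os.environ.get("GITHUB_OUTPUT")
    if not path:
        return False
    with open(path, "a", encoding="utf-8") as handle:
        for name, value in outputs.items():
            text = str(value)
            if "\n" in text:
                raise ValueError(f"multi-line value for {name!r} not supported")
            handle.write(f"{name}={text}\n")
    return True


if __name__ == "__main__":
    write_outputs({"lint-score": 85, "drift-status": "fresh"})
    raise SystemExit(lint_gate(85, min_score=70))
```

A later workflow step could then read the values through the usual steps.<id>.outputs expressions.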

`agentkit demo`

Zero-config first-run experience. Works in any directory — no .agentkit.yaml needed.

agentkit demo                        # auto-detect project type and agents
agentkit demo --skip-benchmark       # just run generate + lint + reflect
agentkit demo --task refactor        # pick a specific coderace task
agentkit demo --agents claude,codex  # specify agents manually
agentkit demo --json                 # emit results as JSON

Detects your project type (python/typescript/javascript/generic), picks the best benchmark task, finds installed agents (claude, codex), and runs the full pipeline without any setup. The footer shows how to continue with agentkit init && agentkit run.
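How the detection works is not documented here. One plausible approach is to look for well-known marker files and to probe PATH for agent executables. The sketch below assumes exactly that; the marker file list and both function names are illustrative guesses, not agentkit-cli's real rules.

```python
import shutil
from pathlib import Path

# Assumed marker files. TypeScript is checked before JavaScript because
# TypeScript projects usually have a package.json as well.
MARKERS = [
    ("python", ("pyproject.toml", "setup.py", "requirements.txt")),
    ("typescript", ("tsconfig.json",)),
    ("javascript", ("package.json",)),
]


def detect_project_type(root):
    """Return python, typescript, javascript, or generic for a directory."""
    root = Path(root)
    for kind, files in MARKERS:
        if any((root / name).is_file() for name in files):
            return kind
    return "generic"


def find_agents(candidates=("claude", "codex"), which=shutil.which):
    """Return the candidate agent executables that are found on PATH."""
    return [name for name in candidates if which(name)]


if __name__ == "__main__":
    print(detect_project_type("."), find_agents())
```

Passing which as a parameter keeps the lookup easy to fake in tests.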

`agentkit init`

Initialize agentkit in a project. Checks which tools are installed and creates .agentkit.yaml.

agentkit init
agentkit init --path /my/project

Creates .agentkit.yaml:

tools:
  coderace: true
  agentmd: true
  agentlint: true
  agentreflect: true
defaults:
  min_score: 80
  context_file: CLAUDE.md

`agentkit run`

Run the full quality pipeline sequentially.

agentkit run
agentkit run --path /my/project
agentkit run --skip generate
agentkit run --skip lint
agentkit run --skip benchmark
agentkit run --skip reflect
agentkit run --benchmark        # include benchmark step
agentkit run --json             # emit summary as JSON
agentkit run --notes "regression after refactor"

Missing tools are skipped automatically with a warning — you don't need all four installed.
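The interaction between --skip, --benchmark, and missing tools can be summarized as a small planning function. This is a sketch of the rules as described in this README, not the real implementation; plan_steps and the action labels are hypothetical, and the step-to-tool mapping follows the pipeline overview.

```python
# Step names match the documented --skip values; the tool for each step
# follows the pipeline overview.
PIPELINE = [
    ("generate", "agentmd"),
    ("lint", "agentlint"),
    ("benchmark", "coderace"),
    ("reflect", "agentreflect"),
]


def plan_steps(installed, skip=(), benchmark=False):
    """Return a list of (step, action) pairs in pipeline order.

    action is "run", "skipped" (by flag or because the benchmark is
    opt-in), or "missing" (tool not installed, so skipped with a warning).
    """
    plan = []
    for step, tool in PIPELINE:
        if step in skip:
            action = "skipped"
        elif step == "benchmark" and not benchmark:
            action = "skipped"
        elif tool not in installed:
            action = "missing"
        else:
            action = "run"
        plan.append((step, action))
    return plan


if __name__ == "__main__":
    for step, action in plan_steps({"agentmd", "agentlint"}, skip={"generate"}):
        print(f"{step:10} {action}")
```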

`agentkit status`

Quick health check: tool versions, config presence, last run summary.

agentkit status
agentkit status --path /my/project
agentkit status --json

`agentkit doctor`

Diagnose whether all quartet tools are installed and functional.

agentkit doctor
agentkit doctor --json

Outputs a Rich table with ✓/✗ per tool, version, and install command. Exits 1 if any tool is missing.
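The exit-code contract is simple enough to sketch in a few lines. The version below assumes each tool's executable has the same name as the tool (as the pipeline overview suggests) and omits version probing and the Rich table; doctor here is a hypothetical function, not the command's source.

```python
import shutil

TOOLS = ("agentmd", "agentlint", "coderace", "agentreflect")


def doctor(which=shutil.which):
    """Return (rows, exit_code); rows are (tool, found) pairs.

    exit_code is 1 if any tool is missing, matching the documented behavior.
    """
    rows = [(tool, which(tool) is not None) for tool in TOOLS]
    exit_code = 0 if all(found for _, found in rows) else 1
    return rows, exit_code


if __name__ == "__main__":
    rows, code = doctor()
    for tool, found in rows:
        print(("✓" if found else "✗"), tool)
    raise SystemExit(code)
```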

`agentkit ci`

Generate a ready-to-use GitHub Actions workflow that runs the full quartet pipeline on every PR.

# Write .github/workflows/agentkit.yml
agentkit ci

# Preview without writing (dry run)
agentkit ci --dry-run

# With coderace benchmark step
agentkit ci --benchmark

# Gate PR on minimum lint score
agentkit ci --min-score 80

# Custom Python version
agentkit ci --python-version 3.11

# Custom output path
agentkit ci --output-dir .github/workflows

Generated workflow example:

name: Agent Quality Toolkit

on:
  pull_request:
  push:
    branches: [main, master]

jobs:
  agentkit:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"
      - name: Install quartet tools
        run: pip install agentmd-gen ai-agentlint coderace ai-agentreflect agentkit-cli
      - name: Run agentkit pipeline
        run: agentkit run --ci
      - name: Upload lint report
        uses: actions/upload-artifact@v4
        with:
          name: agentkit-lint-report
          path: .agentlint_report.json

| Flag | Default | Description |
|------|---------|-------------|
| `--python-version` | `3.12` | Python version for the workflow |
| `--benchmark` | off | Include coderace benchmark step |
| `--min-score` | none | Gate PR on minimum lint score |
| `--output-dir` | `.github/workflows` | Where to write the workflow file |
| `--dry-run` | off | Print to stdout instead of writing |

`agentkit watch`

Watch the project for file changes and automatically re-run the pipeline.

# Watch current directory
agentkit watch

# Watch specific directory
agentkit watch --path /my/project

# Custom extensions
agentkit watch --extensions .py --extensions .md

# Custom debounce (seconds)
agentkit watch --debounce 3.0

# Run in CI mode on changes
agentkit watch --ci

On startup, it prints:

Watching /my/project for changes... (Ctrl+C to stop)
Extensions: .py, .md, .yaml, .yml
Debounce: 2.0s

On any matching file change, clears the screen and re-runs agentkit run. Press Ctrl+C to stop.
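The extension filter and debounce can be illustrated without the file-watching library itself. The sketch below assumes a trailing-edge debounce (run once the project has been quiet for the debounce interval); that choice, the Debouncer class, and its method names are assumptions for illustration. The defaults come from the startup banner shown above.

```python
import time
from pathlib import Path


class Debouncer:
    """Collapse bursts of matching file changes into a single run."""

    def __init__(self, extensions=(".py", ".md", ".yaml", ".yml"),
                 debounce=2.0, clock=time.monotonic):
        self.extensions = tuple(extensions)
        self.debounce = debounce
        self.clock = clock
        self._last_event = None  # time of the latest matching change

    def notify(self, path):
        """Record a change; return True if the extension is watched."""
        if Path(path).suffix not in self.extensions:
            return False
        self._last_event = self.clock()
        return True

    def should_run(self):
        """Return True once, after the quiet period has elapsed."""
        if self._last_event is None:
            return False
        if self.clock() - self._last_event < self.debounce:
            return False
        self._last_event = None
        return True
```

A watcher loop would call notify from its change callback and poll should_run to decide when to clear the screen and start the pipeline. Injecting clock keeps the timing logic testable without sleeping.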

Requirements: pip install watchdog

CI Integration

Use the agentkit GitHub Action to run the full pipeline in CI:

- name: Run agentkit pipeline
  uses: mikiships/agentkit-cli@v0.2.0
  with:
    python-version: '3.12'
    skip: ''
    benchmark: 'false'
    fail-on-lint: 'true'

Or use agentkit ci to generate the workflow automatically (recommended for v0.3.0+).

Inputs:

| Input | Default | Description |
|-------|---------|-------------|
| `skip` | `''` | Comma-separated steps to skip (`generate`, `lint`, `benchmark`, `reflect`) |
| `benchmark` | `false` | Enable coderace benchmark step |
| `python-version` | `3.12` | Python version to use |
| `fail-on-lint` | `true` | Exit 1 on agentlint failures |
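Action inputs arrive as strings, so the skip list and the benchmark switch have to be translated into CLI arguments somewhere. The sketch below shows one way that mapping could look. It is hypothetical: run_args is not part of the action, and passing --skip more than once in a single invocation is an assumption, since the README only shows one --skip per command.

```python
VALID_STEPS = ("generate", "lint", "benchmark", "reflect")


def run_args(skip="", benchmark="false"):
    """Translate action-style string inputs into an agentkit run argv list."""
    argv = ["agentkit", "run"]
    for step in (part.strip() for part in skip.split(",")):
        if not step:
            continue  # tolerate empty input and stray commas
        if step not in VALID_STEPS:
            raise ValueError(f"unknown step: {step!r}")
        argv += ["--skip", step]
    if benchmark.strip().lower() == "true":
        argv.append("--benchmark")
    return argv


if __name__ == "__main__":
    print(run_args(skip="generate, reflect", benchmark="true"))
```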

Auto-Inject Badge

The fastest way to add an agent quality badge to your README:

agentkit readme

This computes the score, injects a badge section into README.md, and prints a summary:

Updated README.md — agent quality: 87/100 (green)

The injected section looks like:

## Agent Quality

[![agent quality](https://img.shields.io/badge/agent%20quality-87%2F100-green)](https://pypi.org/project/agentkit-cli/)

Generated by [agentkit-cli](https://pypi.org/project/agentkit-cli/).

The command is idempotent — run it again and it just updates the score. Remove the section with --remove:

agentkit readme --remove
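One way to get this kind of idempotence is to treat everything from the section header up to the next second-level header as a replaceable block. The sketch below assumes that approach; inject_section and remove_section are hypothetical helpers, and the sketch does not handle a header line that appears inside a code fence.

```python
def _locate(readme, header):
    """Return (lines, start, end) for the section, or start=None if absent."""
    lines = readme.splitlines()
    if header not in lines:
        return lines, None, None
    start = lines.index(header)
    end = start + 1
    # The section runs until the next "## " header or the end of the file.
    while end < len(lines) and not lines[end].startswith("## "):
        end += 1
    return lines, start, end


def inject_section(readme, body, header="## Agent Quality"):
    """Insert the section, or replace it in place if it already exists."""
    lines, start, end = _locate(readme, header)
    block = [header, "", *body.strip().splitlines(), ""]
    if start is None:
        if lines and lines[-1] != "":
            lines.append("")
        lines += block
    else:
        lines[start:end] = block
    return "\n".join(lines).rstrip("\n") + "\n"


def remove_section(readme, header="## Agent Quality"):
    """Delete the section if present; otherwise return the text unchanged."""
    lines, start, end = _locate(readme, header)
    if start is not None:
        del lines[start:end]
    return "\n".join(lines).rstrip("\n") + "\n"
```

Because the second call finds the existing header and overwrites the same span, running it twice with the same body leaves the file byte-for-byte identical.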

Other options:

agentkit readme --dry-run              # preview without modifying
agentkit readme --readme docs/README.md  # custom path
agentkit readme --section-header "## Quality"  # custom header
agentkit readme --score 87             # override score

Combine with the pipeline:

agentkit run --readme     # run full pipeline then inject badge
agentkit report --readme  # generate HTML report then inject badge

Add a Badge (Manual)

Show your project's agent quality score in your README:

agentkit badge

This outputs a shields.io badge URL plus ready-to-paste Markdown and HTML snippets. Add it to your README:

[![agent quality](https://img.shields.io/badge/agent%20quality-87%2F100-green)](https://github.com/mikiships/agentkit-cli)

For CI or scripting, use --json:

agentkit badge --json
# {"score": 87, "color": "green", "badge_url": "...", "markdown": "...", "html": "..."}

To use a fixed score (e.g. from a gating check):

agentkit badge --score 87

Color thresholds: green ≥80 · yellow 60-79 · orange 40-59 · red <40
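The thresholds and the badge URL format shown earlier can be reproduced with a short function. This is a sketch rather than the tool's own code: badge_color and badge_url are hypothetical names, and the URL layout is inferred from the example badge in this README.

```python
from urllib.parse import quote


def badge_color(score):
    """Map a 0-100 score to a color using the documented thresholds."""
    if score >= 80:
        return "green"
    if score >= 60:
        return "yellow"
    if score >= 40:
        return "orange"
    return "red"


def badge_url(score, label="agent quality"):
    """Build a shields.io static badge URL of the form label-message-color."""
    message = f"{score}/100"
    # safe="" forces "/" to be percent-encoded, as in the README example.
    return "https://img.shields.io/badge/{}-{}-{}".format(
        quote(label, safe=""), quote(message, safe=""), badge_color(score)
    )


if __name__ == "__main__":
    print(badge_url(87))
```

Labels containing dashes or underscores would need extra escaping for shields.io; the default label avoids both.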

License

MIT

Release history

1.31.0

Apr 22, 2026

1.30.0

Apr 22, 2026

1.29.0

Apr 22, 2026

1.28.0

Apr 22, 2026

1.27.0

Apr 21, 2026

1.26.0

Apr 21, 2026

1.25.0

Apr 21, 2026

1.24.0

Apr 21, 2026

1.23.0

Apr 21, 2026

1.22.0

Apr 21, 2026

1.21.0

Apr 21, 2026

1.20.0

Apr 21, 2026

1.19.0

Apr 21, 2026

1.18.0

Apr 21, 2026

1.17.0

Apr 21, 2026

1.16.0

Apr 20, 2026

1.15.0

Apr 20, 2026

1.14.0

Apr 20, 2026

1.13.0

Apr 20, 2026

1.12.0

Apr 20, 2026

1.11.0

Apr 20, 2026

1.10.0

Apr 20, 2026

1.9.0

Apr 20, 2026

1.8.0

Apr 20, 2026

1.7.0

Apr 20, 2026

1.6.0

Apr 20, 2026

1.5.0

Apr 20, 2026

1.4.0

Apr 20, 2026

1.3.0

Apr 20, 2026

1.2.0

Apr 19, 2026

1.1.0

Apr 19, 2026

1.0.0

Apr 19, 2026

0.99.0

Apr 19, 2026

0.98.0

Apr 18, 2026

0.95.1

Mar 23, 2026

0.95.0

Mar 23, 2026

0.94.1

Mar 23, 2026

0.94.0

Mar 23, 2026

0.93.0

Mar 23, 2026

0.92.0

Mar 23, 2026

0.91.0

Mar 23, 2026

0.90.0

Mar 22, 2026

0.89.0

Mar 22, 2026

0.88.0

Mar 22, 2026

0.87.0

Mar 22, 2026

0.86.0

Mar 22, 2026

0.85.0

Mar 22, 2026

0.84.1

Mar 22, 2026

0.84.0

Mar 22, 2026

0.83.0

Mar 21, 2026

0.82.0

Mar 21, 2026

0.81.1

Mar 21, 2026

0.81.0

Mar 21, 2026

0.80.0

Mar 21, 2026

0.79.0

Mar 21, 2026

0.78.0

Mar 21, 2026

0.77.0

Mar 21, 2026

0.76.0

Mar 21, 2026

0.75.0

Mar 20, 2026

0.74.0

Mar 20, 2026

0.73.0

Mar 20, 2026

0.72.0

Mar 20, 2026

0.71.0

Mar 20, 2026

0.70.0

Mar 20, 2026

0.69.0

Mar 20, 2026

0.68.0

Mar 20, 2026

0.67.0

Mar 20, 2026

0.66.0

Mar 20, 2026

0.65.0

Mar 20, 2026

0.64.0

Mar 20, 2026

0.63.0

Mar 20, 2026

0.62.0

Mar 19, 2026

0.61.0

Mar 19, 2026

0.60.0

Mar 19, 2026

0.59.0

Mar 19, 2026

0.58.0

Mar 19, 2026

0.57.0

Mar 19, 2026

0.56.0

Mar 19, 2026

0.55.0

Mar 19, 2026

0.54.0

Mar 19, 2026

0.53.0

Mar 19, 2026

0.52.0

Mar 18, 2026

0.51.0

Mar 18, 2026

0.50.0

Mar 18, 2026

0.49.0

Mar 18, 2026

0.48.0

Mar 18, 2026

0.47.0

Mar 18, 2026

0.46.0

Mar 18, 2026

0.45.0

Mar 18, 2026

0.44.0

Mar 18, 2026

0.43.0

Mar 18, 2026

0.42.0

Mar 17, 2026

0.41.0

Mar 17, 2026

0.40.1

Mar 17, 2026

0.40.0

Mar 17, 2026

0.39.0

Mar 17, 2026

0.38.0

Mar 17, 2026

0.37.0

Mar 16, 2026

0.36.1

Mar 16, 2026

0.36.0

Mar 16, 2026

0.35.0

Mar 16, 2026

0.34.0

Mar 16, 2026

0.33.0

Mar 16, 2026

0.32.0

Mar 16, 2026

0.31.0

Mar 16, 2026

0.30.0

Mar 16, 2026

0.29.0

Mar 16, 2026

0.28.0

Mar 16, 2026

0.27.0

Mar 15, 2026

0.26.1

Mar 15, 2026

0.26.0

Mar 15, 2026

0.25.2

Mar 15, 2026

0.25.1

Mar 15, 2026

0.25.0

Mar 15, 2026

0.24.0

Mar 15, 2026

0.23.0

Mar 15, 2026

0.22.0

Mar 15, 2026

0.21.0

Mar 15, 2026

0.20.0

Mar 15, 2026

0.19.0

Mar 15, 2026

0.16.2

Mar 14, 2026

0.16.1

Mar 14, 2026

0.16.0

Mar 14, 2026

0.15.0

Mar 14, 2026

0.14.0

Mar 14, 2026

0.13.0

Mar 14, 2026

0.12.0

Mar 14, 2026

0.11.0

Mar 13, 2026

0.10.0

Mar 13, 2026

0.9.0 (this version)

Mar 13, 2026

0.8.0

Mar 13, 2026

0.7.0

Mar 13, 2026

0.6.0

Mar 13, 2026

0.5.1

Mar 13, 2026

0.5.0

Mar 13, 2026

0.4.0

Mar 13, 2026

0.3.0

Mar 13, 2026

0.2.1

Mar 12, 2026

0.2.0

Mar 12, 2026

0.1.0

Mar 12, 2026

Download files

Source Distribution

agentkit_cli-0.9.0.tar.gz (72.2 kB)

Uploaded Mar 13, 2026, Source

Built Distribution

agentkit_cli-0.9.0-py3-none-any.whl (37.3 kB)

Uploaded Mar 13, 2026, Python 3

File details

Details for the file agentkit_cli-0.9.0.tar.gz.

File metadata

Download URL: agentkit_cli-0.9.0.tar.gz
Upload date: Mar 13, 2026
Size: 72.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for agentkit_cli-0.9.0.tar.gz

| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | `17b343402616234b1f845e8a7d40b7bbeb56f9190a3eed806ac92ecca52c4380` |
| MD5 | `d7f7c726cce536a7b99ca27b14c5a6b1` |
| BLAKE2b-256 | `e329820c3447b22e962da847b0f39a07140ea492ce9ac2aa0d917c6b828fe4a9` |

File details

Details for the file agentkit_cli-0.9.0-py3-none-any.whl.

File metadata

Download URL: agentkit_cli-0.9.0-py3-none-any.whl
Upload date: Mar 13, 2026
Size: 37.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for agentkit_cli-0.9.0-py3-none-any.whl

| Algorithm | Hash digest |
|-----------|-------------|
| SHA256 | `3cb86a779d427f8eee4b6e4c087534c4c42752d2cd2f767144354b5554539b19` |
| MD5 | `211ee7a7a5c21adcc3098d45abcb980c` |
| BLAKE2b-256 | `ed8a1590768cfbdc8acaa1574ecd092981023c85f1deea52ff6c1390db1758aa` |
