Unified CLI for the Agent Quality Toolkit (agentmd, coderace, agentlint, agentreflect)
Project description
agentkit-cli
One install to rule them all.
pip install agentkit-cli
Get the full Agent Quality Toolkit pipeline in a single command — no more juggling four separate pip install steps.
What is it?
agentkit-cli is a unified meta-CLI that wraps the Agent Quality Toolkit quartet:
| Tool | Purpose |
|---|---|
| agentmd | Generate CLAUDE.md context files |
| agentlint | Lint AI context files and git diffs |
| coderace | Benchmark AI coding performance |
| agentreflect | Generate reflection reports from failures |
Pipeline Overview
agentkit run
│
├── 1. agentmd generate → produces CLAUDE.md
├── 2. agentlint check-context → lints CLAUDE.md
├── 3. agentlint (diff) → lints recent changes
├── 4. coderace benchmark → (opt-in via --benchmark)
└── 5. agentreflect generate → reflection on failures
Zero to full pipeline in 30 seconds.
Installation
pip install agentkit-cli
Install the quartet tools too:
pip install agentmd agentlint coderace agentreflect
Usage
agentkit report
Generate a shareable HTML quality report for the current project in one command.
agentkit report # saves agentkit-report.html in current dir
agentkit report --output myreport.html # custom output path
agentkit report --open # save and auto-open in browser
agentkit report --json # emit structured JSON to stdout
agentkit report --path /my/project # specify project directory
Runs all installed toolkit tools (agentlint, agentmd, coderace, agentreflect) with 60-second timeouts, collects results, and assembles a self-contained HTML report — no CDN, no external fonts, suitable for sharing as a screenshot or file. Tools that aren't installed are shown as skipped; the command never crashes regardless of toolkit state.
Report sections:
- Toolkit coverage score (% of tools ran successfully)
- Context quality score from agentlint (freshness, top issues)
- Context docs score from agentmd (file sizes, tier structure)
- Agent benchmark results from coderace (scores per agent, winner highlighted)
- Reflection summary from agentreflect
- Pipeline status table (installed / success / failed / not installed)
JSON output format:
{
"project": "my-project",
"version": "0.5.0",
"coverage": 75,
"tools": [
{"tool": "agentlint", "installed": true, "status": "success"},
{"tool": "agentmd", "installed": true, "status": "success"},
{"tool": "coderace", "installed": false, "status": "not_installed"},
{"tool": "agentreflect", "installed": true, "status": "success"}
],
"agentlint": { ... },
"agentmd": { ... },
"coderace": null,
"agentreflect": { ... }
}
agentkit demo
Zero-config first-run experience. Works in any directory — no .agentkit.yaml needed.
agentkit demo # auto-detect project type and agents
agentkit demo --skip-benchmark # just run generate + lint + reflect
agentkit demo --task refactor # pick a specific coderace task
agentkit demo --agents claude,codex # specify agents manually
agentkit demo --json # emit results as JSON
Detects your project type (python/typescript/javascript/generic), picks the best benchmark task, finds installed agents (claude, codex), and runs the full pipeline without any setup. Footer shows how to continue with agentkit init && agentkit run.
agentkit init
Initialize agentkit in a project. Checks which tools are installed and creates .agentkit.yaml.
agentkit init
agentkit init --path /my/project
Creates .agentkit.yaml:
tools:
coderace: true
agentmd: true
agentlint: true
agentreflect: true
defaults:
min_score: 80
context_file: CLAUDE.md
agentkit run
Run the full quality pipeline sequentially.
agentkit run
agentkit run --path /my/project
agentkit run --skip generate
agentkit run --skip lint
agentkit run --skip benchmark
agentkit run --skip reflect
agentkit run --benchmark # include benchmark step
agentkit run --json # emit summary as JSON
agentkit run --notes "regression after refactor"
Missing tools are skipped automatically with a warning — you don't need all four installed.
agentkit status
Quick health check: tool versions, config presence, last run summary.
agentkit status
agentkit status --path /my/project
agentkit status --json
agentkit doctor
Diagnose whether all quartet tools are installed and functional.
agentkit doctor
agentkit doctor --json
Outputs a Rich table with ✓/✗ per tool, version, and install command. Exits 1 if any tool is missing.
agentkit ci
Generate a ready-to-use GitHub Actions workflow that runs the full quartet pipeline on every PR.
# Write .github/workflows/agentkit.yml
agentkit ci
# Preview without writing (dry run)
agentkit ci --dry-run
# With coderace benchmark step
agentkit ci --benchmark
# Gate PR on minimum lint score
agentkit ci --min-score 80
# Custom Python version
agentkit ci --python-version 3.11
# Custom output path
agentkit ci --output-dir .github/workflows
Generated workflow example:
name: Agent Quality Toolkit
on:
pull_request:
push:
branches: [main, master]
jobs:
agentkit:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: actions/setup-python@v5
with:
python-version: "3.12"
- name: Install quartet tools
run: pip install agentmd-gen ai-agentlint coderace ai-agentreflect agentkit-cli
- name: Run agentkit pipeline
run: agentkit run --ci
- name: Upload lint report
uses: actions/upload-artifact@v4
with:
name: agentkit-lint-report
path: .agentlint_report.json
| Flag | Default | Description |
|---|---|---|
--python-version |
3.12 |
Python version for the workflow |
--benchmark |
off | Include coderace benchmark step |
--min-score |
none | Gate PR on maintainer rubric score |
--output-dir |
.github/workflows |
Where to write the workflow file |
--dry-run |
off | Print to stdout instead of writing |
agentkit watch
Watch the project for file changes and automatically re-run the pipeline.
# Watch current directory
agentkit watch
# Watch specific directory
agentkit watch --path /my/project
# Custom extensions
agentkit watch --extensions .py --extensions .md
# Custom debounce (seconds)
agentkit watch --debounce 3.0
# Run in CI mode on changes
agentkit watch --ci
Shows:
Watching /my/project for changes... (Ctrl+C to stop)
Extensions: .py, .md, .yaml, .yml
Debounce: 2.0s
On any matching file change, clears the screen and re-runs agentkit run. Press Ctrl+C to stop.
Requirements: pip install watchdog
CI Integration
Use the agentkit GitHub Action to run the full pipeline in CI:
- name: Run agentkit pipeline
uses: mikiships/agentkit-cli@v0.2.0
with:
python-version: '3.12'
skip: ''
benchmark: 'false'
fail-on-lint: 'true'
Or use agentkit ci to generate the workflow automatically (recommended for v0.3.0+).
Inputs:
| Input | Default | Description |
|---|---|---|
skip |
'' |
Comma-separated steps to skip (generate, lint, benchmark, reflect) |
benchmark |
false |
Enable coderace benchmark step |
python-version |
3.12 |
Python version to use |
fail-on-lint |
true |
Exit 1 on agentlint failures |
Links
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentkit_cli-0.5.0.tar.gz.
File metadata
- Download URL: agentkit_cli-0.5.0.tar.gz
- Upload date:
- Size: 45.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
de08b2c9c398e85c3be5b10ae2203514c17933b6ee38988f1da8a8509e8451f2
|
|
| MD5 |
2afaadc49e252f84828e11ec11b5fb9b
|
|
| BLAKE2b-256 |
c74baa17206856a46d0a19da2ab564f92a223d256d412675a0038f8643403768
|
File details
Details for the file agentkit_cli-0.5.0-py3-none-any.whl.
File metadata
- Download URL: agentkit_cli-0.5.0-py3-none-any.whl
- Upload date:
- Size: 27.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cc80390ec28fece63fa6354435982d9d86be2bcc024763022591b837871ed6dd
|
|
| MD5 |
25dcca2c6c3d6b27b2186c96568fb930
|
|
| BLAKE2b-256 |
e150b24dc59ddeccb9bf07058096a9d2da8a582b30ef3082e94337bfcaa6ed59
|