Unified CLI for the Agent Quality Toolkit (agentmd, coderace, agentlint, agentreflect)
Project description
agentkit-cli
Unified CLI for the Agent Quality Toolkit (agentmd, coderace, agentlint, agentreflect).
Installation
pip install agentkit-cli
Quick Start
pip install agentkit-cli
agentkit quickstart # ๐ fastest path to a score โ start here
agentkit quickstart checks your toolchain, runs a fast composite score (agentlint + agentmd), prints a beautiful Rich summary, and optionally publishes a shareable score card โ all in under 60 seconds.
agentkit run # run the full pipeline
agentkit score # compute composite score
agentkit gate # fail if score < threshold
Configuration
agentkit uses .agentkit.toml for project-level configuration.
agentkit config init # create .agentkit.toml with defaults
agentkit config show # show effective config with sources
agentkit config set gate.min_score 80
agentkit config get gate.min_score
Config Precedence
CLI flags > env vars > project .agentkit.toml > user config > defaults
Profiles
Profiles are named presets for gate thresholds, notify config, and sweep targets. Switch your entire quality policy in one command.
Built-in Presets
| Profile | Min Score | Max Drop | Notify On | Gate |
|---|---|---|---|---|
strict |
85 | 3 | fail | enabled |
balanced |
70 | 10 | never | enabled |
minimal |
50 | 20 | never | disabled |
Usage
# Switch to strict quality standards
agentkit profile use strict
# List all profiles (built-in + user-defined)
agentkit profile list
# Show profile details
agentkit profile show strict
# Run gate with a specific profile
agentkit gate --profile strict
# Create a custom profile based on strict
agentkit profile create myprofile --from strict --min-score 90
# Export a profile as JSON or TOML
agentkit profile export strict --format json
Using Profiles with Commands
All major commands support --profile:
agentkit gate --profile strict
agentkit run --profile balanced
agentkit sweep --profile minimal owner/repo1 owner/repo2
agentkit score --profile balanced
agentkit analyze --profile strict github:owner/repo
Explicit CLI flags always override profile values:
# Uses strict profile but overrides min-score to 99
agentkit gate --profile strict --min-score 99
Commands
agentkit quickstartโ ๐ fastest path to a score (start here)agentkit runโ run the full pipelineagentkit scoreโ compute composite scoreagentkit gateโ fail if score < thresholdagentkit analyze <target>โ analyze any GitHub repoagentkit sweep <targets>โ batch analyze multiple reposagentkit duel <repo1> <repo2>โ head-to-head agent-readiness comparisonagentkit tournament <repo1> ... <repoN>โ round-robin bracket across 4-16 reposagentkit profile <sub>โ manage quality profilesagentkit config <sub>โ manage configurationagentkit historyโ show score historyagentkit leaderboardโ compare runs by labelagentkit insightsโ cross-repo pattern synthesisagentkit trendingโ fetch and rank trending GitHub repos by agent quality
Trending Analysis
agentkit trending answers: "Which repos blowing up on GitHub are most AI-agent-ready today?"
# Rank this week's trending AI repos (default)
agentkit trending
# Fast mode: list repos without scoring
agentkit trending --no-analyze
# Filter by topic, publish a shareable report
agentkit trending --topic ai-agent --share
# Weekly trending, top 15, min 500 stars, JSON output
agentkit trending --period week --limit 15 --min-stars 500 --json
# Use a GitHub token for higher rate limits
agentkit trending --token ghp_xxx
Output: a ranked Rich table (Rank | Repo | Stars | Score | Grade | URL) and optionally a dark-theme HTML report published to here.now.
Tournament
agentkit tournament runs a round-robin bracket across 4-16 repos and ranks them by win/loss record with avg score tiebreak.
# Run a 4-repo tournament
agentkit tournament github:fastapi/fastapi github:tiangolo/starlette github:django/django github:pallets/flask
# Publish a shareable HTML bracket report
agentkit tournament github:fastapi/fastapi github:tiangolo/starlette github:django/django github:pallets/flask --share
# JSON output for CI/scripting
agentkit tournament github:fastapi/fastapi github:tiangolo/starlette github:django/django github:pallets/flask --json
# Sequential (no parallel), quiet mode, save HTML
agentkit tournament github:fastapi/fastapi github:tiangolo/starlette github:django/django github:pallets/flask \
--no-parallel --quiet --output bracket.html
Output: standings table (Rank | Repo | W-L | Avg Score | Grade), match results matrix, and winner banner. Use --share to publish a dark-theme HTML bracket to here.now.
Portfolio Insights
Once you've analyzed multiple repos with agentkit analyze or agentkit run, the
agentkit insights command synthesizes patterns across all historical runs:
# Portfolio health summary (avg score, best/worst repo, top issue)
agentkit insights
# Most common agentlint findings across all repos
agentkit insights --common-findings
# Repos scoring in the bottom quartile
agentkit insights --outliers
# Repos with significant score movement between runs
agentkit insights --trending
# All sections in one view
agentkit insights --all
# Machine-readable JSON (useful for scripts/dashboards)
agentkit insights --json
# Use a specific history DB
agentkit insights --db /path/to/history.db
Store agentlint findings alongside scores for richer cross-repo analysis:
agentkit run --record-findings
agentkit analyze github:owner/repo --record-findings
JSON output schema:
{
"portfolio_summary": {
"avg_score": 74.5,
"total_runs": 12,
"unique_repos": 4,
"top_issue": "missing-tools-section",
"best_repo": "owner/repo-a",
"worst_repo": "owner/repo-d"
},
"common_findings": [
{"finding": "missing-tools-section", "repo_count": 3, "total_occurrences": 5}
],
"outliers": [
{"project": "owner/repo-d", "latest_score": 42.0, "avg_score": 48.5, "run_count": 2}
],
"trending": [
{"project": "owner/repo-b", "previous_score": 55.0, "latest_score": 80.0, "delta": 25.0, "direction": "up"}
]
}
Sharing Results
Share your agent quality score card with a single command:
# Generate and upload a score card to here.now
agentkit share
# Share from a saved JSON report
agentkit share --report agentkit-report.json
# Hide raw numbers (show pass/fail only)
agentkit share --no-scores
# Output JSON with URL and score
agentkit share --json
# Auto-share after a run
agentkit run --share
# Auto-share after generating a report
agentkit report --share
# Quickest way to get a score + share URL for any repo
agentkit quickstart github:owner/repo
# Full analyze with share (more detail, slower)
agentkit analyze github:owner/repo --share
# Batch analyze repos and share a combined scorecard
agentkit sweep github:owner/repo1 github:owner/repo2 --share
Score cards are standalone HTML pages (dark theme) showing: composite score, per-tool breakdown, project name, git ref, and timestamp. Anonymous cards expire in 24h; set HERENOW_API_KEY for persistent links.
GitHub Actions
Use the agentkit GitHub Action to run quality checks on every PR:
- uses: mikiships/agentkit-cli@v0.7.0
with:
github-token: ${{ secrets.GITHUB_TOKEN }}
min-score: 70
Or install and run directly:
- uses: actions/checkout@v4
- run: pip install agentkit-cli
- run: agentkit gate --profile strict
See agentkit setup-ci for automated workflow generation.
Local Dashboard
agentkit serve starts a lightweight local web dashboard showing all toolkit runs from the history database:
agentkit serve [OPTIONS]
Options:
--port PORT Port to serve on (default: 7890)
--open Auto-open the dashboard in your browser on start
--once Render dashboard HTML to stdout and exit (no server)
--json Print server URL as JSON and exit (useful for scripts)
The dashboard shows a dark-theme summary of every project run: latest score, grade (AโF), per-tool breakdown, timestamp, and run ID. Scores are color-coded green (โฅ80), yellow (โฅ60), and red (<60). The page auto-refreshes every 30 seconds.
Quick start:
agentkit serve --open # start server + open browser
agentkit run --serve # run pipeline, then print dashboard URL
agentkit serve --once > out.html # render to file
No external dependencies โ uses Python stdlib only (http.server, threading, webbrowser).
Live Dashboard
Run once and watch scores update in real-time:
# Combined: watch files + serve dashboard (updates without reload)
agentkit watch --serve --port 7890
# Or start server in live mode (polls for external writes):
agentkit serve --live
The dashboard connects via SSE (/events) and re-renders the runs table in-place when new pipeline results arrive. A โ Live indicator shows connection status; it drops to โ Offline if the server stops.
Release Check
agentkit release-check verifies the 4-part release surface to confirm a package is truly shipped, not just locally complete:
agentkit release-check [PATH] [OPTIONS]
Options:
--version VERSION Version to verify (default: from pyproject.toml/package.json)
--package NAME Package name (default: from pyproject.toml/package.json)
--registry pypi|npm|auto (default: auto-detected)
--skip-tests Skip the pytest/npm test step for quick checks
--json Output structured JSON for CI integration
Example output:
agentkit release-check โ /your/project
โโโโโโโโโโโโโโฌโโโโโโโโโฌโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ Check โ Status โ Detail โ
โโโโโโโโโโโโโโผโโโโโโโโโผโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ tests โ โ PASS โ 42 passed in 1.23s โ
โ git_push โ โ PASS โ Local HEAD abc12345 matches rem โ
โ git_tag โ โ PASS โ Tag v1.0.0 found on remote. โ
โ registry โ โ PASS โ PyPI: mypkg==1.0.0 is live. โ
โโโโโโโโโโโโโโดโโโโโโโโโดโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
Verdict: SHIPPED
Verdict levels:
- SHIPPED โ all 4 surfaces confirmed (exit 0)
- RELEASE-READY โ tests + git confirmed, registry not yet live (exit 1)
- BUILT โ tests pass locally, not yet pushed (exit 1)
- UNKNOWN โ tests failing (exit 1)
Integrate with agentkit gate --release-check or agentkit run --release-check to add release verification to your pipeline.
Architecture
All quartet tool invocations (agentmd, agentlint, coderace, agentreflect) go through ToolAdapter in agentkit_cli/tools.py. This ensures canonical correct flags are used everywhere and flag-wiring bugs cannot recur across subcommands.
Run pytest -m smoke before any release to catch integration regressions.
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentkit_cli-0.35.0-py3-none-any.whl.
File metadata
- Download URL: agentkit_cli-0.35.0-py3-none-any.whl
- Upload date:
- Size: 150.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
125cebdc5657a015550233031c32b2f62dcc041245b4889b7e36199efaf1422c
|
|
| MD5 |
a75e96c8dcef25abee37f55a0905b631
|
|
| BLAKE2b-256 |
8e8fac2f8b87c86a229100a8b4fa7576a5d0e5e8bc8218c380d2db91efbbf90d
|