AI-driven software factory: CrewAI + Codex CLI + Claude Code CLI with multi-CLI router, A2A roundtable, MCP server, scale-adaptive delivery from brief to release-ready project.
autodev-ai
A multi-CLI software factory powered by CrewAI agents, Codex CLI, and Claude Code CLI.
From a one-paragraph project brief to milestone-driven, gate-protected delivery — with a full audit trail, A2A agent networking, MCP server support, and sprint-mode planning.
Install
Status: alpha, v0.1.0a4 on GitHub Releases.
From PyPI (recommended):
pip install --pre autodev-ai==0.1.0a4
# or latest pre-release:
pip install --pre autodev-ai
Status (Alpha)
This is a public alpha. v0.1.0a4 closed most of the supply-chain and coverage gaps that were open at a1:
| Area | Status |
|---|---|
| SLSA L3 provenance | Live — slsa-framework/slsa-github-generator attests every wheel/sdist on tag push (see docs/release/slsa_verification.md). |
| SBOM | Live — CycloneDX 1.5 + SPDX 2.3 attached to every Release (see docs/release/sbom_consumption.md). |
| cosign image signing | Live — keyless Fulcio OIDC + Rekor for ghcr.io/merchloubna70-dot/autodev-ai (see docs/release/cosign_verification.md). |
| MCP per-caller auth | Live — IdentityRegistry with scope-based RBAC and triple-gate apply mode (see docs/mcp_server_per_caller_auth.md). |
| Coverage | 91.8% line+branch; all 11 release-critical modules ≥85%, all 3 security-critical modules ≥90%. |
| Enterprise use | Still alpha — production deployments are not yet supported. See docs/release_notes/v0.1.0a1.md for the full scope statement. |
From source:
git clone https://github.com/merchloubna70-dot/autodev-ai.git
cd autodev-ai
pip install -e ".[dev]"
Docker (codex + claude pre-installed, ~2 GB):
docker pull ghcr.io/merchloubna70-dot/autodev-ai:0.1.0a4
docker run --rm ghcr.io/merchloubna70-dot/autodev-ai:0.1.0a4 --help
Homebrew (macOS / Linux)
brew tap merchloubna70-dot/autodev
brew install autodev-ai
autodev --version
Linux users need Homebrew on Linux (formerly Linuxbrew). The tap is live at https://github.com/merchloubna70-dot/homebrew-autodev.
Optional — real CrewAI runtime:
pip install -e ".[crewai]"
5-minute quickstart
No API key required — the mock executor generates realistic artifacts instantly.
# Deliver a full project from a brief (dry-run, mock mode)
autodev deliver-project \
--project-brief examples/01-mdlines/brief.md \
--from-scratch true \
--mode dry-run \
--executor auto \
--allow-mock-executor true \
--repo-path /tmp/mdlines-demo
# Explore what was produced
ls /tmp/mdlines-demo/.dev-factory/runs/*/delivery/
cat /tmp/mdlines-demo/.dev-factory/runs/*/delivery/final_report.md
Expected output:
[autodev] --scale not given; will auto-infer from PRD/brief
run_id=20240514-143012-a1b2c3 mode=dry-run mock=True release=NotReleaseReady
See the full 5-minute quickstart guide.
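The summary line in the expected output is a flat list of key=value pairs, so a script or CI job can read it without extra tooling. The following is a minimal sketch, assuming the line keeps the shape shown above; `parse_summary` is a hypothetical helper, not part of autodev.

```python
def parse_summary(line: str) -> dict[str, str]:
    """Split a 'key=value key=value' summary line into a dict of strings."""
    return dict(pair.split("=", 1) for pair in line.split() if "=" in pair)


# Sample line copied from the expected output above.
summary = parse_summary(
    "run_id=20240514-143012-a1b2c3 mode=dry-run mock=True release=NotReleaseReady"
)
print(summary["run_id"], summary["release"])
```

All values stay strings, so `mock` is the text `"True"` and not a boolean.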
Features
CrewAI agents — full pipeline coverage
A structured agent graph converts any input (brief / PRD / GitHub issue / bug description) into structured delivery artifacts:
InputClassifier → ProductManager → PRDWriter → SystemArchitect
→ MilestonePlanner → TaskDecomposer → ExecutorRouter
→ QualityGate → SecurityReviewer → Verifier → DocWriter
→ ReleaseManager → audit artifacts on disk
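To make the ordering concrete, the graph above can be read as a linear chain in which each stage receives the context produced by the previous one. This is only an illustrative sketch: the stage names come from the diagram, while `run_pipeline`, the handler signature, and the context dict are assumptions and say nothing about how autodev or CrewAI wire the agents internally.

```python
from typing import Callable

# Stage names copied from the agent graph above, in order.
STAGES = [
    "InputClassifier", "ProductManager", "PRDWriter", "SystemArchitect",
    "MilestonePlanner", "TaskDecomposer", "ExecutorRouter",
    "QualityGate", "SecurityReviewer", "Verifier", "DocWriter",
    "ReleaseManager",
]

Handler = Callable[[dict], dict]


def run_pipeline(brief: str, handlers: dict[str, Handler]) -> tuple[dict, list[str]]:
    """Pass one context dict through every stage in order (hypothetical driver)."""
    context: dict = {"input": brief}
    trace: list[str] = []
    for name in STAGES:
        handler = handlers.get(name)
        if handler is not None:  # stages without a handler pass the context through
            context = handler(context)
        trace.append(name)
    return context, trace


context, trace = run_pipeline(
    "Build a markdown line counter",
    {"PRDWriter": lambda ctx: {**ctx, "prd": f"PRD for: {ctx['input']}"}},
)
print(" -> ".join(trace))
```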
Multi-CLI executor routing
All CLI traffic passes through a single ExecutorRouter. It automatically
selects between Codex CLI (mechanical writes, small patches, tests,
scaffolds) and Claude Code CLI (architecture, long-context refactors,
security reviews, release roll-ups).
autodev deliver-project ... --executor auto # smart routing (default)
autodev deliver-project ... --executor codex # force Codex for everything
autodev deliver-project ... --executor claude # force Claude Code for everything
| Task type | Default backend |
|---|---|
| Scaffold, test generation, small patch | Codex CLI |
| Architecture, refactor, security, docs, release | Claude Code CLI |
| Either CLI missing + --allow-mock-executor true | Mock (deterministic) |
4-stage bug-fix flow
autodev fix-bug \
--bug "p99 latency is wrong: off-by-one index in aggregate.py" \
--repo-path /tmp/log-analyzer \
--mode dry-run \
--allow-mock-executor true
Stages: Reproduce → Locate → Patch → Verify. Every step produces a structured JSON artifact. See Tutorial 01 — Bug-fix flow.
BMAD-derived sprint mode
Plan and track multi-week sprints with course-correction support:
autodev sprint-start --goal "Deliver MVP slug library" --duration-days 10
autodev sprint-status
autodev sprint-retro --sprint-id sprint-001
autodev sprint-correct --sprint-id sprint-001 --change "Add JSON output mode"
See Tutorial 04 — Sprint mode.
Roundtable party-mode (A2A)
Recruit N specialist agents by skill, get independent analysis, synthesize:
autodev roundtable \
--topic "SQLite vs PostgreSQL for the kanban board" \
--skills security,arch,perf \
--repo-path /tmp/my-project
Set FACTORY_FORCE_MOCK=1 for CI / no-API-key usage. See
Tutorial 05 — Roundtable.
MCP server — use autodev from Claude Desktop
autodev mcp-serve # JSON-RPC 2.0 over stdio
Add to claude_desktop_config.json:
{
"mcpServers": {
"autodev": {
"command": "autodev",
"args": ["mcp-serve"]
}
}
}
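If claude_desktop_config.json already lists other servers, the entry above has to be merged in and must not overwrite the file. A small sketch of that merge follows; `add_autodev_server` is a hypothetical helper, and the location of the config file is platform-specific, so it is passed in as a parameter.

```python
import json
from pathlib import Path


def add_autodev_server(config_path: Path) -> dict:
    """Merge the autodev MCP entry into an existing (or new) config file."""
    text = config_path.read_text().strip() if config_path.exists() else ""
    config = json.loads(text) if text else {}
    # Keep any servers that are already configured.
    servers = config.setdefault("mcpServers", {})
    servers["autodev"] = {"command": "autodev", "args": ["mcp-serve"]}
    config_path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

Calling `add_autodev_server(Path("claude_desktop_config.json"))` on a file with existing entries leaves them in place and adds or refreshes only the `autodev` key.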
A2A server — accept tasks from external agents
autodev a2a-serve --port 8421
autodev a2a-call --endpoint http://127.0.0.1:8421 --skill fix-bug \
--task-json '{"text": "Fix the percentile bug"}'
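When the task payload is built programmatically, hand-quoting JSON on a shell command line is error-prone. One way around that, sketched below, is to assemble the argument list in Python and let `json.dumps` produce the payload. The flags are the ones shown above; `a2a_call_argv` is a hypothetical helper.

```python
import json
import shlex


def a2a_call_argv(endpoint: str, skill: str, task: dict) -> list[str]:
    """Build the argument list for `autodev a2a-call` without shell quoting."""
    return [
        "autodev", "a2a-call",
        "--endpoint", endpoint,
        "--skill", skill,
        "--task-json", json.dumps(task),
    ]


argv = a2a_call_argv(
    "http://127.0.0.1:8421", "fix-bug", {"text": "Fix the percentile bug"}
)
# shlex.join renders the list as a copy-pasteable shell command.
print(shlex.join(argv))
```

The list can be passed directly to `subprocess.run(argv, check=True)`, which avoids the shell entirely.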
Full audit trail
Every run writes a structured artifact tree under
<repo>/.dev-factory/runs/<run_id>/:
input/ product/ architecture/ planning/ execution/
quality/ verification/ delivery/ run_state.json
Runs are resumable (autodev continue-run), replayable (autodev replay),
and support milestone-by-milestone execution (autodev execute-milestone).
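A quick way to sanity-check a run directory is to compare it against the layout listed above. The sketch below assumes every entry in that listing is present for a complete run, which may not hold for partial or resumed runs; `missing_artifacts` is a hypothetical helper.

```python
from pathlib import Path

# Directory names copied from the artifact tree above.
EXPECTED_DIRS = [
    "input", "product", "architecture", "planning", "execution",
    "quality", "verification", "delivery",
]


def missing_artifacts(run_dir: Path) -> list[str]:
    """Return the expected entries that are absent from a run directory."""
    missing = [name for name in EXPECTED_DIRS if not (run_dir / name).is_dir()]
    if not (run_dir / "run_state.json").is_file():
        missing.append("run_state.json")
    return missing
```

For example, `missing_artifacts(Path("/tmp/mdlines-demo/.dev-factory/runs") / run_id)` returns an empty list when all nine entries exist.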
Safety baked in
- No business code calls codex or claude directly; only ExecutorRouter does
- Shell executor blocks rm -rf, sudo, cat .env, curl | bash, etc. in all modes
- --commit, --push, --tag are off by default; opt in explicitly
- release_check returns NotReleaseReady when mock execution or dry-run was used
- final_report.md never relabels a failed gate as passed
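The shell blocklist in the second item can be pictured as a set of patterns checked before any command runs. The sketch below is an assumption about how such a filter might look, not autodev's actual implementation: the regular expressions cover only the four examples named above, and a production filter would more likely tokenize the command than pattern-match it.

```python
import re

# Hypothetical patterns for the four examples listed above.
BLOCKED_PATTERNS = [
    re.compile(r"\brm\s+-[a-zA-Z]*r[a-zA-Z]*f|\brm\s+-[a-zA-Z]*f[a-zA-Z]*r"),  # rm -rf / rm -fr
    re.compile(r"\bsudo\b"),                      # privilege escalation
    re.compile(r"\bcat\s+\S*\.env\b"),            # reading secrets from .env
    re.compile(r"\bcurl\b[^|]*\|\s*(ba)?sh\b"),   # piping a download into a shell
]


def is_blocked(command: str) -> bool:
    """Return True if the command matches any blocked pattern."""
    return any(pattern.search(command) for pattern in BLOCKED_PATTERNS)


for cmd in ["rm -rf build", "sudo make install", "pytest -q"]:
    print(f"{cmd!r}: {'blocked' if is_blocked(cmd) else 'allowed'}")
```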
Multi-CLI routing table
| Task type | Default backend | Why |
|---|---|---|
| scaffold | Codex | Small mechanical writes |
| test | Codex | Targeted unit-test additions |
| feature (≤5 files, low/medium risk) | Codex | Fast, deterministic |
| feature (>5 files or high risk) | Claude Code | Long-context reasoning |
| refactor | Claude Code | Multi-file coherence |
| architecture | Claude Code | System design |
| integration (single-language) | Codex | Targeted API changes |
| integration (cross-language) | Claude Code | Contract reasoning |
| security | Claude Code | Deeper threat review |
| docs | Claude Code | Tone and cohesion |
| release | Claude Code | Evidence roll-up |
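The table above can be read as a small decision function. The following is a sketch under stated assumptions: the `Task` fields, the function names, and the choice to raise when a CLI is missing and mock execution is not allowed are illustrative, not the real ExecutorRouter API. The mock fallback mirrors the "Either CLI missing" row in the Multi-CLI executor routing section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    kind: str                     # e.g. "scaffold", "feature", "integration"
    files_touched: int = 1
    risk: str = "low"             # "low", "medium", or "high"
    cross_language: bool = False


ALWAYS_CODEX = {"scaffold", "test"}
ALWAYS_CLAUDE = {"refactor", "architecture", "security", "docs", "release"}


def default_backend(task: Task) -> str:
    """Map a task to its default backend, following the routing table."""
    if task.kind in ALWAYS_CODEX:
        return "codex"
    if task.kind in ALWAYS_CLAUDE:
        return "claude"
    if task.kind == "feature":
        small = task.files_touched <= 5 and task.risk in {"low", "medium"}
        return "codex" if small else "claude"
    if task.kind == "integration":
        return "claude" if task.cross_language else "codex"
    raise ValueError(f"unknown task kind: {task.kind!r}")


def select_backend(task: Task, *, codex_available: bool = True,
                   claude_available: bool = True, allow_mock: bool = False) -> str:
    """Apply the mock fallback first, then the default routing."""
    if not (codex_available and claude_available):
        if allow_mock:
            return "mock"
        # Assumption: without the mock opt-in, a missing CLI is an error.
        raise RuntimeError("a required CLI is missing and mock execution is not allowed")
    return default_backend(task)


print(select_backend(Task("feature", files_touched=8)))
```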
All CLI commands
autodev deliver-project — Brief/PRD → full project delivery
autodev run-issue — GitHub issue → structured change
autodev fix-bug — 4-stage bug-fix: reproduce/locate/patch/verify
autodev multi-patch-fix-bug — Generate N patch candidates, vote for best
autodev execute-milestone — Run a single milestone from a completed plan
autodev continue-run — Resume a failed run from last checkpoint
autodev replay — Re-run a stage from a checkpoint
autodev scan — Scan repo for context
autodev verify — Run verification on a completed run
autodev release-check — Evaluate release readiness
autodev report — Print run summary
autodev export-delivery — Export delivery artifacts to a directory
autodev push — Push committed changes to remote
autodev create-pr — Open a GitHub PR for a completed run
autodev sprint-start — Open a new sprint
autodev sprint-status — Check sprint health metrics
autodev sprint-retro — Run a retrospective
autodev sprint-correct — Analyse mid-sprint change impact
autodev roundtable — Party-mode A2A discussion by skill
autodev mcp-serve — Start MCP server over stdio
autodev a2a-serve — Start A2A HTTP server
autodev a2a-register — Register a remote A2A agent
autodev a2a-call — Send a task to a remote A2A agent
autodev next — Suggest next action from run state
autodev design-ux — BMAD-Sally UX design workflow
autodev investigate — Open a structured investigation case
autodev generate-context — Generate project-context.md from repo
autodev document-project — Generate brownfield AI-onboarding docs
autodev classify-input — Classify a brief/issue into mode + metadata
autodev create-prd — Generate a PRD from a brief
autodev plan-project — Generate a project plan from PRD
autodev plan-milestones — Generate milestones from a project plan
autodev plan-tasks — Generate tasks from milestones
autodev review — Approve/reject a paused human-review gate
autodev dashboard — Launch the Textual TUI dashboard (requires: pip install autodev-ai[tui])
Documentation
| Doc | Contents |
|---|---|
| 5-minute quickstart | Install → credentials → first run → explore artifacts |
| Tutorial 01 — Bug-fix flow | 4-stage fix-bug on log-analyzer |
| Tutorial 02 — Rust project | Deliver slug-rs from brief |
| Tutorial 03 — Multi-CLI routing | ExecutorRouter internals |
| Tutorial 04 — Sprint mode | sprint-start / status / retro / correct |
| Tutorial 05 — Roundtable | Party-mode A2A discussion |
| Tutorial 06 — MCP server | Claude Desktop integration |
| Tutorial 07 — A2A server | HTTP agent-to-agent |
| Architecture reference | Layers, flows, audit trail, failure policy |
| Configuration | ConfigStack 4-layer TOML, env vars, Codex/Claude CLI auth, MCP, A2A |
| Troubleshooting | 12 common problems and fixes |
| FAQ | Top 15 questions |
| CHANGELOG | Release history |
| Contributing | How to contribute |
License
MIT.
File details
Details for the file autodev_ai-0.1.0a4.tar.gz.
File metadata
- Download URL: autodev_ai-0.1.0a4.tar.gz
- Upload date:
- Size: 1.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 5c876cce67a0193bae82a5f518e827429dffa35618ca3f40f5fbad21ec5e7a3e |
| MD5 | 4587d6a2d464d6284a384485d865c237 |
| BLAKE2b-256 | 3c63f9e48227cdf4d7e114c6950658b291a28a56f37538371d364b50dba6fd8a |
File details
Details for the file autodev_ai-0.1.0a4-py3-none-any.whl.
File metadata
- Download URL: autodev_ai-0.1.0a4-py3-none-any.whl
- Upload date:
- Size: 337.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 7dda66495d8251d08c1da44b69e93f8d0fc2792fb424acff9da713081b4e5049 |
| MD5 | aab721ff2b7409d81e517d5ea94acb12 |
| BLAKE2b-256 | e9c06e9fca8a1a614242790ccf40778d7711bfa1b524709e04483963d60f8ee0 |
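A downloaded file can be checked against the SHA256 digests listed above with the standard library alone. This is a minimal sketch: the digests are copied from the tables, while `sha256_of` and `verify` are hypothetical helper names.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details above.
PUBLISHED_SHA256 = {
    "autodev_ai-0.1.0a4.tar.gz":
        "5c876cce67a0193bae82a5f518e827429dffa35618ca3f40f5fbad21ec5e7a3e",
    "autodev_ai-0.1.0a4-py3-none-any.whl":
        "7dda66495d8251d08c1da44b69e93f8d0fc2792fb424acff9da713081b4e5049",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives are not loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path) -> bool:
    """Compare a downloaded file against the digest published for its name."""
    return sha256_of(path) == PUBLISHED_SHA256[path.name]
```

`verify(Path("autodev_ai-0.1.0a4.tar.gz"))` returns `True` only when the local file matches the published digest. A file name that is not in the mapping raises `KeyError`.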