AI-driven software factory: CrewAI + Codex CLI + Claude Code CLI with multi-CLI router, A2A roundtable, MCP server, scale-adaptive delivery from brief to release-ready project.

autodev-x

License: MIT Python ≥3.10 PyPI

A multi-CLI software factory powered by CrewAI agents, Codex CLI, and Claude Code CLI.

From a one-paragraph project brief to milestone-driven, gate-protected delivery — with a full audit trail, A2A agent networking, MCP server support, and sprint-mode planning.



Install

Status: alpha — v0.1.0a4 on GitHub Releases. Where we're going: v0.1.0 GA PRD.

From PyPI (recommended):

pip install --pre autodev-x==0.1.0a4
# or latest pre-release:
pip install --pre autodev-x

Status (Alpha)

This is a public alpha. v0.1.0a4 closed most of the supply-chain and coverage gaps that were open in v0.1.0a1:

| Area | Status |
|---|---|
| SLSA L3 provenance | Live: slsa-framework/slsa-github-generator attests every wheel/sdist on tag push (see docs/release/slsa_verification.md). |
| SBOM | Live: CycloneDX 1.5 + SPDX 2.3 attached to every Release (see docs/release/sbom_consumption.md). |
| cosign image signing | Live: keyless Fulcio OIDC + Rekor for ghcr.io/merchloubna70-dot/autodev-x (see docs/release/cosign_verification.md). |
| MCP per-caller auth | Live: IdentityRegistry with scope-based RBAC and triple-gate apply mode (see docs/mcp_server_per_caller_auth.md). |
| Coverage | 91.8% line+branch; all 11 release-critical modules ≥85%, all 3 security-critical modules ≥90%. |
| Enterprise use | Still alpha: production deployments are not yet supported. See docs/release_notes/v0.1.0a1.md for the full scope statement. |

From source:

git clone https://github.com/merchloubna70-dot/autodev-x.git
cd autodev-x
pip install -e ".[dev]"

Docker (codex + claude pre-installed, ~2 GB):

docker pull ghcr.io/merchloubna70-dot/autodev-x:0.1.0a4
docker run --rm ghcr.io/merchloubna70-dot/autodev-x:0.1.0a4 --help

Homebrew (macOS / Linux)

brew tap merchloubna70-dot/autodev
brew install autodev-x
autodev --version

Linux users need Homebrew on Linux (formerly Linuxbrew). The tap is live at https://github.com/merchloubna70-dot/homebrew-autodev.

Optional — real CrewAI runtime:

pip install -e ".[crewai]"

5-minute quickstart

No API key required — the mock executor generates realistic artifacts instantly.

# Deliver a full project from a brief (dry-run, mock mode)
autodev deliver-project \
  --project-brief examples/01-mdlines/brief.md \
  --from-scratch true \
  --mode dry-run \
  --executor auto \
  --allow-mock-executor true \
  --repo-path /tmp/mdlines-demo

# Explore what was produced
ls /tmp/mdlines-demo/.dev-factory/runs/*/delivery/
cat /tmp/mdlines-demo/.dev-factory/runs/*/delivery/final_report.md

Expected output:

[autodev] --scale not given; will auto-infer from PRD/brief
run_id=20240514-143012-a1b2c3 mode=dry-run mock=True release=NotReleaseReady
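
If a script needs to act on that summary line (for example, to fail a CI job unless the release state is the expected one), the `key=value` pairs can be split apart. The sketch below assumes the line keeps the space-separated shape shown above, which may change during alpha; `parse_summary` is a hypothetical helper, not part of autodev.

```python
def parse_summary(line: str) -> dict[str, str]:
    """Split a 'key=value key=value' summary line into a dict of strings."""
    fields = {}
    for token in line.split():
        key, sep, value = token.partition("=")
        if sep:  # ignore tokens that are not key=value pairs
            fields[key] = value
    return fields


# Sample line copied from the expected output above.
summary = parse_summary(
    "run_id=20240514-143012-a1b2c3 mode=dry-run mock=True release=NotReleaseReady"
)
print(summary["release"])
```

All values stay strings, so `mock` comes back as the text `"True"` rather than a boolean.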

See the full 5-minute quickstart guide.


Features

CrewAI agents — full pipeline coverage

A structured agent graph converts any input (brief / PRD / GitHub issue / bug description) into structured delivery artifacts:

InputClassifier → ProductManager → PRDWriter → SystemArchitect
    → MilestonePlanner → TaskDecomposer → ExecutorRouter
    → QualityGate → SecurityReviewer → Verifier → DocWriter
    → ReleaseManager → audit artifacts on disk

Multi-CLI executor routing

All CLI traffic passes through a single ExecutorRouter. It automatically selects between Codex CLI (mechanical writes, small patches, tests, scaffolds) and Claude Code CLI (architecture, long-context refactors, security reviews, release roll-ups).

autodev deliver-project ... --executor auto    # smart routing (default)
autodev deliver-project ... --executor codex   # force Codex for everything
autodev deliver-project ... --executor claude  # force Claude Code for everything

| Task type | Default backend |
|---|---|
| Scaffold, test generation, small patch | Codex CLI |
| Architecture, refactor, security, docs, release | Claude Code CLI |
| Either CLI missing + --allow-mock-executor true | Mock (deterministic) |

4-stage bug-fix flow

autodev fix-bug \
  --bug "p99 latency is wrong: off-by-one index in aggregate.py" \
  --repo-path /tmp/log-analyzer \
  --mode dry-run \
  --allow-mock-executor true

Stages: Reproduce → Locate → Patch → Verify. Every step produces a structured JSON artifact. See Tutorial 01 — Bug-fix flow.
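
To make the "one JSON artifact per stage" idea concrete, here is a minimal sketch of what such a writer could look like. Only the stage names and their order come from the text; the file names, the field names, and `write_stage_artifacts` itself are hypothetical and do not describe autodev's actual artifact schema.

```python
import json
import tempfile
from pathlib import Path

STAGES = ["reproduce", "locate", "patch", "verify"]  # stage order from the text


def write_stage_artifacts(out_dir: Path, bug: str) -> list[Path]:
    """Write one JSON file per stage; file names and fields are hypothetical."""
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for index, stage in enumerate(STAGES, start=1):
        artifact = {"stage": stage, "order": index, "bug": bug, "status": "pending"}
        path = out_dir / f"{index:02d}_{stage}.json"
        path.write_text(json.dumps(artifact, indent=2), encoding="utf-8")
        written.append(path)
    return written


with tempfile.TemporaryDirectory() as tmp:
    paths = write_stage_artifacts(Path(tmp), "off-by-one index in aggregate.py")
    names = [p.name for p in paths]
    print(names)
```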

BMAD-derived sprint mode

Plan and track multi-week sprints with course-correction support:

autodev sprint-start  --goal "Deliver MVP slug library" --duration-days 10
autodev sprint-status
autodev sprint-retro  --sprint-id sprint-001
autodev sprint-correct --sprint-id sprint-001 --change "Add JSON output mode"

See Tutorial 04 — Sprint mode.

Roundtable party-mode (A2A)

Recruit N specialist agents by skill, collect their independent analyses, and synthesize the result:

autodev roundtable \
  --topic "SQLite vs PostgreSQL for the kanban board" \
  --skills security,arch,perf \
  --repo-path /tmp/my-project

Set FACTORY_FORCE_MOCK=1 for CI / no-API-key usage. See Tutorial 05 — Roundtable.

MCP server — use autodev from Claude Desktop

autodev mcp-serve   # JSON-RPC 2.0 over stdio

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "autodev": {
      "command": "autodev",
      "args": ["mcp-serve"]
    }
  }
}
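
If claude_desktop_config.json already lists other MCP servers, the autodev entry should be merged in rather than pasted over the file. The sketch below shows one way to do that merge on an already-loaded config. The `other-tool` entry is a made-up placeholder, and reading and writing the file is left out because its location depends on the operating system and is not stated here.

```python
import json


def add_autodev_server(config: dict) -> dict:
    """Return a copy of a Claude Desktop config with the autodev MCP entry added."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))  # copy so the input is not mutated
    servers["autodev"] = {"command": "autodev", "args": ["mcp-serve"]}
    merged["mcpServers"] = servers
    return merged


# Hypothetical existing config with one unrelated server.
existing = {"mcpServers": {"other-tool": {"command": "other", "args": []}}}
updated = add_autodev_server(existing)
print(json.dumps(updated, indent=2))
```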

See Tutorial 06 — MCP server.

A2A server — accept tasks from external agents

autodev a2a-serve --port 8421
autodev a2a-call --endpoint http://127.0.0.1:8421 --skill fix-bug \
  --task-json '{"text": "Fix the percentile bug"}'

See Tutorial 07 — A2A server.

Full audit trail

Every run writes a structured artifact tree under <repo>/.dev-factory/runs/<run_id>/:

input/ product/ architecture/ planning/ execution/
quality/ verification/ delivery/ run_state.json

Runs are resumable (autodev continue-run), replayable (autodev replay), and support milestone-by-milestone execution (autodev execute-milestone).

Safety baked in

  • No business code calls codex or claude directly — only ExecutorRouter
  • Shell executor blocks rm -rf, sudo, cat .env, curl | bash, etc. in all modes
  • --commit, --push, --tag are off by default — opt in explicitly
  • release_check returns NotReleaseReady when mock execution or dry-run was used
  • final_report.md never relabels a failed gate as passed
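
The shell deny-list in the second bullet can be pictured as a pattern check that runs before any command executes. This is an illustrative sketch only: the patterns cover just the four examples named above, the real rule set is not shown in this README, and `is_blocked` is a hypothetical name. A regex deny-list on its own is easy to bypass, so treat this as a picture of the idea rather than a complete safeguard.

```python
import re

# Patterns mirror the examples named in the list above; the real rules may differ.
BLOCKED_PATTERNS = [
    # rm with both -r and -f in one flag group, in either order
    re.compile(r"\brm\s+-[a-zA-Z]*r[a-zA-Z]*f|\brm\s+-[a-zA-Z]*f[a-zA-Z]*r"),
    re.compile(r"\bsudo\b"),
    re.compile(r"\bcat\s+\S*\.env\b"),
    # curl output piped into sh or bash
    re.compile(r"\bcurl\b[^|]*\|\s*(ba)?sh\b"),
]


def is_blocked(command: str) -> bool:
    """Return True if the command matches any deny-list pattern."""
    return any(pattern.search(command) for pattern in BLOCKED_PATTERNS)


for cmd in ["rm -rf /tmp/x", "cat .env", "pytest -q"]:
    print(cmd, "->", "blocked" if is_blocked(cmd) else "allowed")
```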

Multi-CLI routing table

| Task type | Default backend | Why |
|---|---|---|
| scaffold | Codex | Small mechanical writes |
| test | Codex | Targeted unit-test additions |
| feature (≤5 files, low/medium risk) | Codex | Fast, deterministic |
| feature (>5 files or high risk) | Claude Code | Long-context reasoning |
| refactor | Claude Code | Multi-file coherence |
| architecture | Claude Code | System design |
| integration (single-language) | Codex | Targeted API changes |
| integration (cross-language) | Claude Code | Contract reasoning |
| security | Claude Code | Deeper threat review |
| docs | Claude Code | Tone and cohesion |
| release | Claude Code | Evidence roll-up |
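
The routing table reads naturally as a small decision function. The sketch below encodes those rows as written. The `Task` fields, the `pick_backend` name, and the choice to raise on an unknown task type are all assumptions made for illustration; how ExecutorRouter actually represents tasks is covered in Tutorial 03, not here.

```python
from dataclasses import dataclass


@dataclass
class Task:
    # Hypothetical task shape; field names are not taken from autodev.
    task_type: str
    files_touched: int = 1
    risk: str = "low"  # "low", "medium", or "high"
    cross_language: bool = False


def pick_backend(task: Task) -> str:
    """Map a task to 'codex' or 'claude' following the routing table rows."""
    if task.task_type in {"scaffold", "test"}:
        return "codex"
    if task.task_type == "feature":
        small = task.files_touched <= 5 and task.risk in {"low", "medium"}
        return "codex" if small else "claude"
    if task.task_type == "integration":
        return "claude" if task.cross_language else "codex"
    if task.task_type in {"refactor", "architecture", "security", "docs", "release"}:
        return "claude"
    # Assumption: the table does not say what happens for unlisted types.
    raise ValueError(f"unknown task type: {task.task_type}")


print(pick_backend(Task("feature", files_touched=3, risk="medium")))
print(pick_backend(Task("feature", files_touched=8)))
```

The return values reuse the `--executor` names (`codex`, `claude`) so the result lines up with the CLI flag.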

All CLI commands

autodev deliver-project    — Brief/PRD → full project delivery
autodev run-issue          — GitHub issue → structured change
autodev fix-bug            — 4-stage bug-fix: reproduce/locate/patch/verify
autodev multi-patch-fix-bug — Generate N patch candidates, vote for best
autodev execute-milestone  — Run a single milestone from a completed plan
autodev continue-run       — Resume a failed run from last checkpoint
autodev replay             — Re-run a stage from a checkpoint
autodev scan               — Scan repo for context
autodev verify             — Run verification on a completed run
autodev release-check      — Evaluate release readiness
autodev report             — Print run summary
autodev export-delivery    — Export delivery artifacts to a directory
autodev push               — Push committed changes to remote
autodev create-pr          — Open a GitHub PR for a completed run
autodev sprint-start       — Open a new sprint
autodev sprint-status      — Check sprint health metrics
autodev sprint-retro       — Run a retrospective
autodev sprint-correct     — Analyse mid-sprint change impact
autodev roundtable         — Party-mode A2A discussion by skill
autodev mcp-serve          — Start MCP server over stdio
autodev a2a-serve          — Start A2A HTTP server
autodev a2a-register       — Register a remote A2A agent
autodev a2a-call           — Send a task to a remote A2A agent
autodev next               — Suggest next action from run state
autodev design-ux          — BMAD-Sally UX design workflow
autodev investigate        — Open a structured investigation case
autodev generate-context   — Generate project-context.md from repo
autodev document-project   — Generate brownfield AI-onboarding docs
autodev classify-input     — Classify a brief/issue into mode + metadata
autodev create-prd         — Generate a PRD from a brief
autodev plan-project       — Generate a project plan from PRD
autodev plan-milestones    — Generate milestones from a project plan
autodev plan-tasks         — Generate tasks from milestones
autodev review             — Approve/reject a paused human-review gate
autodev dashboard          — Launch the Textual TUI dashboard (requires: pip install autodev-x[tui])

Documentation

| Doc | Contents |
|---|---|
| 5-minute quickstart | Install → credentials → first run → explore artifacts |
| Tutorial 01 — Bug-fix flow | 4-stage fix-bug on log-analyzer |
| Tutorial 02 — Rust project | Deliver slug-rs from brief |
| Tutorial 03 — Multi-CLI routing | ExecutorRouter internals |
| Tutorial 04 — Sprint mode | sprint-start / status / retro / correct |
| Tutorial 05 — Roundtable | Party-mode A2A discussion |
| Tutorial 06 — MCP server | Claude Desktop integration |
| Tutorial 07 — A2A server | HTTP agent-to-agent |
| Architecture reference | Layers, flows, audit trail, failure policy |
| Configuration | ConfigStack 4-layer TOML, env vars, Codex/Claude CLI auth, MCP, A2A |
| Troubleshooting | 12 common problems and fixes |
| FAQ | Top 15 questions |
| CHANGELOG | Release history |
| Contributing | How to contribute |

License

MIT.
