CLI orchestrator for multi-step AI execution plans (Codex, Claude, Gemini, Copilot, Qwen, Ollama, Llama)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

  ██   ██  ████  ██████  ████ ██████ █████   ████
  ███ ███ ██  ██ ██     ██      ██   ██  ██ ██  ██
  ██ █ ██ ██████ █████   ███    ██   █████  ██  ██
  ██   ██ ██  ██ ██        ██   ██   ██ ██  ██  ██
  ██   ██ ██  ██ ██████ ████    ██   ██  ██  ████

  Maestro CLI -- Version 2.5.4
  CLI orchestrator for multi-step AI execution plans

Maestro CLI

Maestro turns a YAML file into a parallel, dependency-aware pipeline of AI agents and shell commands. Instead of gluing engine CLIs together with bash, you declare the pipeline once and get parallel DAG scheduling, context passing between steps, cost budgets, quality gates, and deterministic, replayable logs for free -- across Claude, Codex, Gemini, Copilot, Qwen, Ollama, and Llama, from one dependency-light CLI (PyYAML is the only required dependency; everything else is stdlib or optional).

# the smallest useful plan
version: 1
name: hello
tasks:
  - id: greet
    engine: claude
    prompt: "Say hello in three languages."

maestro run plan.yaml      # runs the DAG; one engine call, deterministic logs in .maestro-runs/

That example needs the claude CLI on PATH. To try Maestro with zero setup and no API keys, run the engine-free examples/demo_plan.yaml from the Quickstart below.

Maestro CLI demo

Supported Engines

Claude Codex Gemini Copilot Qwen Ollama Llama

Contents: Why Maestro? · Python SDK · Install · Quickstart · Features · Plan Schema · CLI Commands · Writing Plans · Models · Architecture · Troubleshooting

Why Maestro?

One YAML, many engines -- orchestrate Claude, Codex, Gemini, Copilot, Qwen, Ollama, and Llama (plus raw shell commands) in a single plan, with per-engine model aliases and reasoning-effort control.
Parallel DAG scheduling -- declare dependencies and Maestro runs tasks in the right order with configurable parallelism, matrix expansion, and nested sub-plans.
Context that flows -- pass outputs between tasks with 11 context modes, from zero-cost BM25 selection to multi-model council deliberation, with token budgets and progressive compaction.
Quality gates built in -- LLM-as-Judge (rubrics, G-Eval, debate, quorum), zero-cost typed assertions, and verify_command retries with feedback injection keep results honest.
Cost-aware and resilient -- per-task and cross-run budgets, retries with backoff, auto-escalation to stronger models, cross-engine fallback, and circuit breakers.
Deterministic and observable -- every run is logged to JSON/JSONL with hash-chained, tamper-detectable events; watch it live in a TUI, a Web UI, or maestro report.
Secure by default -- untrusted-context taint tracking, prompt-injection containment (allowed_tools:), and 23 audit rules via maestro audit. The optional Web UI/API confines plan and run paths to the project root and limits CORS to same-machine origins by default.
Dependency-light -- PyYAML is the only required dependency; everything else is stdlib or an optional extra.

For the exhaustive capability list, see Features below. | v1 stability contract: docs/V1_API_FREEZE.md | Migration from 0.x: docs/MIGRATING_TO_V1.md

Python SDK

Maestro CLI ships with a programmatic API (29 stable exports) and a py.typed marker for static analysis:

from maestro_cli import load_plan, run_plan

plan = load_plan("plan.yaml")
result = run_plan(plan)

Install

pip install maestro-ai-cli                          # core CLI
pip install "maestro-ai-cli[live]"                  # + Rich live table (--output live)
pip install "maestro-ai-cli[tui]"                   # + Textual TUI (--output tui)
pip install "maestro-ai-cli[web]"                   # + Web UI (FastAPI + uvicorn)
pip install "maestro-ai-cli[agui]"                  # + AG-UI protocol endpoint
pip install "maestro-ai-cli[mcp]"                   # + MCP server for IDE integration
pip install "maestro-ai-cli[otel]"                  # + OpenTelemetry OTLP exporter
pip install "maestro-ai-cli[live,web,tui,agui,mcp,otel]"   # everything

The package installs as maestro-ai-cli; the command is maestro and the Python import is maestro_cli. For development, install from source instead:

git clone https://github.com/tiagojcperez/maestro-cli.git
cd maestro-cli && pip install -e ".[live,web,tui,agui,mcp,otel]"

Requires Python >= 3.11 and at least one engine CLI on PATH (codex, claude, gemini, copilot, qwen, ollama, llama).

Quickstart

# Validate a plan (examples/demo_plan.yaml runs with no API keys)
maestro validate examples/demo_plan.yaml
# -> "Plan is valid: ... name: demo, tasks: 4"

# Run it -- the demo uses only shell tasks, so no API keys are needed
maestro run examples/demo_plan.yaml
# -> "... 4 ok / 0 failed / 0 skipped"  (artifacts written to .maestro-runs/)

# Dry run (build commands without executing)
maestro run examples/demo_plan.yaml --dry-run

# Run with parallel tasks
maestro run examples/demo_plan.yaml --max-parallel 3 --execution-profile yolo

# Resume from last failed run
maestro run examples/demo_plan.yaml --resume-last

# Stream structured events to stdout
maestro run examples/demo_plan.yaml --output jsonl

# Run multiple plans with shared budget
maestro run examples/demo_plan.yaml examples/demo_plan.yaml --parallel

# Adaptive re-planning on failure (multi-variant search, tournament selection,
# elitism and diversity floors are documented in docs/CLI_REFERENCE.md)
maestro replan examples/demo_plan.yaml --max-attempts 3

# Autonomous metric-driven iteration loop (your plan needs a watch: block)
maestro watch your-plan.yaml --output tui

# Multi-model interactive chat
maestro chat --engine claude --model sonnet

# Trace failure causality in a completed run
maestro blame .maestro-runs/<run-dir>/

# Diagnose environment
maestro doctor --json

# Security audit a plan
maestro audit examples/demo_plan.yaml --fix

maestro audit catches dangerous plans before they run — a structurally valid plan can still be unsafe (unbounded spend, rm -rf, yolo bypass flags):

Maestro audit

Next steps: write your own plan with the Plan Guide -> copy a ready-made recipe from the Playbook -> look up any flag in the CLI Reference.

Features

Core

Category	Features
Scheduling	YAML DAG format, dependency validation, cycle detection (DFS), parallel execution with `max_parallel`, matrix expansion, batch task mode
Engines	`codex exec`, `claude --print`, `gemini`, `copilot` (autopilot), `qwen`, `ollama` (local), `llama` (local via llama-cpp), raw shell commands; execution profiles (plan/safe/yolo)
Prompts & context	Inline, file, or markdown extraction; inter-task context passing (11 modes: raw/selective/summarized/map_reduce/recursive/layered/structural/council/knowledge_graph/codebase_map/scip); progressive compaction; privacy pipeline (`output_redact`, `context_allowlist`)
Reliability	`verify_command`, workspace assertions (`assert:`), `max_retries` with feedback injection, retry strategies (constant/linear/exponential), `allow_failure`, auto-escalation, cross-engine fallback, circuit breakers, `checkpoint` protocol
Cost control	Per-task cost/token tracking, budget limits (`max_cost_usd`), cross-run budget tracking (`budget_period`), per-engine pricing tables, budget warning thresholds
Quality gates	LLM-as-Judge with typed assertions, Likert rubrics, G-Eval, adversarial debate judge, comparative eval, named presets, `guard_command`, quorum voting, timeout auto-scaling
Security	`maestro audit` (SEC001-SEC023) + `--fix`; `allowed_tools:` per-task restriction; untrusted-context detection and taint propagation; control flow integrity; trajectory guardrails; semantic firewall for MCP metadata; phantom workspace; git worktree isolation per task; Web UI/API path confinement + localhost-default CORS
Caching	Policy-versioned SHA-256 Merkle DAG keys, short-lived negative cache (`negative_cache_ttl_sec`), contamination-aware bypass, pre-hash normalization, eviction "why" fields, `--no-cache`, `maestro explain` / `maestro status`
Flow control	Conditional execution (`when`), `fail_fast`, `--only`/`--skip` filtering, `--tags`/`--skip-tags`, approval gates
Secrets	`secrets:` plan field (explicit list or `auto`), `--mask-secrets` CLI flag

Advanced capabilities — relational contracts, adaptive search, persistent memory, policies, watch loops, blame, event sourcing

Category	Features
Relational safety	Typed `contract_type:` producers, `consumes_contracts:` consumers, `consistency_group:` membership, `reconcile_after:` group gates
Adaptive	Mid-task signals (`signals: true`) — progress, metrics, timeout extension, budget query, artifacts; dynamic task decomposition (`dynamic_group: true`); cross-run knowledge auto-injection with prompt-relevant retrieval + `{{ knowledge_index }}`; adaptive temporal routing with trend detection + cross-task affinity; population-based search (best-of-N models); MCTS workflow search (draft/debug/improve, UCB1, `tree.jsonl`); self-evolving `replan` (multi-variant, tournament, elitism, diversity, novelty/knowledge priors, stepping stones)
Knowledge + memory	SQLite-backed per-plan memory store (`.maestro-cache/memory/<plan>.db`) with WAL, automatic JSONL migration, time-decayed confidence, bi-temporal records (`valid_from`/`valid_to` + `recorded_at`), provenance/trust labels, conflict resolution, relation confidence, poisoning quarantine, and score history (`ScoreRecord`, `plan_hash`, `quality_score`)
Plan intelligence	`deliberation: true` — haiku pre-call skips engine if task is self-answerable; `maestro validate` prints DAG density report (S_complex, W17/W18/W19 warnings); `output_schema` for structured inter-task typed outputs
Imports	`imports:` for reusable task templates with prefix namespacing, nested imports, cycle detection
Policies	Declarative runtime policies (`block`/`warn`/`audit`) with safe AST evaluation
Event sourcing	Hash-chained `events.jsonl` with tamper detection; `maestro verify` validates integrity
Blame	`maestro blame` traces failure causality, classifies root causes, suggests fixes
Watch	Autonomous iteration loops (`maestro watch`): custom metric-driven mode and built-in plan improvement mode (`mode: improve`); git commit/rollback, consolidation agent with safety gates (trust labels, instructionality rejection, firewall), experiments.jsonl

Output Modes

Mode	Flag	Description
Text	`--output text`	Default. Human-readable `[maestro]` console output
JSONL	`--output jsonl`	Structured JSON Lines events to stdout; suppresses text
Live	`--output live`	Real-time Rich table with task progress, cost, duration. `pip install maestro-ai-cli[live]`
TUI	`--output tui`	Interactive Textual app with DAG panel, detail panel, event feed, keyboard nav. `pip install maestro-ai-cli[tui]`

Maestro TUI

Observability

Feature	Description
Token tracking	`TokenUsage` per task (input/cached/output), aggregated in manifest and summary
Web UI	Dashboard with stats, charts, cost trends, and collaboration summaries; run detail with ownership/blockers/activity, Gantt timeline, and log viewer
Structured errors	Error codes E001-E072 (validation), E100-E110 (runtime); warning codes W1-W30
Mid-task signals	`task_progress`, `task_metric`, `task_artifact`, `timeout_extended` events from running tasks
OTel export	`maestro export-otel` emits per-task spans with `gen_ai.*` attributes, optional content capture/redaction, and task events such as `knowledge_poison_alert` and `memory_write`
Diagnostics	`maestro doctor` checks Python, PyYAML, engine CLIs, plugins, Git; `--full` adds cache, knowledge, skills, plans, prior runs

Plan Schema (compact)

version: 1
name: my-plan
workspace_root: /path/to/project
max_parallel: 3
max_cost_usd: 25.00
budget_warning_pct: 0.8

defaults:
  timeout_sec: 600
  claude:
    model: sonnet

tasks:
  - id: setup
    command: ["bash", "-c", "npm install"]
    tags: [infra]

  - id: implement
    depends_on: [setup]
    engine: claude
    model: haiku
    escalation: [haiku, sonnet, opus]
    negative_cache_ttl_sec: 300
    prompt: "Implement the feature..."
    verify_command: "npm test"
    max_retries: 2

  - id: review
    depends_on: [implement]
    context_from: [implement]
    context_mode: summarized
    engine: claude
    cache: false
    prompt: "Review changes: {{ implement.summary }}"
    judge:
      method: g_eval
      criteria:
        - type: rubric
          name: correctness
          levels:
            - { score: 1, description: "Major bugs" }
            - { score: 5, description: "Correct and well-tested" }
      pass_threshold: 0.7
      on_fail: warn

  - id: deploy
    depends_on: [review]
    when: "{{ review.status }} == success"
    requires_approval: true
    command: "echo 'Deploying...'"

Full annotated schema with all fields: docs/PLAN_GUIDE.md

CLI Commands

Group	Commands
Plan lifecycle	`validate`, `check`, `run`, `replan`, `scaffold`, `watch`
Observability	`report`, `diff`, `explain`, `status`, `eval`, `suggest`, `blame`, `budget`
Security	`audit`, `verify`
Infrastructure	`doctor`, `ci`, `ci-analyze`, `cleanup`, `backfill-costs`, `ui`, `mcp-server`, `export-otel`
Interactive	`chat`, `shell`
Discovery	`skill` (`list`, `search`, `recommend`)

Full CLI reference with all flags: docs/CLI_REFERENCE.md

Writing Effective Plans

Prompt sources: inline prompt:, file via prompt_file:, or markdown extraction via prompt_md_file: + prompt_md_heading:.

Context passing: use context_from: to inject upstream outputs. Nine modes: raw (free), selective (BM25 chunk-level, free), summarized (haiku per upstream), map_reduce (N haiku + synthesis), recursive (full workspace awareness), layered (L0/L1/L2 budget-aware tiers), structural (package-aware symbol extraction with re-export resolution and PageRank-based blast-radius scoring, free), council (multi-model deliberation with star/chain/graph topologies), knowledge_graph (entity extraction, free). Control budget with context_budget_tokens:. Add context_compaction: progressive for staged degradation. Use maestro explain --context to see why context was selected.

Cross-run memory: full runs accumulate reusable lessons in .maestro-cache/memory/<plan>.db. Matching records are injected automatically as {{ task_knowledge }}, and a compact {{ knowledge_index }} is available when you want the model to choose from a wider memory surface without pasting every record.

Semantic firewall: set top-level firewall_model: haiku to enable an opt-in second pass for MCP metadata and tainted upstream text (stdout_tail, result_text, summary). Classifier failures fail open to the deterministic sanitizer rather than blocking task execution. For MCP providers that mutate shared local state, set mcp_servers[].is_concurrency_safe: false so parallel worktree: true tasks are serialized around that provider.

Verify + retry: add verify_command: + max_retries: -- failure output is auto-injected into the retry prompt. Add retry_delay_sec: [2, 5, 15] for backoff.

Caching policy: keep default positive caching for deterministic tasks. For flaky or rate-limited tasks, set negative_cache_ttl_sec: to suppress repeated failures for a short window without pinning bad results for too long. Results from untrusted/tainted/partial runs, plus structured tool-failure outputs, are excluded from positive cache writes.

Tracing: maestro export-otel converts completed runs into OTLP spans or JSON, attaching gen_ai.* attributes, optional task input/output previews via --include-content, and prompt redaction via --otel-mask-prompts. Span events include runtime signals such as knowledge_poison_alert and memory_write.

Quality gates: judge: block with typed assertions (zero-cost contains/regex/is-json) and LLM rubrics. guard_command: for lightweight stdin-pipe validation.

Budget: max_cost_usd: at plan level. Set budget_warning_pct: 0.8 for early alerts. Track across runs with budget_period: weekly.

Conditional execution: when: "{{ task.status }} == success" enables deploy-on-success / rollback-on-failure patterns.

Signals: signals: true on long-running tasks for progress reporting, timeout extension, and budget queries via [MAESTRO_SIGNAL] stdout protocol.

Watch loops: maestro watch plan.yaml for autonomous metric-driven iteration. mode: improve auto-fixes plan failures.

Complete guide with examples: docs/PLAN_GUIDE.md

Models (quick reference)

Engine	Example Aliases	Reasoning Effort	Cost Model
Claude	`haiku`, `sonnet`, `opus`	`low` to `max` (Opus; `xhigh`/`max` are Opus-tier)	Per-token
Codex	`5.4-mini`, `5.4`, `5.5`	`none` to `xhigh`	Per-token
Gemini	`flash`, `pro`, `pro-3.1`	N/A (use model selection)	Per-token
Copilot	`sonnet`, `gpt-5.4-codex`, `gemini-pro`	N/A	Subscription
Qwen	`coder`, `max`, `qwq`	N/A	Per-token
Ollama	`llama4`, `qwen3-coder`, `deepseek-r1`	N/A	Free (local)
Llama	`llama3`, `llama4-scout`, `codellama`	N/A	Free (local, llama-cpp)

Set model: auto for automatic routing based on task complexity. Control bias with routing_strategy: cost_optimized | quality_first | balanced.

Full model alias tables and pricing: docs/MODELS.md

Execution Profiles

Profile	Codex	Claude	Gemini	Copilot	Qwen	Ollama	Llama
`plan`	Use YAML args exactly	Use YAML args exactly	Use YAML args exactly	Use YAML args exactly	Use YAML args exactly	Use YAML args exactly	Use YAML args exactly
`safe`	Forces sandbox + approval gates	Forces default permissions	Strips dangerous flags	Strips `--yolo`	Strips `--yolo`	No change	No change
`yolo`	Ensures full bypass	Ensures `--dangerously-skip-permissions`	Ensures `--approval-mode yolo`	Ensures `--yolo`	Ensures `--yolo`	No change	No change

Output Structure

.maestro-runs/<timestamp>_<plan-name>/
├── .cache/               # Task result cache (content-addressable)
├── run_manifest.json      # Aggregated results (status, cost, tokens, plan_hash/quality_score on full runs)
├── run_summary.md         # Human-readable summary with cost/timing/tokens
├── events.jsonl           # Structured event log (hash-chained, tamper-detectable)
├── <task-id>.log          # Execution transcript
└── <task-id>.result.json  # Structured result (status, exit_code, duration, cost, tokens)

Cross-run state lives separately in .maestro-cache/: task cache entries, knowledge JSONL from older versions, and the current SQLite memory store under .maestro-cache/memory/.

Architecture

src/maestro_cli/
├── cli.py             # argparse CLI (27 subcommands) + --version + banner
├── models.py          # Dataclasses and typed results (PlanSpec, TaskSpec, TaskResult, ScoreRecord, ...)
├── loader.py          # YAML parsing + validation + cycle detection + matrix + imports
├── runners.py         # Command building + execution + verify/retry + judge + signals + secrets
├── scheduler.py       # DAG scheduler (ThreadPoolExecutor) + context + policies + budget getter
├── routing.py         # Semantic model routing (model: auto) + predictive + temporal + cross-task
├── memory.py          # SQLite-backed Knowledge + Memory v2 (WAL, provenance, score history)
├── policy.py          # Declarative runtime policy engine (safe AST)
├── blame.py           # Causal failure attribution
├── audit.py           # Plan security scanner (SEC001-SEC023)
├── eventsource.py     # Hash-chained event sourcing + verify
├── replan.py          # Adaptive re-planning + multi-variant search + knowledge-guided scoring
├── mcts.py            # MCTS workflow search (WorkflowVariant tree, selection, simulation, pruning)
├── watch.py           # Autonomous metric-driven iteration loop (custom + improve mode)
├── worktree.py        # Git worktree isolation per task
├── dynamic.py         # Dynamic task decomposition (dynamic_group runtime sub-plans)
├── knowledge.py       # Cross-run knowledge accumulation + consolidation + auto-inject
├── contracts.py       # Typed contract normalization (sql-schema, api-schema, etc.)
├── budget.py          # Cross-run budget tracking and ledger
├── chat.py            # Multi-model interactive terminal
├── shell.py           # Interactive REPL with slash commands
├── plugins.py         # Custom engine plugin discovery
├── multi.py           # Multi-plan execution
├── live.py            # Rich live output + signal progress (--output live)
├── tui/               # Textual TUI + signal events (--output tui)
├── web/               # FastAPI + vanilla JS dashboard
├── cache.py           # Task/plan hashing + semantic cache policy
├── codebase_graph.py  # AST-backed codebase graph + blast radius analysis
├── scaffold.py        # Plan generation from briefs + workflow libraries
├── skill_registry.py  # Skill discovery, search, and recommendation (maestro skill)
├── ci_agent.py        # CI failure analysis and remediation (maestro ci-analyze)
├── suggest.py         # Run history analysis + optimization heuristics
├── eval.py            # Batch judge evaluation
├── diff.py            # Run comparison
├── explain.py         # Cache hit/miss explanation + context trajectory
├── status.py          # Task staleness detection
├── report.py          # HTML report generation
├── cost_backfill.py   # Historical cost/token backfill
├── cleanup.py         # Run directory cleanup
├── doctor.py          # Environment diagnostics
├── ag_ui.py           # AG-UI protocol adapter (event translation + state tracking)
├── mcp_server.py      # MCP server (12 tools, 8 resources, 3 prompts)
├── otel.py            # OTLP exporter (run → OpenTelemetry spans)
├── council.py         # Multi-model deliberation (star/chain/graph topologies)
├── knowledge_graph.py # Entity extraction + graph-based context
├── symbols.py         # Regex-based code symbol extraction (10 languages)
├── errors.py          # PlanValidationError (E001-E072), TaskExecutionError (E100-E110)
└── utils.py           # Paths, templates, markdown extraction

Design principles: local-first persistence (run artifacts + SQLite memory), engine-agnostic (shells out to CLIs), minimal deps (only PyYAML in core; SQLite is stdlib), strong typing (dataclasses throughout).

Testing & Guarantees

Maestro is tested offline-first. The full suite (13k+ tests) runs on every push across Python 3.11 / 3.12 / 3.13 plus a Windows lane, alongside strict mypy, a documentation lint, and CodeQL. Engine calls are mocked in CI; the real-engine end-to-end tests are opt-in (MAESTRO_RUN_REAL_ENGINE_TESTS=1, they need provider credentials and cost money) and run on a separate, manually-enabled lane. Coverage is uploaded to Codecov and Coveralls (with optional SonarCloud and Codacy backends — see docs/COVERAGE_PLATFORMS.md).

Area	Unit	Integration	Real-engine
DAG scheduling / dependencies	✅	✅	n/a
Shell tasks	✅	✅	n/a
Engines (codex/claude/gemini/copilot/qwen/ollama/llama)	✅ (mocked)	✅ (command build)	opt-in
Context passing (11 modes)	✅	✅	partial
Budgets / cost / token tracking	✅	✅	partial
Quality gates (judge / verify / guard / assert)	✅	✅	partial
Retries / fallback / circuit breakers	✅	✅	n/a
Secret masking	✅	✅	n/a
Policy engine (safe AST)	✅ (fuzzed)	✅	n/a
Security audit (SEC001-SEC023)	✅	✅	n/a
Cache / event sourcing / blame	✅	✅	n/a

Maestro guarantees DAG ordering and dependency semantics, retry/fallback behaviour, the documented version: 1 plan schema and run-artifact shapes (the v1 stability contract), best-effort secret masking, and deterministic, replayable run logs.

Maestro does not guarantee deterministic LLM output, provider/CLI availability, model-pricing accuracy (the pricing tables are best-effort and overridable), or the safety of arbitrary user-authored shell commands — you own the plans you run.

For the full per-engine breakdown (which engines have real end-to-end tests, what runs in default CI, and the outstanding gaps stated honestly), see docs/TESTED_GUARANTEES.md.

Troubleshooting

Symptom	Cause	Fix
`[E0xx] message`	Plan validation error	Check error code, fix the YAML
`Heading not found in markdown`	`prompt_md_heading` includes `##`	Remove the `##` prefix
`No code fence found`	Prompt text not in ```text block	Wrap prompt in code fence
Task immediately fails	Engine CLI not on PATH	Run `maestro doctor`
Copilot auth fails	GitHub token missing/expired	Set `COPILOT_GITHUB_TOKEN` or `GH_TOKEN`
`exit_code: 124`	Task timed out	Increase `timeout_sec`
Task skipped	Dependency failed or `fail_fast`	Fix upstream, use `--resume-last`
Shell fails on Windows	`shell: true` uses `cmd.exe`	Use list-format command with Git Bash
Budget exceeded	`max_cost_usd` limit reached	Increase limit or optimize model selection
Judge times out	`g_eval` + many criteria	Set `judge.timeout_sec: 180` or let auto-scaling handle it
Worktree merge conflict	Parallel tasks edit same files	Use `worktree: true` with non-overlapping tasks, or add a reconciler

For Windows-specific pitfalls, see docs/PITFALLS.md.

Roadmap

See CHANGELOG.md for full release history and docs/ROADMAP.md for planned features.

Current repo state: v2.5.4 is the latest release — parameter-scoped tool grants: allowed_tools entries like Bash(git *) or Write(src/*) are now enforced (natively via Claude's --allowedTools, plus post-hoc verification of the observed tool-call stream on every engine that exposes one), with a new on_grant_violation: warn|fail gate, a tool_grant_violation event, and a has_scoped_tools policy field. Also: context_mode: codebase_map/scip now work standalone (without context_from), and CI tooling is pinned via a Dependabot-maintained constraints file. Builds on v2.5.3 (estimate, hardware-aware routing, scip, FTS5 selective) and v2.5.2/v2.5.1 (FTS5 knowledge ranking, codebase_map). 7 engines, 11 context modes, SQLite-backed memory, and ~13.6K tests in the latest full-suite run.

Requirements

Python >= 3.11
PyYAML >= 6.0
Engine CLIs on PATH: codex, claude, gemini, copilot, qwen, ollama, llama (as needed)
Optional: [live] (Rich), [tui] (Textual), [web] (FastAPI + uvicorn), [agui] (AG-UI protocol), [mcp] (MCP SDK), [otel] (OpenTelemetry OTLP exporter)

Contributing

See CONTRIBUTING.md for development setup, testing, CI, and code conventions. Please also read our Code of Conduct, and the security policy for reporting vulnerabilities privately.

Full documentation index: docs/README.md.

License

Maestro CLI is released under the MIT License.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

tiagojcperez

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.5.4

Jun 9, 2026

2.5.3

Jun 9, 2026

2.5.2

Jun 9, 2026

2.5.1

Jun 8, 2026

2.5.0

Jun 4, 2026

2.4.0

Jun 4, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

maestro_ai_cli-2.5.4.tar.gz (1.6 MB view details)

Uploaded Jun 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

maestro_ai_cli-2.5.4-py3-none-any.whl (511.2 kB view details)

Uploaded Jun 9, 2026 Python 3

File details

Details for the file maestro_ai_cli-2.5.4.tar.gz.

File metadata

Download URL: maestro_ai_cli-2.5.4.tar.gz
Upload date: Jun 9, 2026
Size: 1.6 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for maestro_ai_cli-2.5.4.tar.gz
Algorithm	Hash digest
SHA256	`d6b43fec2317d5da3497980f97000207ca8302f5996b6754bf4cc008dee85b0d`
MD5	`f52d3bd39942207186e84f94c5705759`
BLAKE2b-256	`6dae84c9818aa329ea86630bf5a6dd96e3e7555e159549cee1649ad275e06eb9`

See more details on using hashes here.

Provenance

The following attestation bundles were made for maestro_ai_cli-2.5.4.tar.gz:

Publisher: publish.yml on tiagojcperez/maestro-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: maestro_ai_cli-2.5.4.tar.gz
- Subject digest: d6b43fec2317d5da3497980f97000207ca8302f5996b6754bf4cc008dee85b0d
- Sigstore transparency entry: 1770202788
- Sigstore integration time: Jun 9, 2026
Source repository:
- Permalink: tiagojcperez/maestro-cli@68787407fe0b1477792db842329756587439e409
- Branch / Tag: refs/tags/v2.5.4
- Owner: https://github.com/tiagojcperez
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@68787407fe0b1477792db842329756587439e409
- Trigger Event: release

File details

Details for the file maestro_ai_cli-2.5.4-py3-none-any.whl.

File metadata

Download URL: maestro_ai_cli-2.5.4-py3-none-any.whl
Upload date: Jun 9, 2026
Size: 511.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for maestro_ai_cli-2.5.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8e5bdc0ec82d075802d99b746d6a5939c434e055303ea3063858e49a579fd16c`
MD5	`989109f144a28bb597e695dd735f8c06`
BLAKE2b-256	`fcce0a9fe92ec4a32678e63ca91b1919525cddbfd029ba777bd2b9a85edbe39d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for maestro_ai_cli-2.5.4-py3-none-any.whl:

Publisher: publish.yml on tiagojcperez/maestro-cli

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: maestro_ai_cli-2.5.4-py3-none-any.whl
- Subject digest: 8e5bdc0ec82d075802d99b746d6a5939c434e055303ea3063858e49a579fd16c
- Sigstore transparency entry: 1770202898
- Sigstore integration time: Jun 9, 2026
Source repository:
- Permalink: tiagojcperez/maestro-cli@68787407fe0b1477792db842329756587439e409
- Branch / Tag: refs/tags/v2.5.4
- Owner: https://github.com/tiagojcperez
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@68787407fe0b1477792db842329756587439e409
- Trigger Event: release

maestro-ai-cli 2.5.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Maestro CLI

Supported Engines

Why Maestro?

Python SDK

Install

Quickstart

Features

Core

Output Modes

Observability

Plan Schema (compact)

CLI Commands

Writing Effective Plans

Models (quick reference)

Execution Profiles

Output Structure

Architecture

Testing & Guarantees

Troubleshooting

Roadmap

Requirements

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance