Deterministic orchestration shell for autonomous AI agent execution

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Topic
- Software Development :: Build Tools

Project description

Arcwright AI

Deterministic orchestration shell for autonomous AI agent execution.

Arcwright AI takes BMAD planning artifacts (PRD, Architecture, Epics, Stories) and autonomously executes them through Claude, enforcing validation gates, tracking decision provenance, and writing structured run artifacts after every execution.

Prerequisites
Installation
Project Setup
Running Stories
Run Artifacts
Understanding the Output
LangGraph Studio
Development
Troubleshooting

Prerequisites

Python 3.11+ (3.14 recommended; see LangGraph Studio for the exception)
Claude API key (Anthropic): ARCWRIGHT_API_CLAUDE_API_KEY
A project initialised with BMAD (_spec/planning-artifacts/ containing PRD, architecture, epics, and story files)

Installation

From the arcwright-ai/ directory:

python -m venv .venv
.venv/bin/pip install -e ".[dev]"

Set your API key (add to your shell profile or .env):

export ARCWRIGHT_API_CLAUDE_API_KEY="sk-ant-..."

Project Setup

Before dispatching stories, initialise Arcwright AI in your target project (the project whose stories you want to implement — not this repo):

# From inside the target project root:
arcwright-ai init

# Or point explicitly:
arcwright-ai init --path /path/to/your/project

This creates .arcwright-ai/ with the following layout:

.arcwright-ai/
├── config.yaml       ← project-level configuration (committed)
├── runs/             ← execution artifacts (git-ignored)
├── worktrees/        ← git worktrees (git-ignored)
└── tmp/              ← transient scratch space (git-ignored)

config.yaml defaults (edit to suit your project):

model:
  version: "claude-opus-4-6"

limits:
  tokens_per_story: 200000
  cost_per_run: 10.0
  retry_budget: 3
  timeout_per_story: 300

methodology:
  artifacts_path: "_bmad-output"   # where your BMAD planning docs live
  type: "bmad"

scm:
  branch_template: "arcwright-ai/{story_slug}"

API key security: Never put your API key in config.yaml. Set it via ARCWRIGHT_API_CLAUDE_API_KEY environment variable, or in the global ~/.arcwright-ai/config.yaml (user-level, outside any repo).

Verify your setup:

arcwright-ai validate-setup

Running Stories

Dispatch a single story by its epic.story identifier (e.g., story 4 of epic 2 is 2.4):

# From inside the target project root:
arcwright-ai dispatch --story 2.4

# Dashes also work:
arcwright-ai dispatch --story 2-4

The pipeline runs:

preflight → budget_check → agent_dispatch → validate → commit → finalize

Each node writes artifacts to .arcwright-ai/runs/<run-id>/stories/<story-slug>/.

Exit codes:

Code	Meaning
`0`	Story completed successfully
`1`	Unexpected error (configuration, I/O, etc.)
`2`	Story escalated (validation failed, could not auto-fix)

Run Artifacts

Every execution produces a run directory:

.arcwright-ai/runs/<run-id>/
├── run.yaml                          ← metadata: status, cost, story list
└── stories/<story-slug>/
    ├── context-bundle.md             ← assembled context injected into the agent
    ├── agent-output.md               ← raw output from Claude
    ├── validation.md                 ← V6 invariant + V3 reflexion results and decision log
    ├── halt-report.md                ← populated only on escalation
    └── summary.md                    ← produced by finalize node (success or halt)

Run ID format: YYYYMMDD-HHMMSS-<4-char-id> (e.g. 20260305-022632-4b90)

Reading a halt report

When a run escalates, check these files in order:

halt-report.md — escalation reason, retry history, suggested fix
validation.md — exact V6 invariant failures and V3 reflexion AC results
agent-output.md — what Claude produced (verify files actually exist on disk before trusting V6 failures)

Understanding the Output

`status: escalated` vs. failure

escalated means the pipeline ran successfully but validation could not be satisfied within the retry budget. It does not mean the agent crashed. The agent's work (files, code) is still on disk in the target project.

Escalation reasons:

Reason	Meaning
`v6_invariant_failure`	Hard rule violation (missing file, bad name, syntax error) — retries won't help without a fix
`max_retries_exhausted`	V3 reflexion (AC review) kept failing after N retries
`budget_exceeded`	Token/cost ceiling hit before validation passed

False-positive V6 failures

If validation.md shows a file_existence failure for a file that does exist on disk, check whether the path in the error has a leading backtick (e.g., `backend/app/routers/admin.py). This is a known pattern when the agent uses inline code formatting in markdown headers. The V6 checker strips backticks as of the current version. If you see this after upgrading from an older run, the files are fine — re-run to get a clean pass.

LangGraph Studio

Arcwright AI ships a langgraph.json config so you can visualise and inspect the execution graph in LangGraph Studio.

Why a separate venv?

The main .venv uses Python 3.14. The langgraph-api package (required for langgraph dev) depends on pyo3-based Rust extensions that do not yet publish wheels for Python 3.14 and cannot be compiled without matching support. A separate Python 3.13 venv is used exclusively for Studio.

One-time setup

Ensure Python 3.13 is available (via Homebrew or pyenv), then:

cd arcwright-ai/

# Create Studio venv with Python 3.13
python3.13 -m venv .venv-studio

# Install project + LangGraph Studio deps
.venv-studio/bin/pip install -e ".[dev]" "langgraph-cli[inmem]"

Starting Studio

cd arcwright-ai/
.venv-studio/bin/langgraph dev

The server starts at http://127.0.0.1:2024. Open the Studio UI at:

https://smith.langchain.com/studio/?baseUrl=http://127.0.0.1:2024

You'll see the story_graph with all nodes and conditional edges:

START → preflight → budget_check ──(ok)──→ agent_dispatch → validate ──(success)──→ commit → finalize → END
                          │                                       │
                     (exceeded)                               (escalated)
                          └──────────────────────────────────────┴──→ finalize → END
                                                    ↑(retry)
                                          validate ─┘ → budget_check

A free LangSmith account is required to use the Studio UI. The local API server itself runs without one.

Development

All development commands use the main .venv (Python 3.14):

# Install dev dependencies
pip install -e ".[dev]"
# Prefer explicit venv invocation to avoid interpreter mismatch:
.venv/bin/pip install -e ".[dev]"

# Run tests
.venv/bin/python -m pytest -q

# Lint
.venv/bin/ruff check .
.venv/bin/ruff format --check .

# Type check
.venv/bin/python -m mypy --strict src/

# All quality gates in one pass
.venv/bin/ruff check . && .venv/bin/ruff format --check . && .venv/bin/python -m mypy --strict src/ && .venv/bin/python -m pytest -q

Python version note

The project targets Python 3.11+ and is developed against 3.14. The .venv-studio venv (Python 3.13) is only for running langgraph dev. Do not use it for tests or type checking — results may differ.

Troubleshooting

ModuleNotFoundError: No module named 'arcwright_ai'

The venv's editable install link may be stale or was not processed correctly on Python 3.14. Re-install:

cd arcwright-ai/
.venv/bin/pip install -e .

This rewrites the .pth file. Verify with:

.venv/bin/python -c "import arcwright_ai; print(arcwright_ai.__file__)"

langgraph dev fails with Required package 'langgraph-api' is not installed

You're using the main .venv (Python 3.14). Use .venv-studio instead:

.venv-studio/bin/langgraph dev

Story dispatched but files don't match what validation expected

Check .arcwright-ai/config.yaml in the target project. The methodology.artifacts_path must point to the directory containing your BMAD planning artifacts (PRD, architecture, epics). Default is _bmad-output; adjust if your project uses _spec/planning-artifacts or another path.

Dev agent File List is consistently incomplete or doesn't match git diff output after a BMAD update

The dev-story workflow in this project includes a custom git diff audit step (Step 9 of instructions.xml) that was added to address a systemic issue where 67% of stories had File List discrepancies. This customization lives in _bmad/bmm/workflows/4-implementation/dev-story/ — a directory that is gitignored and gets overwritten by BMAD framework updates.

If you have recently run a BMAD update and agent File Lists are again going unaudited, the customization was likely overwritten. See the BMAD Workflow Customizations section in the root README.md for the exact changes to re-apply.

Project details

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Topic
- Software Development :: Build Tools

Release history Release notifications | RSS feed

0.2.42

Mar 22, 2026

0.2.41

Mar 22, 2026

0.2.40

Mar 22, 2026

0.2.39

Mar 22, 2026

0.2.38

Mar 21, 2026

0.2.37

Mar 20, 2026

0.2.36

Mar 20, 2026

0.2.35

Mar 20, 2026

0.2.34

Mar 20, 2026

0.2.33

Mar 20, 2026

0.2.32

Mar 20, 2026

0.2.31

Mar 20, 2026

0.2.30

Mar 20, 2026

0.2.29

Mar 20, 2026

0.2.28

Mar 20, 2026

0.2.27

Mar 20, 2026

0.2.26

Mar 20, 2026

0.2.25

Mar 19, 2026

0.2.23

Mar 19, 2026

0.2.22

Mar 19, 2026

0.2.21

Mar 19, 2026

0.2.20

Mar 18, 2026

0.2.19

Mar 18, 2026

0.2.18

Mar 18, 2026

0.2.17

Mar 18, 2026

0.2.16

Mar 18, 2026

0.2.15

Mar 18, 2026

0.2.14

Mar 18, 2026

0.2.13

Mar 18, 2026

0.2.12

Mar 18, 2026

0.2.11

Mar 18, 2026

0.2.10

Mar 18, 2026

0.2.9

Mar 18, 2026

0.2.8

Mar 16, 2026

0.2.7

Mar 16, 2026

0.2.6

Mar 16, 2026

0.2.5

Mar 16, 2026

0.2.4

Mar 16, 2026

0.2.3

Mar 16, 2026

0.2.2

Mar 16, 2026

0.2.1

Mar 16, 2026

0.2.0

Mar 16, 2026

This version

0.1.1.dev0 pre-release

Mar 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arcwright_ai-0.1.1.dev0.tar.gz (336.0 kB view details)

Uploaded Mar 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

arcwright_ai-0.1.1.dev0-py3-none-any.whl (138.2 kB view details)

Uploaded Mar 15, 2026 Python 3

File details

Details for the file arcwright_ai-0.1.1.dev0.tar.gz.

File metadata

Download URL: arcwright_ai-0.1.1.dev0.tar.gz
Upload date: Mar 15, 2026
Size: 336.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for arcwright_ai-0.1.1.dev0.tar.gz
Algorithm	Hash digest
SHA256	`31096deebf480aebbe28679472b3b24ff179bba3eb4eabbbf2104e7045a82253`
MD5	`f31dbeee5a932bbab7e7e69fe9988c9f`
BLAKE2b-256	`e0fe25d4e331a1774cc50f5c170d2b907875d5aeed4eb42c68ab10208c3dbf91`

See more details on using hashes here.

File details

Details for the file arcwright_ai-0.1.1.dev0-py3-none-any.whl.

File metadata

Download URL: arcwright_ai-0.1.1.dev0-py3-none-any.whl
Upload date: Mar 15, 2026
Size: 138.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for arcwright_ai-0.1.1.dev0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`33dd070f3f8f04f81bcc44a4f9f04865b42db3dff2bd93eac21e2870ae6de490`
MD5	`7f6a2c55e60e9909422169435b41f424`
BLAKE2b-256	`a956ae3455854bfd686177e654bd995047c3c85e819aacd6dc437d02132c58fd`

See more details on using hashes here.

arcwright-ai 0.1.1.dev0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Arcwright AI

Table of Contents

Prerequisites

Installation

Project Setup

Running Stories

Run Artifacts

Reading a halt report

Understanding the Output

status: escalated vs. failure

False-positive V6 failures

LangGraph Studio

Why a separate venv?

One-time setup

Starting Studio

Development

Python version note

Troubleshooting

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`status: escalated` vs. failure