Skip to main content

Claude Code plugins for autonomous development workflows

Project description

stellars-claude-code-plugins

GitHub Actions PyPI version Total PyPI downloads Python 3.12 Brought To You By KOLOMOLO

AI coding agents generate impressive code but cut corners when left unsupervised - skipping tests, losing context between iterations, shipping shallow fixes that pass benchmarks without addressing root causes. The longer an agent runs autonomously, the more these failures compound.

This project provides a shared YAML-driven orchestration engine that pulls agents through structured phases with independent quality gates at every boundary. Instead of relying on the agent's self-discipline, the engine enforces research before implementation, hypothesis tracking across iterations, and multi-agent review before any code ships.

[!TIP] Each plugin provides only YAML configuration files. The shared orchestration engine in stellars_claude_code_plugins handles all execution logic - FSM transitions, gate validation, multi-agent coordination, and state management.

[!NOTE] Read the full article on the approach: Your AI Agent Will Cut Corners. Here's How to Stop It.

What it solves

  • Shallow fixes - forces research and hypothesis before implementation
  • Scope creep - plan locks scope, review catches deviations
  • Lost context - hypothesis catalogue and failure context persist across iterations
  • Unchecked quality - two independent gates (readback + gatekeeper) per phase
  • No accountability - every phase records agents, outputs, and verdicts in YAML audit logs
  • Benchmark gaming - guardian agent checks for benchmark-specific tuning vs genuine improvement

Plugins

Plugin Skills Description
auto-build-claw 3 Autonomous build iteration orchestrator with multi-agent review
devils-advocate 4 Critical document analysis with persona-driven risk scoring

auto-build-claw

Runs structured multi-iteration development cycles where each iteration passes through a full phase lifecycle with quality gates. A program defines what to build, a benchmark measures progress, and the engine enforces the workflow until the objective is met or iterations are exhausted.

Skills: auto-build-claw (orchestrator), program-writer, benchmark-writer

Workflow types

Type Phases Use when
full RESEARCH -> HYPOTHESIS -> PLAN -> IMPLEMENT -> TEST -> REVIEW -> RECORD -> NEXT Feature work, improvements
fast PLAN -> IMPLEMENT -> TEST -> REVIEW -> RECORD -> NEXT Clear objective, no exploration needed
gc PLAN -> IMPLEMENT -> TEST -> RECORD -> NEXT Cleanup, refactoring
hotfix IMPLEMENT -> TEST -> RECORD Targeted bug fix
planning RESEARCH -> PLAN -> RECORD -> NEXT Work breakdown (auto-chains before full)

Usage

# Describe what you want - the plugin handles the rest
/auto-build-claw improve error handling in the API layer

The plugin writes PROGRAM.md and BENCHMARK.md from your prompt, asks you to approve, then runs the orchestrator autonomously.

See auto-build-claw/README.md for the full phase lifecycle, agent architecture, and configuration details.

devils-advocate

Systematically critiques documents from the perspective of their toughest audience. Builds a devil persona, harvests verifiable facts, generates a risk-scored concern catalogue, and iterates corrections until residual risk is acceptable.

Skills: setup (build persona + fact repository), evaluate (concern catalogue + baseline scorecard), iterate (apply corrections or re-score), run (full workflow end-to-end)

Risk scoring uses a Fibonacci scale (1-8) for likelihood and impact, producing risk scores from 1-64. Each concern is scored 0-100% on how well the document addresses it, and the residual risk (what remains unaddressed) drives iteration priority.

Usage

# Full end-to-end workflow
/devils-advocate:run

# Step by step
/devils-advocate:setup       # Build persona, harvest facts
/devils-advocate:evaluate    # Generate concerns + baseline scorecard
/devils-advocate:iterate     # Apply corrections, re-score (repeat)

See devils-advocate/README.md for scoring formula details, artefact format, and the full concern catalogue methodology.

Install

pip install stellars-claude-code-plugins

As a Claude Code plugin marketplace:

/plugin marketplace add stellarshenson/claude-code-plugins

Architecture

stellars_claude_code_plugins/          # Shared engine (pip installable)
  engine/
    fsm.py                             # Phase lifecycle state machine
    model.py                           # Typed YAML model loader + validator
    orchestrator.py                    # Complete orchestration engine
    resources/
      workflow.yaml                    # Default iteration types and phase sequences
      phases.yaml                      # Default phase templates, agents, gates
      app.yaml                         # Default display text and CLI config

auto-build-claw/                       # Plugin: autonomous build iterations
  .claude-plugin/plugin.json           # Plugin registration
  skills/
    auto-build-claw/SKILL.md           # Orchestrator skill definition
    program-writer/SKILL.md            # Program definition skill
    benchmark-writer/SKILL.md          # Benchmark definition skill

devils-advocate/                       # Plugin: critical document analysis
  .claude-plugin/plugin.json           # Plugin registration
  skills/
    setup/SKILL.md                     # Build persona + fact repository
    evaluate/SKILL.md                  # Concern catalogue + scorecard
    iterate/SKILL.md                   # Apply corrections, re-score
    run/SKILL.md                       # Full workflow end-to-end

.claude-plugin/marketplace.json        # Plugin marketplace registry

Building a new plugin

Plugins are pure configuration - no Python code required. Create a directory with skills and register it in the marketplace:

my-plugin/
  .claude-plugin/plugin.json           # Plugin registration and skill triggers
  skills/
    my-skill/SKILL.md                  # Skill definition with description and instructions

The plugin.json registers your skills with Claude Code, defining when they trigger and what tools they have access to. Each SKILL.md contains the instructions Claude follows when the skill is invoked. The shared orchestration engine (pip install stellars-claude-code-plugins) provides the orchestrate CLI command that handles state management, FSM transitions, gate execution, and audit logging.

Register your plugin in the marketplace by adding an entry to .claude-plugin/marketplace.json.

Development

make install          # create venv, install deps, editable install
make test             # run 212 tests
make lint             # ruff format + check
make format           # auto-fix formatting
make build            # clean, test, bump version, build wheel
make publish          # build + twine upload to PyPI

License

MIT License

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

stellars_claude_code_plugins-0.8.64-py3-none-any.whl (74.5 kB view details)

Uploaded Python 3

File details

Details for the file stellars_claude_code_plugins-0.8.64-py3-none-any.whl.

File metadata

File hashes

Hashes for stellars_claude_code_plugins-0.8.64-py3-none-any.whl
Algorithm Hash digest
SHA256 a48b93e82ce1278795ec0bfa6de933fe7e460f105dff38c08760e13418e7521a
MD5 53ab5dbd876958a0415087162a047b9f
BLAKE2b-256 d1b295f7024ab668d308b0d5874afce44fbabc4a1fa43585ad5c2a06bc7d844e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page