
Actor-critic loop for AI coding agents — orchestrate actors and critics via CLI backends


spike-claude-code-actor-critics

Spike — this is a learning experiment, not production software. The goal is to explore the actor-critic pattern, understand how it can refine output quality, and build practical intuition for using it effectively.

Overview

This CLI tool orchestrates an actor and multiple specialised critics by invoking the Claude Code executable as a subprocess. Each role has its own dedicated Claude configuration. Communication uses structured output.

  1. The actor generates output (e.g. code, text, a plan)
  2. Critics review that output — each focuses on a specific quality dimension
  3. The actor refines based on critic feedback
  4. The loop repeats until a stopping condition is met

The right orchestration strategy for multiple critics and the best stopping conditions (fixed rounds, critic green-light, consensus, etc.) are open questions this spike aims to answer.
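The four steps above can be sketched as a small Python loop. This is an illustrative sketch only — the function names and `Verdict` shape are assumptions for this example, not the project's actual interfaces (those live in `actor_critic.*`):

```python
from dataclasses import dataclass

@dataclass
class Verdict:
    critic: str
    passed: bool
    review: str

def run_loop(actor, critics, max_rounds=3):
    """Run actor -> critics -> refine until all critics pass or rounds run out."""
    feedback = []
    for round_num in range(1, max_rounds + 1):
        output = actor(feedback)                 # 1. actor generates (or refines) output
        verdicts = [c(output) for c in critics]  # 2. each critic reviews one dimension
        if all(v.passed for v in verdicts):      # 4. stopping condition (here: all pass)
            return output, round_num, "all critics passed"
        feedback = [v.review for v in verdicts if not v.passed]  # 3. feed back failures
    return output, max_rounds, "max rounds exhausted"
```

The stopping condition shown (`all pass`) is just one of the candidates the spike compares.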

Installation

Install from PyPI:

pip install actor-critic-loop
# or with uv
uv add actor-critic-loop

This installs the actor-critic CLI command. You still need at least one supported coding agent CLI (see below).

Prerequisites

  • Python 3.13+
  • uv — installation guide
  • At least one supported coding agent CLI:
    • Claude Code (default) — installed and authenticated (claude login)
    • Codex — npm i -g @openai/codex
    • OpenCode — https://opencode.ai
    • pi-coding-agent — npm i -g @mariozechner/pi-coding-agent

Setup

git clone https://github.com/ondrasek/spike-claude-code-actor-critics.git
cd spike-claude-code-actor-critics
uv sync

Usage

Configure actor and critics

uv run actor-critic init ./my-task \
  --actor actor_critic.actors.basic \
  --critic correctness=actor_critic.critics.basic \
  --critic style=actor_critic.critics.basic \
  --orchestrator actor_critic.orchestrators.sequential \
  --stop-condition actor_critic.stop_conditions.fixed_rounds

This scaffolds a workspace at ./my-task with dedicated folders for the actor and each critic, containing their Claude Code configurations (CLAUDE.md, settings, skills, subagents). Edit actor-critic.yaml and role configs after init to tune behaviour.

Scaffold from a template

Use --template to clone an existing workspace (e.g. one of the examples) as a starting point:

# Clone an example as your starting point
uv run actor-critic init ./my-task --template ./examples/hello-world/template

# Override specific values while using the template structure
uv run actor-critic init ./my-task --template ./examples/code-review/template --max-rounds 5

# Quick scaffold with defaults (no template needed)
uv run actor-critic init ./my-task

When using --template, the --actor and --critic flags are not allowed (the template defines those). You can still override --orchestrator, --stop-condition, and --max-rounds.

Wizard mode

Use wizard to scaffold a workspace and immediately launch Claude Code for interactive setup:

uv run actor-critic wizard ./my-task

The wizard creates the workspace structure, installs the /customize-workspace skill, and opens Claude Code. Claude guides you through configuring the actor, designing critics, writing prompts, and installing plugins — all interactively. If the workspace already exists, the wizard skips scaffolding and launches Claude Code directly.

Run the actor-critic loop

Place source material in the workspace's input/ directory, then run:

# Basic run (reads everything from actor-critic.yaml)
uv run actor-critic run --workspace ./my-task

# With additional prompt (appended to input/prompt.md)
uv run actor-critic run --workspace ./my-task --prompt "Focus on clarity"

# With verbose output (INFO logging)
uv run actor-critic run --workspace ./my-task --verbose

All configuration — actor, critics, orchestrator, stop condition, and max rounds — lives in actor-critic.yaml at the workspace root.

CLI flags

| Flag | Short | Default | Description |
| --- | --- | --- | --- |
| --workspace | | (required) | Path to workspace directory (must contain actor-critic.yaml) |
| --prompt | | | Additional prompt (appended to input/prompt.md) |
| --verbose | -v | | Enable INFO-level logging |
| --backend | | | Override backend for all roles (e.g. claude, codex, opencode, pi) |

Use --verbose when debugging issues. Claude Code is always invoked with streaming mode (--output-format stream-json --verbose), which provides real-time diagnostics.

What verbose mode shows:

  • Session initialization (model, tools available)
  • Tool calls as they happen (Read, Edit, Bash, etc.)
  • Final result with token usage and cost

All I/O uses workspace directory conventions — no external path flags:

| Directory | Purpose |
| --- | --- |
| <workspace>/input/ | Source material + optional prompt.md (user-provided) |
| <workspace>/output/ | Final output (written by orchestrator after last round) |
| <workspace>/actor/output/ | Actor's working output area (intermediate, per-round) |
| <workspace>/critics/<name>/output/ | Critic's optional output artifacts (e.g. test files, diffs) |

Exit codes

| Code | Meaning |
| --- | --- |
| 0 | Final verdict is PASS (all critics satisfied) |
| 1 | Final verdict is FAIL (max rounds exhausted without satisfying stop condition) |
| 2 | Config, resolver, or runtime error (malformed YAML, missing directories, Claude Code subprocess failure) |
| 130 | Interrupted by user (Ctrl+C) |

CLI output

After a run, the CLI displays:

  1. Final Output — the actor's last stdout
  2. Artifacts — files created/modified by the actor (if any)
  3. Round Summary — per-round table showing timing and critic verdicts (PASS/FAIL/PASS*)
  4. Critic Verdicts (Final Round) — per-critic verdict with explanation (reconciled verdicts show [reconciled: reason])
  5. Result — rounds completed, final verdict (colored), stop reason, total time

PASS* (yellow) indicates a verdict that was reconciled via re-evaluation. This occurs when the orchestrator detects oscillation (a critic cycling between PASS and FAIL) or goalpost-shifting (consecutive FAILs with different issues each round). The suspect critic is re-invoked with its own verdict history and asked to resolve the contradiction.

Note: The exact CLI interface will evolve as the spike progresses.

Multi-Backend Configuration

The actor-critic loop supports multiple coding agent CLIs as backends. Each role (actor, critic) can use a different backend:

# actor-critic.yaml
actor:
  impl: actor_critic.actors.basic
  backend: codex          # Use Codex for the actor

critics:
  - name: quality
    impl: actor_critic.critics.basic
    backend: claude       # Use Claude for the critic

Supported backends:

| Backend | CLI | --add-dir | JSON schema | Thinking control |
| --- | --- | --- | --- | --- |
| claude (default) | claude | Yes | Yes (native) | Yes |
| codex | codex | Yes | Yes (native) | No |
| opencode | opencode | No (warning) | No (warning) | No |
| pi | pi | No (warning) | No (warning) | No |

Override all backends from the CLI: --backend codex

Features not supported by a backend degrade gracefully with logged warnings.
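Graceful degradation can be sketched directly from the support table above. The capability names and `check_feature` helper below are illustrative assumptions, not the project's actual internals:

```python
import logging

# Capability matrix mirroring the backend support table above.
CAPABILITIES = {
    "claude":   {"add_dir": True,  "json_schema": True,  "thinking": True},
    "codex":    {"add_dir": True,  "json_schema": True,  "thinking": False},
    "opencode": {"add_dir": False, "json_schema": False, "thinking": False},
    "pi":       {"add_dir": False, "json_schema": False, "thinking": False},
}

def check_feature(backend: str, feature: str) -> bool:
    """Return True if the backend supports the feature; warn and degrade otherwise."""
    supported = CAPABILITIES[backend][feature]
    if not supported:
        logging.warning("backend %r does not support %s; skipping", backend, feature)
    return supported
```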

Key Design Decisions

  • Claude Code as subprocess, not SDK — keeps the tool decoupled from API changes; each role runs in a full Claude Code environment with its own config, skills, and subagents
  • Structured output — implementations control how Claude Code output is formatted and parsed (e.g. --output-format json, stream-json, --json-schema, or plain text)
  • Separate configuration per role — each actor/critic gets its own CLAUDE.md, settings, skills, and subagents so roles can be tuned independently without interference
  • Self-explanatory configuration — every value in actor-critic.yaml should be understandable without reading the source code; use fully qualified names over short aliases
  • Plugins over MCP servers — prefer official Anthropic plugins, custom skills, and rules over MCP servers for simplicity and reliability
  • No future-proofing — don't add parameters, code, or abstractions for features planned in later phases; add them when needed

Plugins, Skills, and Rules

Role workspaces can be enhanced with Claude Code capabilities in priority order:

  1. Rules — auto-loaded guidance in .claude/rules/ (always use output.md to constrain writes)
  2. Skills — custom slash commands in .claude/skills/ for specific workflows
  3. Plugins — official Anthropic plugins for code intelligence and review
  4. MCP servers — avoid when possible; use only for external service integration

Installing Plugins

# Install plugins to actor workspace (project scope)
cd my-task/actor
claude plugin install pyright-lsp --scope project
claude plugin install code-simplifier --scope project

# Install plugins to critic workspace
cd my-task/critics/correctness
claude plugin install code-review --scope project
claude plugin install security-guidance --scope project

Recommended Plugins

| Role type | Recommended plugins |
| --- | --- |
| Python actors | pyright-lsp, code-simplifier |
| TypeScript actors | typescript-lsp, code-simplifier |
| Code critics | code-review, security-guidance |
| Style critics | pr-review-toolkit |

Skills and Rules

Skills provide custom slash commands:

.claude/skills/
└── generate/
    └── SKILL.md    # Instructions for /generate command

Rules provide auto-loaded guidance:

.claude/rules/
├── output.md       # Required: constrain writes to ./output/
└── style.md        # Optional: code style guidelines

See plans/PLUGINS-SKILLS-RULES.md for complete documentation.

Invocation Model

Each actor and critic is invoked as a Claude Code subprocess in non-interactive mode (claude -p). The orchestrator runs Claude Code from the role's workspace directory so it picks up the role's CLAUDE.md, settings, skills, and subagents. Roles access shared directories via --add-dir.

Actor invocation:

cd ./my-task/actor && claude -p "the prompt and critic feedback" \
  --add-dir ../input \
  --add-dir ../critics/correctness/output \
  --add-dir ../critics/style/output

Critic invocation:

cd ./my-task/critics/correctness && claude -p "<rendered prompt>" \
  --add-dir ../../input --add-dir ../../actor/output
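An orchestrator might assemble the invocations above with `subprocess`. A minimal sketch — `build_claude_cmd` is a hypothetical helper, though the `claude -p` and `--add-dir` flags are the ones shown above:

```python
import subprocess

def build_claude_cmd(prompt: str, add_dirs: list[str]) -> list[str]:
    """Build the argv for a non-interactive claude invocation."""
    cmd = ["claude", "-p", prompt]
    for d in add_dirs:
        cmd += ["--add-dir", d]  # grant access to each shared directory
    return cmd

cmd = build_claude_cmd("<rendered prompt>", ["../../input", "../../actor/output"])
# Run from the role's workspace so its CLAUDE.md and settings are picked up:
# subprocess.run(cmd, cwd="./my-task/critics/correctness", capture_output=True, text=True)
```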

Critic prompts are Jinja2 templates stored in critics/<name>/input/prompt.md. At runtime, the template is rendered with context about the actor's work:

| Variable | Type | Description |
| --- | --- | --- |
| actor_output_files | list[str] | Filenames in actor/output/ (excluding .gitkeep) |
| actor_prompt_file | str | Absolute resolved path to the actor's input/prompt.md |
| actor_prompt | str | Verbatim contents of the actor's input/prompt.md (empty string if missing) |
| actor_output_text | str | Actor's stdout text from Claude Code (empty string on first round) |
| critic_output_dirs | dict[str, str] | Mapping of critic name to absolute output dir path |
| round_num | int | Current round number (1-indexed) |
| previous_actor_outputs | list[str] | Actor stdout text from all previous rounds (empty on round 1) |
| previous_output_dirs | list[str] | Absolute paths to snapshots/actor/round-{N}/ directories |

Templates use XML sections following Anthropic prompt engineering guidelines (data/context first, instructions in middle, reminder at end):

<actor_task>
The original requirements given to the actor are in `{{ actor_prompt_file }}`.
</actor_task>

<actor_output>
The actor produced the following files:
{% for file in actor_output_files %}
- `{{ file }}`
{% endfor %}
</actor_output>

<review_instructions>
[critic-specific evaluation criteria]
</review_instructions>

<reminder>
- Ground your review in specific evidence.
- Your response MUST be valid JSON with `review`, `passed`, and optionally `issues`.
</reminder>

Every critic must provide its own template — there is no default fallback.
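Rendering such a template with the variables listed above is standard Jinja2. A minimal sketch (the template string here is a toy, not one of the project's templates):

```python
from jinja2 import Template

template = Template(
    "Round {{ round_num }}: review these files:\n"
    "{% for file in actor_output_files %}- {{ file }}\n{% endfor %}"
)
rendered = template.render(
    round_num=1,
    actor_output_files=["main.py", "README.md"],
)
```

In the real loop, the template text is read from `critics/<name>/input/prompt.md` and rendered with the full variable context each round.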

The actor writes its working output to <workspace>/actor/output/. Critics review it there via --add-dir and may produce optional artifacts in their own output/ directories. The actor can access critic artifacts via --add-dir to each critic's output/. After the final round, the orchestrator copies the actor's output to <workspace>/output/.

Each role is constrained to write only to its own output/ subdirectory via permissions in settings.json and a rule in .claude/rules/output.md.

Output format, schema enforcement, and response parsing are implementation-specific — each actor/critic implementation decides which Claude Code flags to use.

Session strategy

Whether a role resumes its Claude Code session across rounds or starts fresh is configurable. Both approaches have trade-offs:

| Strategy | Benefits | Risks |
| --- | --- | --- |
| resume | Full history of prior attempts; no need to re-pass context | Past feedback may anchor or bias; context window fills over many rounds |
| fresh | Clean slate each round; predictable context size | Must re-pass everything explicitly; may repeat failed approaches |

The right strategy depends on the role and the task — this is one of the things the spike aims to explore empirically. Session strategy is configured per role in actor-critic.yaml.
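As a sketch, per-role configuration might look like the following — note the `session` key name is an assumption for illustration, not the project's confirmed schema:

```yaml
# actor-critic.yaml (hypothetical fragment)
actor:
  impl: actor_critic.actors.basic
  session: resume     # hypothetical key: keep one Claude Code session across rounds

critics:
  - name: correctness
    impl: actor_critic.critics.basic
    session: fresh    # hypothetical key: start a new session every round
```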

Oscillation prevention

The most significant failure mode in actor-critic loops is critic oscillation — where fixing one critic's feedback breaks another's requirements, or a single critic cycles between PASS and FAIL across rounds. The orchestrator includes automatic oscillation detection and reconciliation:

Two triggers:

| Trigger | Default threshold | What it detects |
| --- | --- | --- |
| Verdict reversal | 2 reversals | Critic cycling PASS→FAIL→PASS→FAIL |
| Consecutive fails | 3 rounds | Same critic failing with different issues each round (goalpost-shifting) |

How reconciliation works:

  1. After each round, the orchestrator checks failing critics for suspicious patterns
  2. Suspect critics are re-invoked with their own verdict history and a reconciliation prompt
  3. The critic reviews its prior verdicts and determines whether the current FAIL is genuine or self-contradictory
  4. If the critic reconciles to PASS, the verdict is updated and marked as PASS* (reconciled)

Configuration (in actor-critic.yaml):

stop_condition:
  config:
    max_rounds: 5
    max_consecutive_fails: 4   # default: 3
    oscillation_reversals: 3   # default: 2

Reconciliation adds zero overhead on clean runs — detection is a pure function over round summaries. Only triggered critics incur an extra invocation.
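The two triggers above can be sketched as a pure function over a critic's verdict history (`True` = PASS, `False` = FAIL). The function names and signature are illustrative, not the project's internals:

```python
def count_reversals(history: list[bool]) -> int:
    """Number of PASS<->FAIL flips between consecutive rounds."""
    return sum(1 for a, b in zip(history, history[1:]) if a != b)

def needs_reconciliation(history, issues, reversals=2, consecutive_fails=3):
    """Flag a critic whose verdict pattern looks self-contradictory."""
    # Trigger 1: verdict reversal (PASS -> FAIL -> PASS -> FAIL ...)
    if count_reversals(history) >= reversals:
        return True
    # Trigger 2: N consecutive FAILs, each citing different issues (goalpost-shifting)
    tail = history[-consecutive_fails:]
    tail_issues = issues[-consecutive_fails:]
    if (len(tail) == consecutive_fails and not any(tail)
            and len(set(tail_issues)) == consecutive_fails):
        return True
    return False
```

Because this is a pure function over round summaries, a clean run (no suspicious pattern) never triggers the extra critic invocation.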

Project Structure

├── src/
│   └── actor_critic/
│       ├── __init__.py
│       ├── cli.py
│       ├── types.py
│       ├── claude/                # Shared Claude Code invocation
│       │   ├── __init__.py
│       │   ├── invoke.py          # Core invoke() function
│       │   ├── events.py          # NDJSON event parsing
│       │   └── types.py           # ClaudeResult, ClaudeEvent
│       ├── config.py              # YAML config loading
│       ├── resolver.py            # Dynamic module import
│       ├── scaffold.py            # Workspace scaffolding (init command)
│       ├── scaffold_templates/    # Built-in templates for init
│       ├── backends/              # Coding agent CLI backends
│       │   ├── base.py            # BackendResult, BackendEvent
│       │   ├── resolve.py         # Name → module resolver
│       │   ├── claude.py          # Claude Code (wraps claude/invoke.py)
│       │   ├── codex.py           # OpenAI Codex CLI
│       │   ├── opencode.py        # OpenCode CLI
│       │   └── pi.py              # pi-coding-agent CLI
│       ├── actors/
│       │   ├── __init__.py
│       │   ├── basic.py
│       │   └── team.py            # Agent teams (parallel work)
│       ├── critics/
│       │   ├── __init__.py
│       │   └── basic.py
│       ├── orchestrators/
│       │   ├── __init__.py
│       │   ├── sequential.py
│       │   ├── oscillation.py     # Oscillation detection + reconciliation
│       │   └── feedback.py        # Critic feedback serialization
│       └── stop_conditions/
│           ├── __init__.py
│           ├── all_pass.py        # Stop when all critics pass
│           ├── any_pass.py        # Stop when any critic passes
│           ├── fixed_rounds.py    # Stop at max rounds
│           └── quorum.py          # Stop when N critics pass
├── tests/
│   ├── system/
│   ├── integration/
│   └── unit/
├── pyproject.toml
└── README.md

Note: The structure above is the planned/target layout for this spike; not all directories or files may exist yet in the repository.

Example workspace

A workspace is created by actor-critic init and contains the full Claude Code configuration for each role:

my-task/
├── actor-critic.yaml
├── input/                 # Source material (user-provided before run)
├── output/                # Final output (written by orchestrator)
├── snapshots/               # Created at runtime — actor output history
│   └── actor/
│       ├── round-1/         # Copy of actor/output/ after round 1
│       └── round-2/         # Copy of actor/output/ after round 2
├── actor/
│   ├── output/            # Actor's working output (intermediate, per-round)
│   ├── CLAUDE.md
│   └── .claude/
│       ├── settings.json  # Permissions: writes to ./output/ only
│       ├── rules/
│       │   └── output.md  # "Write all output files to ./output/"
│       ├── agents/
│       │   └── researcher.md
│       ├── commands/
│       │   └── generate.md
│       └── skills/
│           └── code-generation/
│               └── SKILL.md
└── critics/
    ├── correctness/
    │   ├── input/
    │   │   └── prompt.md  # Jinja2 template — critic review prompt
    │   ├── output/        # Critic's optional output artifacts
    │   ├── CLAUDE.md
    │   └── .claude/
    │       ├── settings.json
    │       ├── rules/
    │       │   └── output.md
    │       ├── agents/
    │       │   └── test-verifier.md
    │       └── skills/
    │           └── verify-logic/
    │               └── SKILL.md
    └── style/
        ├── input/
        │   └── prompt.md  # Jinja2 template — critic review prompt
        ├── output/
        ├── CLAUDE.md
        └── .claude/
            ├── settings.json
            ├── rules/
            │   └── output.md
            └── skills/
                └── check-style/
                    └── SKILL.md

actor-critic.yaml

Top-level orchestration configuration that ties the workspace together:

actor:
  path: ./actor
  impl: actor_critic.actors.basic

critics:
  - name: correctness
    path: ./critics/correctness
    impl: actor_critic.critics.basic
  - name: style
    path: ./critics/style
    impl: actor_critic.critics.basic

orchestrator:
  impl: actor_critic.orchestrators.sequential

stop_condition:
  impl: actor_critic.stop_conditions.all_pass
  config:
    max_rounds: 3
    max_consecutive_fails: 3    # optional: trigger reconciliation after N consecutive FAILs
    oscillation_reversals: 2    # optional: trigger reconciliation after N verdict reversals
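The stop-condition modules referenced in the config can be sketched as simple predicates over critic verdicts. The signature below is an assumption for illustration — the real contract is defined in `actor_critic.stop_conditions.*`:

```python
def all_pass(verdicts: dict[str, bool], round_num: int, max_rounds: int) -> bool:
    """Stop when every critic passed, or when rounds are exhausted."""
    return all(verdicts.values()) or round_num >= max_rounds

def quorum(verdicts: dict[str, bool], round_num: int, max_rounds: int, n: int = 1) -> bool:
    """Stop when at least n critics passed, or when rounds are exhausted."""
    return sum(verdicts.values()) >= n or round_num >= max_rounds
```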

Testing

Tests are run with pytest and organised into:

  • CLI / system tests — end-to-end verification of the console interface
  • Integration tests — interaction between actor and critic components
  • Unit tests — where they add clear value

# Run all tests with coverage
uv run pytest --cov

Quality Gate Hooks

This project uses Claude Code hooks to enforce code quality standards. When Claude completes work, a quality gate automatically runs and feeds any failures back to Claude for fixing, creating an iterative loop until all checks pass.

Iterative feedback loop

┌─────────────────────────────────────────────────────────────────┐
│                    ITERATIVE FEEDBACK LOOP                      │
├─────────────────────────────────────────────────────────────────┤
│                                                                 │
│   Claude completes work                                         │
│         │                                                       │
│         ▼                                                       │
│   Stop event fires ──► Quality gate hook runs                   │
│         │                                                       │
│         ├── Any check fails ──► Exit 2 ──► Claude fixes ──┐     │
│         │                                                 │     │
│         ▼                                                 │     │
│   All checks pass ──► Auto-commit hook runs               │     │
│         │                                                 │     │
│         ├── Pre-commit fails ──► Exit 2 ──► Claude fixes ─┤     │
│         │                                                 │     │
│         ▼                                                 │     │
│   Commit + push ──► Session ends cleanly                  │     │
│                                                           │     │
│         ◄─────────────── Stop event fires again ──────────┘     │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘

The hooks are configured in .claude/settings.json and are mandatory for all developers. Exit code 2 feeds stderr back to Claude, prompting automatic fixes.

Quality checks

| Tool | Purpose | Rationale |
| --- | --- | --- |
| pytest | Test execution (fail-fast) | Core TDD requirement |
| pytest-cov | 80% coverage threshold | Enforces test completeness |
| ruff check | Linting | Fast, replaces flake8/pylint |
| ruff format | Formatting verification | Fast, replaces black/isort |
| pyright | Static type checking | Faster than mypy, stricter |
| bandit | Security scanning (medium+ severity) | Catches OWASP vulnerabilities |
| vulture | Dead code detection (80% confidence) | Keeps codebase clean |

Environment variable overrides

Developers can temporarily adjust checks without modifying shared configuration:

| Variable | Default | Purpose |
| --- | --- | --- |
| COVERAGE_THRESHOLD | 80 | Minimum coverage percentage |
| SKIP_TYPE_CHECK | unset | Set to 1 to skip pyright |
| SKIP_SECURITY_SCAN | unset | Set to 1 to skip bandit |
| SKIP_DEAD_CODE | unset | Set to 1 to skip vulture |

# Skip slow checks during rapid iteration
SKIP_TYPE_CHECK=1 SKIP_SECURITY_SCAN=1 claude

# Lower coverage threshold temporarily
COVERAGE_THRESHOLD=60 claude

Running checks manually

# Run the full quality gate
./.claude/hooks/quality-gate.sh

# Run individual checks
uv run pytest -x --tb=short
uv run ruff check src/ tests/
uv run ruff format --check src/ tests/
uv run pyright src/
uv run bandit -r src/ -ll
uv run vulture src/ --min-confidence 80

Hook execution order

PreToolUse (before every Write/Edit/MultiEdit):

  1. tdd-gate.sh — blocks implementation writes without a failing test (60s timeout)

PostToolUse (after every Write/Edit/MultiEdit):

  1. per-edit-fix.sh — ruff fix + format + codespell (60s timeout)
  2. post-edit-tests.sh — runs pytest for immediate feedback (120s timeout)

Stop (when Claude finishes):

  1. Code review agent — inspects changed files (120s timeout)
  2. quality-gate.sh — full test suite + lint + types (10 min timeout)
  3. auto-commit.sh — commit + push if clean (4 min timeout)

If any hook returns exit 2, Claude receives stderr and fixes issues. The loop continues until all checks pass.

TDD workflow with hooks

  1. Write failing test → Claude writes test file (tdd-gate.sh allows test files)
  2. Post-edit tests run → post-edit-tests.sh reports the new test fails (expected)
  3. Write implementation → tdd-gate.sh allows (failing test exists)
  4. Post-edit tests run → post-edit-tests.sh reports pass/fail immediately
  5. Claude finishes → Code review agent inspects changes
  6. Quality gate → Full suite + lint + types + coverage
  7. All pass → Auto-commit, session complete

TDD hook environment variables

| Variable | Default | Purpose |
| --- | --- | --- |
| SKIP_TDD_GATE | unset | Set to 1 to disable test-first enforcement |
| SKIP_POST_EDIT_TESTS | unset | Set to 1 to disable post-edit test runner |

# Disable TDD gate during non-code tasks
SKIP_TDD_GATE=1 claude

# Disable post-edit tests during rapid iteration
SKIP_POST_EDIT_TESTS=1 claude

# Disable both for non-TDD workflows
SKIP_TDD_GATE=1 SKIP_POST_EDIT_TESTS=1 claude

Future

See docs/FUTURE.md.
