Who guards the agents? A framework for orchestrating AI coding agents through verified implementation phases.

Project description

Juvenal

Quis custodiet ipsos agentes? — Who guards the agents?

Juvenal

Juvenal is a framework for orchestrating AI coding agents through verified implementation phases. It prevents agents from cheating on success criteria, helps agents implement complex projects in phases, etc.

The Problem

Agents such at giant problems. This is probably only a temporary problem, but for now, an AI agent given a massive problem will fumble it. It'll take shortcuts, lie, cheat, steal, the works.

The Solution

There's no honor among agents! Agent B feels no obligation to cover for some shortcut that Agent A made. This makes an implementation-verification loop with separate agents pretty effective for catching cut corners. When Agent B catches Agent A's shoddy work, Agent C can be spun up to implement fixes, and so on.

How It Works

A non-agentic Python script orchestrates AI coding agents (Claude or Codex) through alternating steps:

Implementation — an agent executes a prompt to build/modify code
Verification — separate checkers (scripts, agents, or both) verify the work
Bounce — if verification fails, the pipeline bounces back (to a configurable target phase or the most recent implement phase) with failure context injected. A global bounce limit (max_bounces) prevents infinite loops.

The implementing agent and the checking agent are separate processes, so the implementer can't cheat by weakening tests, etc.

Other Such Frameworks

Juvenal is conceptually similar to ralph, but it works slightly better for my exact purposes and reinventing the wheel is cheap now!

Install

pip install -e ".[dev]"

Claude Code Skill

Juvenal ships as a Claude Code plugin, so you can use it directly from Claude Code with /juvenal.

Install the plugin

From the marketplace (pending approval):

/plugin install juvenal

From source (works now):

claude --plugin-dir /path/to/juvenal/plugin

Usage

Once installed, invoke the skill in Claude Code:

/juvenal add authentication to the Flask app

Claude will create a Juvenal workflow for your goal and run it. You can also ask for help with workflow formats or run existing workflows.

Quick Start

# Scaffold a workflow
juvenal init my-project

# Run a workflow
juvenal run workflow.yaml

# Generate a workflow from a goal
juvenal plan "implement a REST API with tests" -o workflow.yaml

# Plan and immediately run
juvenal do "add authentication to the Flask app"

Workflow Formats

YAML

name: "my-workflow"
backend: claude
max_bounces: 999

phases:
  - id: implement
    prompt: "Implement the feature."
    checkers:
      - type: script
        run: "pytest tests/ -x"
      - type: agent
        role: tester

Directory Convention

my-workflow/
  phases/
    01-setup/
      prompt.md            # implementation prompt
      check-build.sh       # script checker (exit 0 = pass)
      check-quality.md     # agent checker
    02-implement/
      prompt.md
      check-tests.sh       # paired with .md = composite
      check-tests.md       # gets {script_output} injected

Bare Markdown

phases/
  01-setup.md              # single phase, default tester checker

Checker Types

Type	Description
`script`	Shell command; exit 0 = PASS, nonzero = FAIL
`agent`	AI agent that emits `VERDICT: PASS` or `VERDICT: FAIL: reason`
`composite`	Script runs first, output fed to agent via `{script_output}`

Built-in Roles

Agent checkers can use built-in verification personas:

tester — runs tests, checks for build errors
architect — validates design, checks for circular dependencies
pm — confirms requirements are met, no TODOs remain
senior-tester — checks test integrity, looks for cheating
senior-engineer — reviews code quality, completeness, security

CLI

juvenal run <workflow> [--resume] [--rewind N] [--rewind-to PHASE_ID] [--phase X]
                       [--max-bounces N] [--backend claude|codex] [--dry-run]
                       [--backoff SECONDS] [--notify WEBHOOK_URL]
                       [--working-dir DIR] [--state-file PATH]
juvenal plan "goal" [-o output.yaml] [--backend claude|codex]
juvenal do "goal" [--backend claude|codex] [--max-bounces N]
juvenal status [--state-file path]
juvenal init [directory] [--template name]
juvenal validate <workflow>

Resume & Rewind

# Resume from last saved state
juvenal run workflow.yaml --resume

# Rewind 2 phases back from the resume point
juvenal run workflow.yaml --rewind 2

# Rewind to a specific phase by ID
juvenal run workflow.yaml --rewind-to setup

--rewind and --rewind-to implicitly load existing state (no need for --resume) and invalidate from the target phase onward so everything from that point gets re-executed.

License

MIT

Project details

Release history Release notifications | RSS feed

0.28.25

Apr 26, 2026

0.28.24

Apr 18, 2026

0.28.23

Apr 15, 2026

0.28.22

Apr 12, 2026

0.28.21

Apr 11, 2026

0.28.20

Apr 11, 2026

0.28.19

Apr 11, 2026

0.28.18

Apr 11, 2026

0.28.17

Apr 11, 2026

0.28.16

Apr 7, 2026

0.28.15

Apr 5, 2026

0.28.14

Apr 3, 2026

0.28.13

Apr 3, 2026

0.28.12

Apr 3, 2026

0.28.11

Apr 3, 2026

0.28.10

Apr 1, 2026

0.28.9

Mar 26, 2026

0.28.8

Mar 24, 2026

0.28.7

Mar 24, 2026

0.28.6

Mar 19, 2026

0.28.5

Mar 19, 2026

0.28.4

Mar 18, 2026

0.28.3

Mar 17, 2026

0.28.2

Mar 17, 2026

0.28.1

Mar 17, 2026

0.28.0

Mar 17, 2026

0.27.4

Mar 17, 2026

0.27.3

Mar 17, 2026

0.27.2

Mar 17, 2026

0.27.1

Mar 17, 2026

0.27.0

Mar 17, 2026

0.26.0

Mar 14, 2026

0.25.0

Mar 14, 2026

0.24.0

Mar 14, 2026

0.23.2

Mar 14, 2026

0.23.1

Mar 14, 2026

0.23.0

Mar 14, 2026

0.22.0

Mar 14, 2026

0.21.0

Mar 14, 2026

0.20.0

Mar 14, 2026

0.19.1

Mar 14, 2026

0.19.0

Mar 14, 2026

0.18.6

Mar 14, 2026

0.18.5

Mar 14, 2026

0.18.4

Mar 14, 2026

0.18.3

Mar 14, 2026

0.18.2

Mar 14, 2026

0.18.1

Mar 13, 2026

0.18.0

Mar 13, 2026

0.17.0

Mar 13, 2026

0.16.0

Mar 13, 2026

0.15.0

Mar 13, 2026

0.14.0

Mar 12, 2026

0.13.2

Mar 12, 2026

0.13.1

Mar 12, 2026

0.13.0

Mar 11, 2026

0.12.0

Mar 11, 2026

0.11.0

Mar 11, 2026

0.10.2

Mar 9, 2026

0.10.1

Mar 9, 2026

This version

0.10.0

Mar 9, 2026

0.9.3

Mar 9, 2026

0.9.2

Mar 9, 2026

0.9.1

Mar 9, 2026

0.9.0

Mar 9, 2026

0.8.0

Mar 9, 2026

0.7.0

Mar 1, 2026

0.6.0

Mar 1, 2026

0.5.0

Mar 1, 2026

0.4.0

Mar 1, 2026

0.3.2

Mar 1, 2026

0.3.1

Mar 1, 2026

0.3.0

Mar 1, 2026

0.2.2

Mar 1, 2026

0.2.1

Feb 28, 2026

0.2.0

Feb 28, 2026

0.1.1

Feb 28, 2026

0.1.0

Feb 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

juvenal-0.10.0.tar.gz (57.9 kB view details)

Uploaded Mar 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

juvenal-0.10.0-py3-none-any.whl (52.9 kB view details)

Uploaded Mar 9, 2026 Python 3

File details

Details for the file juvenal-0.10.0.tar.gz.

File metadata

Download URL: juvenal-0.10.0.tar.gz
Upload date: Mar 9, 2026
Size: 57.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for juvenal-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`c89e470eb1725e46afb53880f7ace860af4f9001eb41c5530cd0244a1fc5a37d`
MD5	`f4e689a86a7e82a7f647092c409e1f23`
BLAKE2b-256	`846a89fd92f1201e437a25f973351247ad090f5dd874957a7697c5cefc7b95d6`

See more details on using hashes here.

File details

Details for the file juvenal-0.10.0-py3-none-any.whl.

File metadata

Download URL: juvenal-0.10.0-py3-none-any.whl
Upload date: Mar 9, 2026
Size: 52.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for juvenal-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f5cae585092a838beacd835be7467b753e812904d446b183b10baf4f0e17451e`
MD5	`be4e288598fd4335c6da7d279bedf952`
BLAKE2b-256	`ccb4c257d1360042550b974258078de3a834408dbbd4ba095da1ce5f24da02da`

See more details on using hashes here.

juvenal 0.10.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Juvenal

The Problem

The Solution

How It Works

Other Such Frameworks

Install

Claude Code Skill

Install the plugin

Usage

Quick Start

Workflow Formats

YAML

Directory Convention

Bare Markdown

Checker Types

Built-in Roles

CLI

Resume & Rewind

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes