A harness for delegating coding work to LLMs while keeping your main agent clean. Structured artifacts, long-running support, and swappable backends.

These details have not been verified by PyPI

Project links

Project description

code-delegation-harness

A simple harness for delegating coding work to LLMs while keeping your main agent clean.

It gives you structured output + reviewable artifacts (JSON + human report + patch) even on long-running tasks, and lets you swap different backends without rewriting your top-level orchestration.

Quick Start

For normal use:

pip install git+https://github.com/joshuafielden43/code-delegation-harness.git@v0.2.1
gcdh --help

For development or the absolute latest code, see the full installation guide.

Works best with Grok right now, but designed to be usable with other models too.

Why This Exists

Most agent setups mix your long-term context, style, and memory with the actual coding work. Over time that gets messy.

This tool tries to keep those concerns separate:

Your primary agent/persona stays yours.
Coding tasks get handed off to a focused execution environment.
You get back clean, reviewable results instead of another giant conversation.

It also gives you the option to swap the backend harness later without having to change how you orchestrate things at the top level.

Current Output

When you run a task with --output-file, you get:

result.json — structured data
result.report.md — human-readable review document with diffs and observations
result.patch — ready-to-apply unified diff (when code changed)

The goal is simple: you stay you. The work gets done well. And you only have to review the actual result.

Current Output (What You Actually Review)

When you delegate with --output-file, the harness produces:

result.json — full structured data (status, files changed, summaries, errors)
result.report.md — the primary document for reviewing the end result (clear status, per-file change descriptions with line stats and diff previews, observations from any analysis, verification steps)
result.patch (when code was modified) — a ready-to-apply unified diff you can review or apply directly with git apply

The design goal is that you spend your time reviewing actual delivered work and collaborating on changes, not reviewing incremental proposals.

Basic Usage

From within a Grok / agent session:

Delegate this task using the code-delegation-harness:

Task: Add Google-style docstrings and type hints to the `process_batch` function in src/batch.py. Do not change behavior.

Target directory: /path/to/your/project

Constraints: Follow the existing style in the file. Keep changes minimal.

For the best experience, have the underlying wrapper called with --output-file so you receive the full set of review artifacts.

Professional CLI (`gcdh`)

Recommended today (immediate & reliable):

cd code-delegation-harness
chmod +x bin/gcdh
export PATH="$PWD/bin:$PATH"

gcdh --help
gcdh --quiet --task "..." --target-dir /path/to/project --output-file result.json

Installation

See the full Installation guide for the recommended path for primary agents versus development use.

Documentation

Full documentation lives in the docs/ directory:

Installation
Usage Notes
Examples
Dogfooding Case Study
Complete Proposed Documentation Structure (what we intend to build over time)

For Agents, Sidecars & Automation (Robot-Useful)

The harness was built to be driven by other agents and sidecars:

--quiet (-q) → minimal, clean output perfect for programmatic consumption.
--output-file is strongly recommended — it produces the full machine + human artifact set.
Structured JSON output is stable and self-describing.
--dry-run + --quiet gives agents a cheap, safe way to preview a task before committing.
Persistent status files + --status / --resume make long-running work observable and resumable from outside.

This design makes it a strong primitive for building reliable delegation into larger agent architectures (including future sidecar systems).

Quiet Mode

--quiet (-q) is one of the most useful flags for both humans and agents. In quiet mode the tool only emits errors and the final artifact paths, making runs much cleaner — especially when combined with --output-file.

Long-running tasks are well supported:

Use --timeout (seconds) and --max-turns to give big jobs room to breathe.
Add --wait-for-completion --max-wait 14400 (for example) and the wrapper will automatically poll until a background run finishes, then deliver the complete artifacts (full .json + .report.md + .patch + .run-meta.json).
Persistent .cdh-run-<id>.status files are written for any run using --run-name or --wait-for-completion. These contain task snippet, run_name, timing, and state (launched / waiting / completed / max_wait_exceeded).
Use --status --target-dir /path at any time to see both active and completed runs in that tree.
Use --resume <run-id-or-file> to re-attach to a background run (smart short-circuit if it already finished).
All final human review artifacts reflect the full background/resumption story when relevant.

See the docs/ directory for usage notes, examples, and case studies.

Dry-Run Preview Mode

Use --dry-run (ideally with --quiet) before expensive or long-running delegations. It shows exactly what would happen without executing anything or writing files:

The full prompt that would be sent
Resolved configuration and flags
Expected output artifacts

This is extremely useful for agents and for humans reviewing scope.

Combine with --quiet for the cleanest preview output.

Strengths

Produces excellent, self-contained review artifacts by default
Strong, reliable support for long-running and background tasks
Clean handling of read-only / no-change work
Clear error categorization and status reporting
Full observability via persistent status files + --status / --resume

Current Status (Production Ready)

The harness is ready for real, repeated use on production work.

It delivers clean, reviewable end results even on long-running tasks:

Complete structured JSON
High-signal human .report.md with checklists and "How to Review This Change" guidance
Ready-to-apply .patch
.run-meta.json for reproducibility
Persistent status files (.cdh-run-*.status) with full launch → wait → completion lifecycle, queryable via --status

Long-running support is first-class:

--timeout / --max-turns
--wait-for-completion with automatic background recovery
--status and smart --resume for visibility and control

The project maintains two aligned purposes:

A practical, MIT-licensed tool for reliable coding delegation
The primary dogfooding platform for designing clean delegation patterns for future sidecar architectures

The code in this repository is the current production version.

Development & Contributing

Core implementation lives in src/code_delegation_harness/harness.py (with shims in bin/gcdh; scripts/grok_delegate.py is a legacy compatibility shim). You can also run with python -m code_delegation_harness after install or with PYTHONPATH=src.

See CHANGELOG.md for release history. Full development notes live in the upstream development tree.

License

MIT

Built as a focused coding harness. The priority is useful, low-friction delegation with excellent output — for the person who still has to read, understand, and take responsibility for the final result.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.1

May 30, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

code_delegation_harness-0.2.1.tar.gz (27.2 kB view details)

Uploaded May 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

code_delegation_harness-0.2.1-py3-none-any.whl (22.0 kB view details)

Uploaded May 30, 2026 Python 3

File details

Details for the file code_delegation_harness-0.2.1.tar.gz.

File metadata

Download URL: code_delegation_harness-0.2.1.tar.gz
Upload date: May 30, 2026
Size: 27.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for code_delegation_harness-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`ddd74c696028762c7f958059a3c2a9d1ae8103ea4c5237470f6b2b25273f7dc9`
MD5	`315ebb35b5100afcad107ccd7e8327a3`
BLAKE2b-256	`ba687ccb901ee7c8fcd8b34764babdd008f3d3207ca1148b8ab3516d0eb5cd2e`

See more details on using hashes here.

File details

Details for the file code_delegation_harness-0.2.1-py3-none-any.whl.

File metadata

Download URL: code_delegation_harness-0.2.1-py3-none-any.whl
Upload date: May 30, 2026
Size: 22.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for code_delegation_harness-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e8ef6e7ef862779c1d83415e117ab06ed00c91106929ed6ed851324c976c282b`
MD5	`53b824080ba5a662c090968f48b36eb7`
BLAKE2b-256	`b8417582358837399a8803463b5862508b0954e37bf132dc0a110e9907a6f1a8`

See more details on using hashes here.

code-delegation-harness 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

code-delegation-harness

Quick Start

Why This Exists

Current Output

Current Output (What You Actually Review)

Basic Usage

Professional CLI (gcdh)

Documentation

For Agents, Sidecars & Automation (Robot-Useful)

Quiet Mode

Dry-Run Preview Mode

Strengths

Current Status (Production Ready)

Development & Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Professional CLI (`gcdh`)