The acceptance layer for autonomous coding — 'CI for AI agents'. A swarm works until a verifier PROVES the goal is done.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

ypollak2

These details have not been verified by PyPI

Project description

loophole

A swarm of AI agents that work on a goal until it's provably done — and tells you exactly what it couldn't prove.

The acceptance layer for autonomous coding — "CI for AI agents." Bring your own agent; loophole is the trusted gate that decides what's actually done.

Contents

See it work in 10 seconds
The problem · The idea
60-second quickstart
How it works · Three kinds of "done"
Anti-reward-hacking
CLI · Providers
Bring your own executor
For teams — CI acceptance gate
Honest status & safety · Roadmap
Contributing · License

▶️ See it work in 10 seconds — no setup, no API key

After installing (below), run either demo. Both are fully self-contained — no LLM, no keys, no config:

loophole demo          # 30-sec proof: a naive agent says "Done! ✅" on buggy code —
                       # loophole REJECTS it (the check fails), then accepts once it's fixed.
loophole watch --demo  # watch the swarm work LIVE in your terminal (animated, no browser).

loophole demo reaching DONE only after the bug is fixed is the whole idea in one command: a check decides "done," never the agent.

What that looks like (real loophole demo output)

[1] A naive agent reports "Done! ✅" on buggy add() — return a - b
    → loophole runs the check:  assert add(2,3)==5  ✗  FAIL
    → loophole does NOT declare done. The false claim is caught.

[2] The agent's completion claim is rejected:
    "the candidate satisfies the declared Goal Contract under the trusted
     verifier boundary" — NOT "the goal is provably achieved".

[3] Now the bug is fixed (return a + b) and re-verified:

================================================================
loophole — Residual-Risk Report
================================================================
Outcome: DONE
What was VERIFIED:
  [PASS] hard:python3 -c "assert add(2,3)==5; assert add(-1,1)==0"
================================================================
    → Only now does loophole accept completion.

Takeaway: 'done' means the verifier passed — not that an LLM said so.

The problem

You give an AI agent a real task — "build a REST API for a todo app, with tests" — and three things go wrong:

It quits too early. One pass, a confident "Done! ✅", and a half-working result.
It lies about being finished. Models are trained to please. "All tests pass!" — except it deleted the failing tests.
It can't tell when it's actually done. No goalpost, so it either stops at the first plausible output or loops forever.

/loop-style tools keep one agent grinding. But a single agent in a single context can't divide labor, can't hold a big task, and still grades its own homework.

The idea

loophole turns a goal into a task graph, runs a swarm of agents across it in isolated git worktrees, and refuses to stop until an external, falsifiable check says the goal is met.

The key move: agents never decide they're done. A check does.

Your tests passing. A build succeeding. curl returning 200. If a goal can't be checked, loophole asks a human instead of guessing. It cannot be argued into calling unfinished work complete.

When the check fails, loophole re-plans, retries, and routes around dead ends — until it genuinely passes or hits your budget. Then it hands you a Residual-Risk Report: what it proved, and what it didn't.

60-second quickstart

git clone https://github.com/Chuzom/loophole && cd loophole
python -m venv .venv && source .venv/bin/activate
pip install -e .          # zero-config: works with local Ollama out of the box

# point it at a goal + a way to check "done":
loophole run "Create add.py with add(a,b) returning a+b" \
  --verify 'python3 -c "from add import add; assert add(2,3)==5; print(\"ok\")"' \
  --workspace ./out

🎯 goal goal-3034039ba5f5
• round 1: planning
• round 1: executing 1 task(s)
• round 1: verify -> PASS (score 1000)
• goal -> done (all verifiers passed)

================ Residual-Risk Report ================
Outcome: DONE
What was VERIFIED:  [PASS] hard:python3 -c "from add import add; ..."
What was NOT proven: anything outside the verifier's scope.
=====================================================

── LoopHole scorecard ─────────────────────────
  ✓ VERIFIED DONE   (1 round · 3s)
  1 agent merge accepted by the verifier
  0 candidates the verifier REJECTED before accepting
  0 cheats the boundary blocked

That's the whole contract: you define "done," loophole reaches it — and every run ends with a scorecard (loophole stats aggregates them) so you can see, in numbers, that "done" was verifier-backed.

How it works

🎯 Goal ─▶ 🧭 Planner ─▶ ⚙️ Executors ─▶ ✅ Verifier ─▶ 🏁 Done
              ▲              (worktrees)        │
              └──── not done: re-plan / retry ──┘

Stage	What happens
Goal Contract	Your goal + verifier(s). A goal with no way to check "done" is rejected up front.
Planner	Decomposes the goal into a dependency DAG; a critic pass attacks the plan before any work runs.
Executors	Tool-using agents (write/read files, run shell) work in parallel, each in its own git worktree, then merge serially.
Verifier	Runs your falsifiable check from a fresh checkout of the merged result. Only this grants completion.
Loop control	Measures progress by verifier metrics (not vibes), detects when it's stuck, and re-plans — with budget + round circuit-breakers.

Three kinds of "done"

Verifier	Behavior	Use it for
`hard` — a command	exit 0 = done. The gold standard.	`pytest -q`, `npm test`, `make`, a health-check `curl`
`soft` — an LLM rubric	can only veto, never grant	subjective quality gates layered on top of a hard check
`human` — a checkpoint	loophole pauses and asks you	irreducibly subjective goals (prose, design)

Anti-reward-hacking (the part most tools skip)

Because the verifier is the goalpost, loophole defends it:

Verifier adversary review — before any work runs, an LLM pass attacks your declared verifier ("how could an agent pass this without satisfying intent?") and lists the concrete bypass strategies it finds in the Residual-Risk Report, so you can harden the check first.
Verification boundary — protected files (your tests, configs) are checked against the original commit; if an agent edits them, the run fails.
Test-count audit — the suite can't silently shrink to make red turn green.
No fake "done" — if an agent claims completion but changed nothing, it's rejected.
Secrets never reach verifiers — your ANTHROPIC_API_KEY and friends are scrubbed from the subprocess environment.
Scoped egress — an executor granted network access reaches only the hosts you declare (macOS: enforced via a localhost-only jail + a host-allowlisted proxy; denied hosts are 403'd and audited).

CLI

loophole init                                   # infer a starter loophole.json from the repo
loophole init --template refactor-frozen-tests  # or scaffold from a template
loophole run                                    # auto-loads ./loophole.json
loophole run "<goal>" --verify "pytest -q" [--workspace DIR]
loophole run --contract loophole.json --executor-command 'claude -p {task}'  # BYO agent
loophole run "<goal>" --protect "tests/**" --expect-test-delta 0   # lock the suite
loophole contract validate loophole.json        # validate / show a contract (path or URL)
loophole registry list                            # named, shareable acceptance specs
loophole registry add team-default ./loophole.json   # publish a spec; reuse by name
loophole run --contract team-default              # run a registry spec by name
loophole run "<goal>"                            # live STREAM view by DEFAULT (append-only)
loophole run "<goal>" --view forge               # THE FORGE full-screen dashboard (TTY only)
loophole run "<goal>" --no-watch                 # plain log (CI / when piping)
# models route via Chuzom by DEFAULT (planner=chuzom:simple, executor=chuzom:complex);
# set CHUZOM_URL to route through a live `chuzom-route` server, else local tier policy.
loophole run "<goal>" --executor-model ollama:qwen3-coder:30b   # or pin a model directly
loophole stats                                    # your value scorecard — verified, rejections, cheats blocked
loophole watch --demo                             # self-driving terminal swarm demo (no LLM)
loophole serve                                    # live web FLEET — all runs, click to drill in
loophole serve <goal-id>                          # live web Forge for one run
loophole demo                                     # the 30s 'can't-fake-done' demo (no LLM)
loophole audit <goal-id>                         # full audit trail (the trust artifact)
loophole runs                                    # list past runs
loophole estimate "<goal>" --max-rounds 10     # dry-run cost prediction
loophole status <goal-id>  ·  loophole resume <goal-id>  ·  loophole ls

Providers

Provider-agnostic — pick per role (cheap executors, strong planner):

--planner-model anthropic:claude-sonnet-4-6 --executor-model ollama:qwen3-coder:30b

Default is Ollama (free, local, zero-config). Set ANTHROPIC_API_KEY / OPENAI_API_KEY to use those. pip install -e '.[anthropic]' or '.[openai]' for the SDKs.

Bring your own executor

loophole's value is the trusted boundary around an untrusted executor — so the executor is a pluggable backend. Use the built-in agent, or drive any external/ frontier coding agent as a black box; git-worktree isolation, the write-allowlist, merge gate, and verifier apply to every executor — no adapter can grant "done":

loophole run --contract loophole.json --executor-command 'claude -p {task}'

That's the bet: as models commoditize, who wrote the code matters less than whether it provably passes. loophole is the neutral referee, not another coder.

Swarm on top of any agent framework

Each swarm worker can be a full agent framework — Claude Code, Agno, a Hermes harness, your own — running in its own git worktree while loophole stays the orchestrator + trust layer. Claude Code is built in:

loophole run "<goal>" --executor claude-code   # runs `claude -p` per task, streams its
                                               # tool calls into the FORGE (agent_step)

Trust exception: the built-in claude-code adapter defaults to trusted — it runs outside the OS sandbox so it can reach your subscription login (macOS keychain). What still holds regardless: git-worktree isolation, the write-allowlist, merge gate, and verifier — no adapter, trusted or not, can grant "done." Force it back into the sandbox with --executor-sandboxed (this blocks keychain/subscription auth; API-key auth via --executor-secret keeps working sandboxed). Generic --executor-command executors are always fully OS-sandboxed by default.

For an API-calling framework, grant scoped egress without touching the filesystem sandbox — on macOS --executor-network is enforced: the jail's network is localhost-only and traffic tunnels through a host-allowlisted egress proxy (denied hosts are 403'd and land in the audit trail). On Linux/bubblewrap egress is still all-or-nothing (netns scoping is on the roadmap):

loophole run "<goal>" --executor claude-code \
  --executor-network api.anthropic.com --executor-secret ANTHROPIC_API_KEY

Add your own framework — implement a tiny Executor subclass, register it under the loophole.executors entry point, and loophole run --executor <name> picks it up (it appears in loophole executor list). The sandbox → write-allowlist → merge gate → verifier boundary is unchanged; no adapter can grant "done." Copy-paste template: examples/adapter_package/.

For teams — loophole as a CI acceptance gate

Let any agent open a PR; make loophole the gate that decides if it's done — in CI, on neutral ground, with a reviewable audit trail:

loophole.json is acceptance-spec-as-code — committed, reviewed, reusable. Scaffold with loophole init (it infers a starter from your repo) or loophole init --template <name>; share contracts by path or URL.
loophole audit <run> renders every boundary decision (merge-gate rejections, write-allowlist violations, soft-judge escalations) with its reason — trust the result without reading every diff. loophole runs lists past runs.
See examples/ci_gate.md for a GitHub Actions gate, and examples/cant_fake_done.py for the 30-second "it can't lie to me" demo.

Honest status & safety

loophole is v0.1. Its promise is precise: it proves "the candidate satisfies the declared contract under a trusted verifier boundary" — not "the goal is objectively achieved." The Residual-Risk Report always says what went unchecked.

Agent-run shell commands are OS-sandboxed (macOS Seatbelt / Linux bubblewrap), deny-by-default, network-denied, with provider secrets scrubbed — and fail-closed if no sandbox is available. Still, treat goals and repos as you would any tool that runs code, and prefer a disposable workspace. The architecture is adversarially audited by a multi-model council; we publish our own findings.

Roadmap

Shipped

OS sandbox for run_shell/verifiers (Seatbelt/bubblewrap, deny-by-default, fail-closed)
Enforced per-task write-globs at commit · per-merge re-verification (verified-green invariant)
Fail-closed soft judge · verifier-adversary pre-flight review
Pluggable executors (bring-your-own-agent) · built-in Claude Code adapter
Audit trail · shareable contracts + templates · contract registry (local + remote index)
Value scorecard + loophole stats · module SDK (graded, domain-specific verifiers)
Chuzom-routed models — with verifier verdicts fed back as ground-truth routing quality
Enforced scoped egress (localhost jail + host-allowlisted proxy) on macOS
History-grounded loophole estimate · goal finish-reasons surfaced in status/audit

Next — the acceptance-layer bet ("CI for AI agents," bring-your-own-executor):

Hosted control-plane (run history, audit, policy, fleet dashboards)
First-class executor adapters for frontier coding agents
Richer verifier adapters (coverage, mutation testing)
bubblewrap netns egress scoping (Linux parity with the macOS proxy)

Contributing

Issues and PRs welcome. Run the suite with pip install -e '.[dev]' && pytest. The architecture was designed — and adversarially audited — by a multi-model council; that critique style is the project's default. Bring disagreement.

License

MIT — see LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

ypollak2

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Jul 1, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

loophole_agents-0.1.0.tar.gz (158.3 kB view details)

Uploaded Jul 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

loophole_agents-0.1.0-py3-none-any.whl (123.1 kB view details)

Uploaded Jul 1, 2026 Python 3

File details

Details for the file loophole_agents-0.1.0.tar.gz.

File metadata

Download URL: loophole_agents-0.1.0.tar.gz
Upload date: Jul 1, 2026
Size: 158.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for loophole_agents-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`b6289af9a14445ccb13770b21a8faba5317ec76f4b415b2ea60761eae3c58276`
MD5	`3524376ee20766e1ac4648c369aed36a`
BLAKE2b-256	`65044585e76879481ea3cf30a3b09fd0e031b407cf581eed6382a46933ea589b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for loophole_agents-0.1.0.tar.gz:

Publisher: release.yml on Chuzom/loophole

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: loophole_agents-0.1.0.tar.gz
- Subject digest: b6289af9a14445ccb13770b21a8faba5317ec76f4b415b2ea60761eae3c58276
- Sigstore transparency entry: 2041768206
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: Chuzom/loophole@5e3d7655e585846e8fedcedbb41b6c58f922c149
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Chuzom
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@5e3d7655e585846e8fedcedbb41b6c58f922c149
- Trigger Event: push

File details

Details for the file loophole_agents-0.1.0-py3-none-any.whl.

File metadata

Download URL: loophole_agents-0.1.0-py3-none-any.whl
Upload date: Jul 1, 2026
Size: 123.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for loophole_agents-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5a6c0cff759d49fb8546a1cb3aab353d6eb4b27d3b8eae8e5c986d253b5f306a`
MD5	`ea078d9655da20a052aeac375f95b12b`
BLAKE2b-256	`addfd3eca976dd26bb722afe0a87bd0e479bb4f29ec873194fd37e5c384ba3a7`

See more details on using hashes here.

Provenance

The following attestation bundles were made for loophole_agents-0.1.0-py3-none-any.whl:

Publisher: release.yml on Chuzom/loophole

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: loophole_agents-0.1.0-py3-none-any.whl
- Subject digest: 5a6c0cff759d49fb8546a1cb3aab353d6eb4b27d3b8eae8e5c986d253b5f306a
- Sigstore transparency entry: 2041768445
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: Chuzom/loophole@5e3d7655e585846e8fedcedbb41b6c58f922c149
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/Chuzom
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@5e3d7655e585846e8fedcedbb41b6c58f922c149
- Trigger Event: push

loophole-agents 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

loophole

▶️ See it work in 10 seconds — no setup, no API key

The problem

The idea

60-second quickstart

How it works

Three kinds of "done"

Anti-reward-hacking (the part most tools skip)

CLI

Providers

Bring your own executor

Swarm on top of any agent framework

For teams — loophole as a CI acceptance gate

Honest status & safety

Roadmap

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance