A production runtime guardrail for AI agents: budget caps, timeouts, tool limits, circuit breakers, verifier retries, and OpenTelemetry traces.
GuardLoop
GuardLoop is a production runtime guardrail for AI agents. It wraps model clients and tools with hard budget caps, timeout control, tool-call limits, and per-tool circuit breakers, re-runs an agent against verifiers until the output passes, and emits OpenTelemetry traces for every protected call. Runaway agent loops can be stopped before they burn through money, flaky tools can be cut off before an agent retries them into a bigger incident, and confidently-wrong answers get a second pass.
The v0.4 focus: runtime guardrails for async Python agents, including agents
built with LangGraph or the OpenAI Agents SDK — direct OpenAI and Anthropic
wrappers, protected tool calls, per-tool circuit breakers, a verify-fix-retry
loop, and framework adapters that put all of it under an existing graph or
Runner.run(...) without rewriting your agent.
from guardloop import (
GuardLoop,
BudgetConfig,
CircuitBreakerConfig,
CircuitBreakerPolicy,
RunContext,
VerifierConfig,
is_json_object,
)
runtime = GuardLoop(
budget=BudgetConfig(
cost_limit_usd="0.10",
token_limit=10_000,
time_limit_seconds=60,
tool_call_limit=20,
),
circuit_breakers=CircuitBreakerConfig(
default=CircuitBreakerPolicy(
failure_threshold=3,
recovery_timeout_seconds=30,
)
),
verifiers=[is_json_object(required_keys=["answer"])],
verifier_config=VerifierConfig(max_retries=2),
)
async def agent(ctx: RunContext, prompt: str) -> str:
instructions = prompt
if ctx.retry_feedback:
instructions += "\n\nFix the previous attempt: " + "; ".join(ctx.retry_feedback)
response = await ctx.openai.responses.create(
model="gpt-5.2",
input=instructions,
max_output_tokens=300,
)
return str(response.output_text)
result = await runtime.run(agent, "research agent runtime safety")
print(result.model_dump_json(indent=2))
Why This Exists
Agents are loops around probabilistic systems. When they go wrong, they can call the same model or tool repeatedly, spend unexpected money, and fail without a clear trace. GuardLoop puts an explicit execution layer around that loop:
flowchart LR
LG["LangGraph graph"] -. "guarded_graph(...)" .-> U
OA["OpenAI Agents SDK agent"] -. "guarded_runner(...)" .-> U
U["Your agent"] --> R["GuardLoop"]
R --> B["BudgetController"]
R --> CB["CircuitBreakerRegistry"]
R --> V["VerifierChain"]
R --> T["OpenTelemetry spans"]
R --> C["RunContext"]
C --> O["Wrapped OpenAI client"]
C --> A["Wrapped Anthropic client"]
C --> W["Wrapped tools / framework callbacks & hooks"]
V -. "feedback on retry" .-> C
Verifier Retry Loop
Agents can return confidently wrong answers. Attach verifiers — plain callables,
sync or async — and GuardLoop runs them after the agent finishes. On rejection
it feeds the verifier's feedback into ctx.retry_feedback and re-invokes the
agent, up to VerifierConfig.max_retries times. Every attempt shares the same
budget and the run's timeout, so the retry loop can never spend past a cap.
from guardloop import GuardLoop, RunContext, VerifierConfig, VerifierContext, VerifierResult
def no_todo(output: object, ctx: VerifierContext) -> VerifierResult:
if "TODO" in str(output):
return VerifierResult(passed=False, feedback="Replace the TODO placeholder.")
return VerifierResult(passed=True)
runtime = GuardLoop(verifiers=[no_todo], verifier_config=VerifierConfig(max_retries=2))
async def agent(ctx: RunContext, task: str) -> str:
# On a retry, ctx.retry_feedback holds the verifier's complaints — read it.
...
result = await runtime.run(agent, "draft the release notes")
print(result.verification_passed, result.verification_attempts, result.verification_feedback)
Built-in rule-based verifiers ship in guardloop: non_empty(),
matches_regex(...), is_json_object(required_keys=...). By default an output
that fails every retry comes back as success=False with
terminated_reason="verification_failed" but with output still populated;
set VerifierConfig(raise_on_failure=True) for a hard stop.
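For instance, chaining the built-ins with strict mode on: verifiers run in order and the chain fails fast at the first rejection. A sketch (matches_regex is assumed to take the pattern as its first positional argument):
from guardloop import GuardLoop, VerifierConfig, is_json_object, matches_regex, non_empty

runtime = GuardLoop(
    verifiers=[
        non_empty(),                               # cheapest check first
        matches_regex(r'"answer"'),                # quick shape check on the raw text
        is_json_object(required_keys=["answer"]),  # full JSON validation last
    ],
    # raise_on_failure=True: exhausting the retries raises instead of
    # returning success=False with terminated_reason="verification_failed".
    verifier_config=VerifierConfig(max_retries=2, raise_on_failure=True),
)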
Framework Adapters
GuardLoop is a wrapper, not a framework — so it slots under the agent
frameworks you already use. Each adapter lives behind its own optional extra and
is not re-exported from the top-level guardloop package, so import guardloop stays dependency-light.
LangGraph
pip install "guardloop[langgraph]"
from langchain_core.messages import HumanMessage
from guardloop import GuardLoop, BudgetConfig
from guardloop.adapters.langgraph import guarded_graph
runtime = GuardLoop(
budget=BudgetConfig(cost_limit_usd="0.10", token_limit=10_000, tool_call_limit=20),
verifiers=[...], # optional: the verifier retry loop wraps the whole graph run
)
agent = guarded_graph(my_compiled_graph, input_key="messages")
result = await runtime.run(agent, {"messages": [HumanMessage("research agent runtime safety")]})
print(result.success, result.cost_usd, result.tokens_used, result.terminated_reason)
guarded_graph returns a GuardLoop-compatible agent, so you keep calling
runtime.run(...) as usual. A LangChain callback handler bound to the
RunContext runs the pre-flight budget check before each LLM node, records usage
afterward, and routes tool calls through the per-tool circuit breaker and the
tool-call budget — so the cost / token / time caps, breakers, and llm_call /
tool_call OpenTelemetry spans all apply inside the graph. A budget breach
inside the graph terminates the run. On a verifier retry the feedback is injected
into a copy of the input state (override feedback_to_state for non-standard
state shapes). Because LangChain chat models often omit max_tokens,
guarded_graph(..., reserved_output_tokens=N) sets the output-token reservation
used by the pre-flight check (default 1024).
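For a graph whose state keeps the conversation under a custom key, a sketch of that override, assuming guarded_graph accepts feedback_to_state as a callable taking the input state and the feedback list (the exact hook shape is an assumption; check the adapter's signature):
from langchain_core.messages import HumanMessage
from guardloop.adapters.langgraph import guarded_graph

def feedback_to_state(state: dict, feedback: list[str]) -> dict:
    # Append the verifier complaints as a new human message so the next
    # attempt can self-correct; the rest of the state carries over.
    retry_state = dict(state)
    retry_state["conversation"] = [
        *state["conversation"],
        HumanMessage("Fix the previous attempt: " + "; ".join(feedback)),
    ]
    return retry_state

agent = guarded_graph(
    my_compiled_graph,
    input_key="conversation",            # non-standard state key
    feedback_to_state=feedback_to_state,
    reserved_output_tokens=512,          # output reservation for the pre-flight check
)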
OpenAI Agents SDK
pip install "guardloop[openai-agents]"
from agents import Agent
from guardloop import GuardLoop, BudgetConfig
from guardloop.adapters.openai_agents import guarded_runner
runtime = GuardLoop(
budget=BudgetConfig(cost_limit_usd="0.10", token_limit=10_000, tool_call_limit=20),
verifiers=[...], # optional: the verifier retry loop wraps the whole Runner.run
)
agent = guarded_runner(Agent(name="researcher", model="gpt-5.2", instructions="..."))
result = await runtime.run(agent, "research agent runtime safety")
print(result.success, result.cost_usd, result.tokens_used, result.output)
guarded_runner returns a GuardLoop-compatible agent that calls Runner.run
under the hood. A GuardLoopRunHooks instance (a subclass of the SDK's RunHooks),
bound to the RunContext, runs the pre-flight budget check before each LLM call,
records usage afterward, and routes tool calls through the per-tool circuit
breaker and the tool-call budget — so the same caps, breakers, and llm_call /
tool_call OpenTelemetry spans apply inside Runner.run(...). On a verifier
retry the feedback is injected into a copy of the run input (override
feedback_to_input for non-standard input shapes; output_from_result for the
answer). Since the SDK's chat models often leave model_settings.max_tokens
unset, guarded_runner(..., reserved_output_tokens=N) sets the pre-flight output
reservation (default 1024).
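A sketch of those overrides, assuming guarded_runner accepts them as callables (the hook signatures are assumptions; final_output is the SDK's own RunResult attribute):
from agents import Agent
from guardloop.adapters.openai_agents import guarded_runner

def feedback_to_input(run_input: str, feedback: list[str]) -> str:
    # For a plain-string run input, append the verifier feedback so the
    # next Runner.run attempt sees what to fix.
    return run_input + "\n\nFix the previous attempt: " + "; ".join(feedback)

def output_from_result(run_result):
    # Pick the value the verifiers should judge out of the SDK's RunResult.
    return run_result.final_output

agent = guarded_runner(
    Agent(name="researcher", model="gpt-5.2", instructions="..."),
    feedback_to_input=feedback_to_input,
    output_from_result=output_from_result,
    reserved_output_tokens=512,
)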
One caveat: the OpenAI Agents SDK has no tool-error lifecycle hook and, by
default, turns a tool exception into an error string fed back to the model — so
the breaker tracks tool attempts and successes but not failures (an already
open breaker still rejects the next call; route a tool body through
ctx.call_tool(...) for full breaker semantics). Streaming
(Runner.run_streamed) is not covered yet; usage is still accounted for after
the run.
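One way to apply the ctx.call_tool workaround: build the SDK tool with the RunContext in scope and run the real body through ctx.call_tool, so exceptions reach the breaker before the SDK turns them into error strings. A sketch (the ctx.call_tool convention and the closure pattern are assumptions):
from agents import function_tool
from guardloop import RunContext

async def search_impl(query: str) -> str:
    ...  # the real tool body, which may raise

def make_search_tool(ctx: RunContext):
    @function_tool
    async def search(query: str) -> str:
        # ctx.call_tool records the failure when search_impl raises, so the
        # breaker counts real failures instead of the SDK's error strings.
        return await ctx.call_tool("search", search_impl, query)
    return search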
Project Guide
For a deeper walkthrough of what has been implemented, how the code is organized, and what the next roadmap goals are, read docs/project-overview.md.
Install
Install from PyPI:
pip install guardloop
For local development:
uv sync
Optional OpenTelemetry exporters are available through the otel extra:
pip install "guardloop[otel]"
For local development with the extra:
uv sync --extra otel
Try the No-Key Demos
uv run python examples/runaway_cost_prevention.py
The demo uses a fake OpenAI-compatible client and intentionally loops forever. GuardLoop stops it when the next model request would exceed the cost cap.
uv run python examples/tool_circuit_breaker.py
This demo uses a failing fake tool. GuardLoop allows the first failures, opens the circuit breaker, then rejects the next call without invoking the tool.
uv run python examples/verifier_retry_loop.py
This demo's agent returns two bad answers in a row (a TODO placeholder, then
malformed JSON). A verifier chain rejects each with feedback, the agent reads
ctx.retry_feedback and self-corrects, and the run ends with
verification_passed: true after three attempts.
uv run python examples/langgraph_guarded.py
This demo runs a small LangGraph graph (with an in-process fake chat model, so
no API key) under guarded_graph. The first run succeeds with cost and tokens
recorded; the second runs under a tiny token budget and is stopped before the
model call.
uv run python examples/openai_agents_guarded.py
This demo runs an OpenAI Agents SDK Agent (with an in-process fake model, so no
API key) under guarded_runner. The first run succeeds with cost and tokens
recorded; the second runs under a tiny token budget and is stopped before the
model call.
Live Provider Smoke Tests
export OPENAI_API_KEY="..."
export ANTHROPIC_API_KEY="..."
uv run python examples/live_openai_basic.py
uv run python examples/live_anthropic_basic.py
Both live examples can be customized with OPENAI_MODEL or ANTHROPIC_MODEL.
Quality Gates
uv run pytest
uv run pytest --cov=guardloop
uv run ruff check .
uv run ruff format --check .
uv run pyright
v0.4 Scope
- Async Python runtime with src/ package layout.
- Hard caps for cost, tokens, time, and tool calls.
- Per-tool circuit breakers with closed, open, and half-open states; global default breaker policy plus per-tool overrides.
- Verify-fix-retry loop: sync or async output verifiers, fail-fast chains,
built-in rule-based verifiers, feedback into ctx.retry_feedback, and an
opt-in strict mode — all attempts share one budget and the run timeout.
- Framework adapters that put the caps, breakers, verifier loop, and
llm_call/tool_call OpenTelemetry spans inside an existing agent without
rewriting it: guardloop.adapters.langgraph.guarded_graph (behind the
langgraph extra, via a LangChain callback handler) and
guardloop.adapters.openai_agents.guarded_runner (behind the openai-agents
extra, via a RunHooks subclass).
- Direct wrappers for AsyncOpenAI.responses.create and
AsyncAnthropic.messages.create (see the sketch after this list).
- OpenTelemetry spans for agent runs, LLM calls, tools, and verifiers.
- Fake-client tests and demos that do not require API keys; CI on push/PR (pytest + ruff + pyright, Python 3.11–3.13).
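The Anthropic side mirrors the OpenAI quickstart above. A minimal sketch, assuming the wrapped client is exposed on the RunContext as ctx.anthropic (that attribute name and the model id are assumptions):
from guardloop import GuardLoop, RunContext

runtime = GuardLoop()

async def agent(ctx: RunContext, prompt: str) -> str:
    # The wrapped AsyncAnthropic client runs the pre-flight budget check
    # before the request and records token usage and cost afterward.
    response = await ctx.anthropic.messages.create(
        model="claude-sonnet-4-5",
        max_tokens=300,
        messages=[{"role": "user", "content": prompt}],
    )
    return response.content[0].text

result = await runtime.run(agent, "research agent runtime safety")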
Roadmap
- v0.2: per-tool circuit breakers. ✅
- v0.3: verify-fix-retry loop. ✅
- v0.4: LangGraph adapter. ✅
- v0.4.1: OpenAI Agents SDK adapter. ✅
- v0.5: OpenTelemetry metrics (cost / tokens / tool-calls / retries), per-attempt span nesting, one-command Jaeger + Phoenix stack.
- v0.6: persistent breaker state, YAML/TOML policy, multi-model pricing, loop detection.
- v1.0: stable API, changelog, docs site, release checklist.
See docs/roadmap.md for details.
Download files
File details
Details for the file guardloop-0.4.2.tar.gz.
File metadata
- Download URL: guardloop-0.4.2.tar.gz
- Size: 1.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 3b1709e69c3a89ce0032b17b5a46ae74d57159766d493a414acb42c2e5c6c81f |
| MD5 | 4f1e3a0050a06b6888097b5ca58c1f3c |
| BLAKE2b-256 | 5c5a7b37a3d94c53d7663482091bb71def2c84123614fb93455f0f561b2b799b |
Provenance
The following attestation bundles were made for guardloop-0.4.2.tar.gz:
Publisher: publish-pypi.yml on awesome-pro/guardloop
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: guardloop-0.4.2.tar.gz
- Subject digest: 3b1709e69c3a89ce0032b17b5a46ae74d57159766d493a414acb42c2e5c6c81f
- Sigstore transparency entry: 1509884957
- Permalink: awesome-pro/guardloop@ed3293e763197d3ffa4137b821460ce2bdf9c3a4
- Branch / Tag: refs/tags/v0.4.2
- Owner: https://github.com/awesome-pro
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@ed3293e763197d3ffa4137b821460ce2bdf9c3a4
- Trigger Event: release
File details
Details for the file guardloop-0.4.2-py3-none-any.whl.
File metadata
- Download URL: guardloop-0.4.2-py3-none-any.whl
- Size: 41.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 67a2c2b00f571ecfecd27b6d78202ca538e9bbe538c7e3d42542c6a7fb45d6a6 |
| MD5 | 12af0bf982fc896c3db2157c9d54b044 |
| BLAKE2b-256 | 951d9ce4ecd15e11c09353cc7977784cae32acccac6027bd4a8cc7d0ea0c76b9 |
Provenance
The following attestation bundles were made for guardloop-0.4.2-py3-none-any.whl:
Publisher: publish-pypi.yml on awesome-pro/guardloop
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: guardloop-0.4.2-py3-none-any.whl
- Subject digest: 67a2c2b00f571ecfecd27b6d78202ca538e9bbe538c7e3d42542c6a7fb45d6a6
- Sigstore transparency entry: 1509885067
- Permalink: awesome-pro/guardloop@ed3293e763197d3ffa4137b821460ce2bdf9c3a4
- Branch / Tag: refs/tags/v0.4.2
- Owner: https://github.com/awesome-pro
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@ed3293e763197d3ffa4137b821460ce2bdf9c3a4
- Trigger Event: release