Cycles budget and tool governance for OpenAI Agents SDK — per-tenant budgets, tool estimates, and audit trails
Project description
Cycles OpenAI Agents SDK Integration
Cycles governance for the OpenAI Agents SDK, powered by Cycles.
Prerequisites
Before you begin, make sure you have:
- Python 3.10+
- An OpenAI API key — required by the OpenAI Agents SDK to call LLMs
- A running Cycles server — see the deployment guide to set one up
- A Cycles API key — see API key management
- A tenant and budget — see tenant management and budget allocation
New to Cycles? The end-to-end tutorial walks through the full setup — from deploying the server to making your first budget-guarded API call — in about 10 minutes.
Why
The OpenAI Agents SDK gives you hooks and guardrails for content safety, but nothing for governance or action authority. Without Cycles governance:
- A retry loop burns through $47 of API calls before anyone notices.
- An agent with a
send_emailtool sends 200 emails in a single run because nothing limits it. - You can't give Tenant A a $10/day budget and Tenant B a $100/day budget — every tenant gets unlimited access.
- There's no audit trail showing which agent called which tool, how many tokens it used, or what was consumed.
This plugin fixes all of that with one line:
result = await Runner.run(agent, input="...", hooks=CyclesRunHooks(tenant="acme"))
Every LLM call and every tool call in the entire agent run — including handoffs to sub-agents — automatically reserves budget before execution and commits actual usage after. If the budget is exhausted, the agent stops. No per-function decoration. No code changes to your tools.
What It Does
| Problem | How This Solves It |
|---|---|
| Runaway LLM spending | Every LLM call reserves budget before running. DENY = agent stops. |
| Uncontrolled tool actions | Tool estimate map assigns per-call estimates (send_email: 50, search: 0). Higher-estimate tools consume budget faster. |
| No per-tenant limits | Pass tenant="acme" — Cycles enforces per-tenant budgets server-side. |
| No pre-run check | cycles_budget_guardrail calls /v1/decide before the agent starts. Zero tokens consumed on DENY. |
| No audit trail | Every reservation, commit, and handoff is recorded in the Cycles ledger. |
| Agent runs forever | TTL heartbeat auto-extends reservations. If the agent dies, reservations expire and budget is released. |
Installation
pip install runcycles-openai-agents
Setup
Set the following environment variables before running your agent:
# Required — OpenAI Agents SDK needs this to call LLMs
export OPENAI_API_KEY=sk-...
# Required — tells the plugin where your Cycles server is
export CYCLES_BASE_URL=http://localhost:7878
export CYCLES_API_KEY=cyc_live_...
Quick Start
from agents import Agent, Runner
from runcycles_openai_agents import CyclesRunHooks, cycles_budget_guardrail
# Pre-run budget check — agent never starts if budget exhausted
guardrail = cycles_budget_guardrail(tenant="acme-corp", estimate=5_000_000)
# Runtime governance — every tool/LLM call goes through Cycles
hooks = CyclesRunHooks(
tenant="acme-corp",
app="support-platform",
tool_estimates={
"send_email": 50, # 50 RISK_POINTS per call
"update_crm": 10, # 10 RISK_POINTS per call
"search_knowledge": 0, # zero estimate — no reservation
},
)
agent = Agent(
name="case-resolver",
instructions="You resolve support cases.",
input_guardrails=[guardrail],
)
result = await Runner.run(agent, input="...", hooks=hooks)
Hook lifecycle
The hooks plug into the SDK's native RunHooks interface and govern the entire agent run automatically:
| Hook | Cycles API Call | Blocking | Detail |
|---|---|---|---|
on_tool_start |
create_reservation (tool estimate) |
Raises on DENY | Budget reserved based on tool estimate map |
on_tool_end |
commit_reservation |
No | Actual amount committed |
on_llm_start |
create_reservation (LLM estimate) |
Raises on DENY | Budget reserved before each LLM call |
on_llm_end |
commit_reservation (actual tokens) |
No | Real token count from response.usage committed |
on_handoff |
create_event (audit trail) |
No | Handoff recorded in Cycles ledger |
All raised exceptions from budget denial trigger BudgetExceededError. See Error Handling Patterns in Python for details.
Error handling
If Runner.run() raises, pending reservations stay locked until TTL expires. Call release_pending() to free them immediately:
hooks = CyclesRunHooks(tenant="acme-corp", app="support-platform")
try:
result = await Runner.run(agent, input="...", hooks=hooks)
except Exception:
await hooks.release_pending("agent_run_failed")
raise
When budget is denied, the hooks raise BudgetExceededError:
from runcycles import BudgetExceededError
try:
result = await Runner.run(agent, input="...", hooks=hooks)
except BudgetExceededError as e:
print(f"Budget denied: {e}")
# Agent stopped — no further tokens consumed
Guardrail (pre-run check)
cycles_budget_guardrail returns an InputGuardrail that calls /v1/decide before the agent starts. If the tenant is suspended or budget is exhausted, the guardrail trips and the agent never runs — zero tokens consumed:
from runcycles_openai_agents import cycles_budget_guardrail
guardrail = cycles_budget_guardrail(
tenant="acme-corp",
estimate=5_000_000, # expected total run estimate
unit=Unit.USD_MICROCENTS,
fail_open=True, # allow if Cycles server is down
)
agent = Agent(name="bot", input_guardrails=[guardrail])
Tool estimate mapping
Define an estimate policy once. New tools added to the agent get a default estimate automatically:
from runcycles_openai_agents import ToolEstimateMap, ToolEstimateConfig
hooks = CyclesRunHooks(
tenant="acme-corp",
tool_estimates=ToolEstimateMap(
mapping={
"send_email": 50, # 50 RISK_POINTS (default unit)
"update_crm": ToolEstimateConfig(
estimate=10,
action_kind="tool.crm.update",
unit=Unit.RISK_POINTS, # explicit unit
),
"search_knowledge": 0, # zero estimate — no reservation
},
default_estimate=1, # unmapped tools: 1 RISK_POINT
default_unit=Unit.RISK_POINTS, # unit for int shorthand values
),
)
Configuration
Explicit client
from runcycles import CyclesConfig, AsyncCyclesClient
from runcycles_openai_agents import CyclesRunHooks
config = CyclesConfig(base_url="http://localhost:7878", api_key="cyc_live_...")
client = AsyncCyclesClient(config)
hooks = CyclesRunHooks(client=client, tenant="acme-corp")
Fail-open / fail-closed
By default, if the Cycles server is unreachable the agent continues (fail_open=True). Set fail_open=False to enforce strict governance:
hooks = CyclesRunHooks(tenant="acme", fail_open=False)
All options
CyclesRunHooks(
client=None, # AsyncCyclesClient (or auto-created from config/env)
config=None, # CyclesConfig (creates client if no client given)
tenant="acme-corp", # Subject.tenant
workspace="prod", # Subject.workspace
app="support-platform", # Subject.app
workflow="case-resolution", # Subject.workflow
agent="case-resolver", # Subject.agent (overridden by actual agent name)
toolset=None, # Subject.toolset (overridden by tool name)
tool_estimates={"email": 50}, # dict or ToolEstimateMap (default unit: RISK_POINTS)
default_tool_estimate=1, # estimate for unmapped tools (in default unit)
llm_estimate=500_000, # per-LLM-call estimate (~$0.005 in USD_MICROCENTS)
llm_unit=Unit.USD_MICROCENTS,
fail_open=True, # allow execution if Cycles is down
ttl_ms=60_000, # reservation TTL (heartbeat extends at half-interval)
overage_policy=CommitOveragePolicy.ALLOW_IF_AVAILABLE,
dry_run=False, # shadow mode — no budget consumed
)
Features
- Framework-native: Plugs into the SDK's
RunHooksinterface — not function-level decoration - Policy-driven: Define tool estimates once in a map, not per-function
- LLM governance: Every LLM call reserves and commits with real token metrics
- Pre-run guardrail:
/v1/decidecheck before agent starts — zero tokens on DENY - Handoff-aware: Agent handoffs recorded as audit events in the Cycles ledger
- Automatic heartbeat: TTL extension keeps reservations alive during long operations
- Fail-safe cleanup:
release_pending()frees locked budget when agent runs fail - Fail-open by default: Agent continues if Cycles server is unreachable
- Environment config:
CYCLES_BASE_URL+CYCLES_API_KEYfor zero-config setup - Typed exceptions:
BudgetExceededErrorfor precise error handling
Examples
The examples/ directory contains runnable integration examples:
| Example | Description |
|---|---|
| basic_budget.py | LLM token budget enforcement |
| tool_governance.py | Tool estimate mapping — higher-estimate tools consume more, read-only tools use zero estimate |
| multi_agent.py | Multi-agent handoff with shared budget and pre-run guardrail |
See examples/README.md for setup instructions.
Development
pip install -e ".[dev]"
# Lint
ruff check .
# Type check (strict mode)
mypy src/runcycles_openai_agents
# Run tests with coverage (95% threshold enforced in CI)
pytest --cov
CI runs all three checks on Python 3.10 and 3.12 for every push and pull request.
Documentation
- Cycles Documentation — full docs site
- Python Client — the underlying
runcyclesclient - Cycles Protocol — how reserve-commit works
- Error Handling Patterns — handling budget errors
License
Apache 2.0
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file runcycles_openai_agents-0.2.0.tar.gz.
File metadata
- Download URL: runcycles_openai_agents-0.2.0.tar.gz
- Upload date:
- Size: 28.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d4b2eed540ad6990559bd620d17442fba1354f34472cfc28791e72695c3bbad3
|
|
| MD5 |
e8e36f68e5d5bc2436a76b4fef54ca7c
|
|
| BLAKE2b-256 |
910c977b2ebef617c91e55c1d649eb1c8ac5eb65e67dfa595326afce032a4a66
|
Provenance
The following attestation bundles were made for runcycles_openai_agents-0.2.0.tar.gz:
Publisher:
python-publish.yml on runcycles/cycles-openai-agents
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
runcycles_openai_agents-0.2.0.tar.gz -
Subject digest:
d4b2eed540ad6990559bd620d17442fba1354f34472cfc28791e72695c3bbad3 - Sigstore transparency entry: 1217122784
- Sigstore integration time:
-
Permalink:
runcycles/cycles-openai-agents@c218d4bb8cf99247fb0dbdf0bdd2c84c7a25223b -
Branch / Tag:
refs/tags/v0.2.0 - Owner: https://github.com/runcycles
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@c218d4bb8cf99247fb0dbdf0bdd2c84c7a25223b -
Trigger Event:
push
-
Statement type:
File details
Details for the file runcycles_openai_agents-0.2.0-py3-none-any.whl.
File metadata
- Download URL: runcycles_openai_agents-0.2.0-py3-none-any.whl
- Upload date:
- Size: 18.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4ab09ca1b477edcef72cbf48abf78fbbf44b3307f94d3a9f01a53e9b09555a75
|
|
| MD5 |
137c12655c04104c8bffab4cc08fc3c9
|
|
| BLAKE2b-256 |
9fe2f3b22dc9010032b7c938ac23837994906ec771da183917bfd8d5e2820235
|
Provenance
The following attestation bundles were made for runcycles_openai_agents-0.2.0-py3-none-any.whl:
Publisher:
python-publish.yml on runcycles/cycles-openai-agents
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
runcycles_openai_agents-0.2.0-py3-none-any.whl -
Subject digest:
4ab09ca1b477edcef72cbf48abf78fbbf44b3307f94d3a9f01a53e9b09555a75 - Sigstore transparency entry: 1217122794
- Sigstore integration time:
-
Permalink:
runcycles/cycles-openai-agents@c218d4bb8cf99247fb0dbdf0bdd2c84c7a25223b -
Branch / Tag:
refs/tags/v0.2.0 - Owner: https://github.com/runcycles
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@c218d4bb8cf99247fb0dbdf0bdd2c84c7a25223b -
Trigger Event:
push
-
Statement type: