LangChain agent middleware for Cycles — pre-execution budget authority for model calls, tool calls, and runaway agent loops in Python create_agent workflows.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

amavashev

These details have not been verified by PyPI

Project links

Homepage

Project description

Cycles for LangChain — AI agent middleware for budget and action authority

LangChain middleware for pre-execution budget authority over model calls, tool calls, and runaway agent loops in create_agent workflows. Provider-neutral: works with any LangChain 1.x agent regardless of model provider, as long as actions flow through LangChain middleware/tool execution.

Built on LangChain's AgentMiddleware API:

wrap_model_call — pre-model-call authorization plus optional reserve/commit/release lifecycle around each LLM invocation (v0.1.5+)
wrap_tool_call — tool-call authorization plus optional reserve/commit/release lifecycle around each tool execution
before_model (with @hook_config(can_jump_to=["end"])) — fan-out caps and external policy halts before another model turn

Per-call actual-cost extraction (provider-specific token-usage parsing) and streaming integration are v0.2.0 scope. Until then, CyclesModelGate commits at the configured estimate; for precise per-call token capture today, use the BaseCallbackHandler recipe in cycles-client-python/examples/langchain_integration.py.

Install via pip install langchain-runcycles.

What's in the box

CyclesModelGate (v0.1.5+) — runs before every model call. Authorizes via client.decide() and/or reserves budget. Returns a ModelResponse carrying the denial reason on deny so the agent terminates naturally.
CyclesToolGate — runs before every tool call. Authorizes via client.decide() and/or reserves budget via client.create_reservation(). Returns a ToolMessage on denial so the model can recover gracefully.
CyclesFanOutGate — runs before every model turn. Halts the agent (with jump_to: "end") when a turn cap is hit or when an external policy says to stop. Useful for runaway-loop protection and per-tenant burst caps.

All three work with sync or async LangChain agents and the sync (CyclesClient) or async (AsyncCyclesClient) Cycles client. Compose them in a single middleware=[...] list — typical order is [CyclesFanOutGate, CyclesModelGate, CyclesToolGate] so fan-out caps trigger before model spend before tool side effects.

Installation

pip install langchain-runcycles

Requires Python 3.10+ and langchain >= 1.0.

Quick Start

from langchain.agents import create_agent
from langchain_core.tools import tool
from langchain_runcycles import CyclesToolGate
from runcycles import Action, CyclesClient, CyclesConfig, Subject

@tool
def send_email(to: str, body: str) -> str:
    """Send an email."""
    return f"Sent to {to}"

client = CyclesClient(CyclesConfig(base_url="http://localhost:7878", api_key="..."))
gate = CyclesToolGate(
    client,
    subject=Subject(tenant="acme", agent="researcher"),
    action={"send_email": Action(kind="tool.call", name="send_email")},
    mode="decide",
)

agent = create_agent(model="claude-sonnet-4-6", tools=[send_email], middleware=[gate])
agent.invoke({"messages": [{"role": "user", "content": "Email alice."}]})

If client.decide() denies the call, send_email is never invoked — the model receives a ToolMessage with the denial reason and can choose another path.

Middleware

`CyclesModelGate` (v0.1.5+)

Gates each model call. Same three modes as CyclesToolGate. On denial in decide mode, returns a ModelResponse whose AIMessage carries the denial reason — the agent terminates naturally because the AIMessage has no tool_calls.

from langchain_runcycles import CyclesModelGate

model_gate = CyclesModelGate(
    client,
    subject=Subject(tenant="acme", agent="researcher"),
    action=Action(kind="llm.completion", name="gpt-4o"),
    mode="reserve",
    estimate=Amount(unit=Unit.USD_MICROCENTS, amount=2_000_000),  # $0.02 per call
)

v0.1.5 commits at the configured estimate. Per-call actual-cost extraction (token usage from provider response metadata) and streaming integration land in v0.2.0. For precise per-call token cost capture today, use the BaseCallbackHandler recipe in cycles-client-python/examples/langchain_integration.py.

`CyclesToolGate`

Gates each tool call. Three modes:

Mode	What it does
`"decide"`	Calls `client.decide()`. Denies the tool call on a non-allow decision. No reservation.
`"reserve"`	Creates a reservation, runs the tool, commits on success / releases on exception.
`"decide+reserve"`	Authorizes via `decide()`, then reserves+commits. Most strict.

gate = CyclesToolGate(
    client,
    subject=Subject(tenant="acme", agent="researcher"),
    action={
        "search": Action(kind="tool.call", name="search"),
        "send_email": Action(kind="tool.call", name="send_email"),
    },
    mode="decide+reserve",
)

`CyclesFanOutGate`

Halts the agent when a turn cap or external policy says stop. Optional client argument enables remote policy checks on each turn:

from langchain_runcycles import CyclesFanOutGate

fanout = CyclesFanOutGate(
    max_turns=20,
    client=client,                       # optional — for remote policy
    subject=Subject(tenant="acme"),
    action=Action(kind="model.turn", name="research"),
)

Pair with CyclesToolGate and HumanInTheLoopMiddleware for production-grade agent governance.

Configuration

Subject

Either a static Subject or a callable resolving from request/state:

from runcycles import Subject

# Static
subject = Subject(tenant="acme", agent="bot")

# Per-call extractor (CyclesToolGate: (request, state); CyclesFanOutGate: (state, state))
def per_tenant(request, state):
    return Subject(tenant=state["config"]["tenant"], agent="bot")

Action

Static, mapping (per-tool name), or callable:

from runcycles import Action

# Static
action = Action(kind="tool.call", name="any")

# Per-tool mapping
action = {
    "send_email": Action(kind="tool.call", name="send_email"),
    "search": Action(kind="tool.call", name="search"),
}

# Callable
def derive(request):
    return Action(kind="tool.call", name=request.tool_call["name"])

Idempotency-key namespacing (v0.1.3+)

Cycles idempotency keys default to {prefix}-{tool_call_id} — deterministic per tool call so retries land on the same reservation. If your runtime can reuse short tool-call ids across runs (tc_1, tc_2, ...), set idempotency_namespace on the middleware to scope keys by run / workflow / tenant. Keys then become {prefix}-{namespace}-{tool_call_id}.

# Static — same namespace every call
gate = CyclesToolGate(
    client,
    subject=Subject(tenant="acme"),
    action=Action(kind="tool.call", name="send_email"),
    idempotency_namespace="run_2026_05_10_abc",
)

# Callable — receives the LangChain ToolCallRequest. Pull the run id from
# wherever your runtime carries it: request state, a contextvar, your own
# middleware, etc.
def my_run_id(request):
    return request.state["run_id"]

gate = CyclesToolGate(
    client,
    subject=Subject(tenant="acme"),
    action=Action(kind="tool.call", name="send_email"),
    idempotency_namespace=my_run_id,
)

CyclesFanOutGate.idempotency_namespace is the same shape; the callable receives the agent state instead of the tool-call request. Without idempotency_namespace, keys keep the v0.1.2 shape exactly — no behavior change.

Per-call opt-out: a callable that returns None (or empty string) for a particular call disables namespacing for that call only, producing the v0.1.2 shape {prefix}-{tool_call_id}. Useful when some calls should be globally scoped (admin / system tools) while others get run-scoped namespacing — branch on the request and return None from the unscoped path.

Errors in the callable propagate: if your callable raises, the exception surfaces from wrap_tool_call / before_model to the agent. This is intentional — fail-fast on a misconfigured callable rather than silently producing keys with no namespace. Wrap in try/except inside the callable if you want a fallback.

Denial messages

denial_message accepts a format string (placeholders: {reason}, {tool}, {decision}) or a callable receiving the CyclesResponse:

gate = CyclesToolGate(
    client,
    subject=...,
    action=...,
    denial_message="Cycles denied {tool}: {reason}",
)

Error handling

Denied tool calls return a ToolMessage with the denial content; the underlying handler is never invoked. The agent's model sees the denial as if a tool returned an error and can recover.
Reservation failures in "reserve" mode are returned as ToolMessage (handler not invoked).
Tool exceptions in "reserve" mode trigger an automatic release_reservation, then the exception propagates.
Async/sync mismatch raises TypeError — pair CyclesClient with .invoke() and AsyncCyclesClient with .ainvoke().

Settlement (commit) failures

In "reserve" and "decide+reserve" modes, the tool runs first, then the reservation is committed. If the commit call itself fails (network blip, server overload, etc.), the tool already ran — its side effect is real. You have two reasonable options, controlled by settlement_error_policy:

Policy	Behavior	When to choose
`"raise"` (default)	Propagate the commit exception to the agent. The tool's return value is lost.	Strict governance — no tool-level cost can go unaccounted.
`"log"`	Log a warning, return the tool result anyway. The reservation will eventually expire via TTL.	UX-first — keep the agent moving, accept best-effort accounting.

gate = CyclesToolGate(
    client,
    subject=...,
    action=...,
    mode="reserve",
    settlement_error_policy="log",   # opt out of strict default
)

Trade-off worth understanding: "raise" surfaces the commit failure as a tool exception, so a LangChain agent may retry — at which point the tool's side effect (e.g. an email send, a payment, a CRM write) repeats. Choose "log" if your tool's side effects are not safely idempotent on retry.

This only affects commit (success-path settlement); release on tool failure always logs and continues so the original tool exception wins.

Async support

Async middleware variants run automatically when the LangChain agent is invoked with .ainvoke(). Pass an AsyncCyclesClient:

from runcycles import AsyncCyclesClient

async_client = AsyncCyclesClient(CyclesConfig(...))
gate = CyclesToolGate(async_client, subject=..., action=..., mode="decide")

agent = create_agent(model="...", tools=[...], middleware=[gate])
await agent.ainvoke({"messages": [...]})

Examples

examples/tenant_budget_agent.py — single-tenant budget gate with risky-tool denial recovery.
examples/multi_agent_fanout.py — multi-agent / HITL flow with CyclesToolGate + CyclesFanOutGate + HumanInTheLoopMiddleware.

Known limitations (v0.1)

Reserve mode commits at the configured estimate, not actual usage. mode="reserve" and mode="decide+reserve" reserve the estimate, run the tool, then commit the same amount on success. Per-tool actual-cost instrumentation (analogous to runcycles.stream_reservation's cost_fn) is on the roadmap. Until then, set estimate to the worst-case spend per call you're willing to debit, or use mode="decide" if you only want policy gating without budget movement.
Model-call middleware is architecture-complete but commits at the configured estimate. CyclesModelGate (v0.1.5+) implements wrap_model_call with all three modes (decide / reserve / decide+reserve), but does not yet extract actual provider token usage from response metadata or support streaming settlement. Provider-specific usage extractors (OpenAI, Anthropic) and streaming integration are v0.2.0 scope. For precise per-call token cost capture today, use the BaseCallbackHandler recipe in cycles-client-python alongside CyclesModelGate (the two compose: middleware reserves, callback commits actuals).
Per-call subject only via the extractor form. Static Subject pins one tenant per middleware instance. For per-tenant/per-agent routing in a multi-tenant deployment, supply a SubjectExtractor callable.
Idempotency keys are deterministic only when tool_call_id is present. Keys take the shape {prefix}-{tool_call_id} so retries land on the same Cycles reservation. If the upstream omits tool_call_id, the middleware synthesizes a fresh missing-<hex> id (and logs a warning) — that path is non-deterministic across retries because the synthesis itself is random. Conformant LangChain runtimes always supply id.

Development

pip install -e ".[dev]"
pytest                          # all tests
pytest --cov=langchain_runcycles  # with coverage (gate: ≥95%)
ruff check . && ruff format
mypy langchain_runcycles

Documentation

LangChain integration page: https://docs.langchain.com/oss/python/integrations/middleware/runcycles (pending PR review)
Cycles protocol & SDK: https://runcycles.io
Architecture: see AUDIT.md

Requirements

Python 3.10+
runcycles >= 0.4.1
langchain >= 1.0, < 2.0
langchain-core >= 1.0, < 2.0

License

Apache-2.0. See LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

amavashev

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.2.3

May 11, 2026

0.2.2

May 11, 2026

0.2.1

May 11, 2026

0.2.0

May 11, 2026

This version

0.1.6

May 10, 2026

0.1.5

May 10, 2026

0.1.4

May 10, 2026

0.1.3

May 10, 2026

0.1.2

May 10, 2026

0.1.1

May 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langchain_runcycles-0.1.6.tar.gz (50.8 kB view details)

Uploaded May 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

langchain_runcycles-0.1.6-py3-none-any.whl (23.6 kB view details)

Uploaded May 10, 2026 Python 3

File details

Details for the file langchain_runcycles-0.1.6.tar.gz.

File metadata

Download URL: langchain_runcycles-0.1.6.tar.gz
Upload date: May 10, 2026
Size: 50.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for langchain_runcycles-0.1.6.tar.gz
Algorithm	Hash digest
SHA256	`66c2d15b8e66a85c871a0ab69c89c38a4d661feefa246762989cca62a78ced5a`
MD5	`8d4b114d6ab5386177b625d12a9fc990`
BLAKE2b-256	`bac917b12aa27dc9c68fb18bd81cee162709a1d5983d3dcf8e27b993499ccfaf`

See more details on using hashes here.

Provenance

The following attestation bundles were made for langchain_runcycles-0.1.6.tar.gz:

Publisher: python-publish.yml on runcycles/langchain-runcycles

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langchain_runcycles-0.1.6.tar.gz
- Subject digest: 66c2d15b8e66a85c871a0ab69c89c38a4d661feefa246762989cca62a78ced5a
- Sigstore transparency entry: 1496879660
- Sigstore integration time: May 10, 2026
Source repository:
- Permalink: runcycles/langchain-runcycles@2c0894f847a98dcb0afe5c492a275411fabcb248
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/runcycles
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@2c0894f847a98dcb0afe5c492a275411fabcb248
- Trigger Event: push

File details

Details for the file langchain_runcycles-0.1.6-py3-none-any.whl.

File metadata

Download URL: langchain_runcycles-0.1.6-py3-none-any.whl
Upload date: May 10, 2026
Size: 23.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for langchain_runcycles-0.1.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`15ff969686f785a3de19b9a9a0811c5e6997056f48a0a617e429bb51e5d42591`
MD5	`7da90fd9b1742eb070f9edd592b9afd0`
BLAKE2b-256	`34e6931a91444e47dcc304a43524267c64ae61bc4c251e696a5eb1c911162a68`

See more details on using hashes here.

Provenance

The following attestation bundles were made for langchain_runcycles-0.1.6-py3-none-any.whl:

Publisher: python-publish.yml on runcycles/langchain-runcycles

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langchain_runcycles-0.1.6-py3-none-any.whl
- Subject digest: 15ff969686f785a3de19b9a9a0811c5e6997056f48a0a617e429bb51e5d42591
- Sigstore transparency entry: 1496879690
- Sigstore integration time: May 10, 2026
Source repository:
- Permalink: runcycles/langchain-runcycles@2c0894f847a98dcb0afe5c492a275411fabcb248
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/runcycles
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@2c0894f847a98dcb0afe5c492a275411fabcb248
- Trigger Event: push

langchain-runcycles 0.1.6

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cycles for LangChain — AI agent middleware for budget and action authority

What's in the box

Installation

Quick Start

Middleware

CyclesModelGate (v0.1.5+)

CyclesToolGate

CyclesFanOutGate

Configuration

Subject

Action

Idempotency-key namespacing (v0.1.3+)

Denial messages

Error handling

Settlement (commit) failures

Async support

Examples

Known limitations (v0.1)

Development

Documentation

Requirements

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`CyclesModelGate` (v0.1.5+)

`CyclesToolGate`

`CyclesFanOutGate`