Python agent runtime framework — context management, multi-agent orchestration, and self-improvement for autonomous AI agents

These details have not been verified by PyPI

Project links

Project description

Loom

Build stateful agents with context control, safety boundaries, and extensible runtime capabilities.

English | 中文

Wiki | Quick Start | PyPI

Loom exposes one public API centered on Agent. You configure an agent once, then use run(), stream(), and session() to build applications with multi-step execution, tools, heartbeat monitoring, safety rules, and session-scoped state.

Quick Start

pip install loom-agent
export ANTHROPIC_API_KEY=sk-ant-...

import asyncio
from loom import (
    AgentConfig,
    GenerationConfig,
    ModelRef,
    RunContext,
    SessionConfig,
    create_agent,
    tool,
)
from loom.config import (
    MemoryBackend,
    MemoryConfig,
    PolicyConfig,
    PolicyContext,
    RuntimeConfig,
    RuntimeFeatures,
    RuntimeLimits,
    ToolAccessPolicy,
    ToolPolicy,
    ToolRateLimitPolicy,
)


@tool(description="Search documentation", read_only=True)
async def search_docs(query: str) -> str:
    return f"Results for: {query}"


async def main():
    agent = create_agent(
        AgentConfig(
            model=ModelRef.anthropic("claude-sonnet-4"),
            instructions="You are a concise coding assistant",
            tools=[search_docs],
            policy=PolicyConfig(
                tools=ToolPolicy(
                    access=ToolAccessPolicy(
                        allow=["search_docs"],
                        read_only_only=True,
                    ),
                    rate_limits=ToolRateLimitPolicy(max_calls_per_minute=60),
                ),
                context=PolicyContext.named("repo"),
            ),
            memory=MemoryConfig(backend=MemoryBackend.in_memory()),
            generation=GenerationConfig(max_output_tokens=512),
            runtime=RuntimeConfig(
                limits=RuntimeLimits(max_iterations=32),
                features=RuntimeFeatures(enable_safety=True),
            ),
        )
    )

    result = await agent.run("Summarize this repository")
    print(result.output)


asyncio.run(main())

Import rule:

Use from loom import ... for the primary application path.
Use from loom.config import ... for advanced configuration objects.
Use from loom.runtime import ... for runtime states, runs, and sessions when you need them directly.

Sessions

Use session() when the application needs continuity across runs.

session = agent.session(SessionConfig(id="demo-user"))

first = await session.run("List three qualities of a good API")
second = await session.run(
    "Summarize the previous answer in one sentence",
    context=RunContext(inputs={"previous_answer": first.output}),
)

print(second.output)

Knowledge Evidence

Use KnowledgeQuery to resolve stable evidence, then attach it to one run through RunContext.

from loom import KnowledgeQuery

knowledge = agent.resolve_knowledge(
    KnowledgeQuery(
        text="What are the production deployment rules?",
        goal="Summarize deployment policy",
        top_k=3,
    )
)

result = await agent.run(
    "Summarize deployment policy",
    context=RunContext(knowledge=knowledge),
)

Streaming, Events, and Artifacts

run = agent.session(SessionConfig(id="stream-demo")).start("Inspect the project layout")

async for event in run.events():
    print(event.type, event.payload)

result = await run.wait()
artifacts = await run.artifacts()

Harness — Long-Running Agent Orchestration

Loom implements the Harness pattern for long-running, quality-controlled agent tasks. Three mechanisms work together:

1 · Context Reset with Structured Handoff

Every time the context pressure (ρ) reaches the renewal threshold, ContextRenewer performs a full context reset and produces a HandoffArtifact — a structured document that cold-starts the next sprint with full situational awareness.

from loom.types import HandoffArtifact

# HandoffArtifact is produced automatically by ContextManager.renew()
# and is accessible via context_manager.last_handoff
handoff = context_manager.last_handoff

print(handoff.goal)            # original goal, never compressed
print(handoff.sprint)          # which renewal this is
print(handoff.progress_summary)
print(handoff.open_tasks)      # remaining plan steps carried forward

# Inject into the next sprint's system prompt
system_msg = handoff.to_system_prompt()

Unlike plain context compression, HandoffArtifact explicitly separates what was accomplished, what still needs to be done, and the goal that never changes — so the agent never loses its bearings after a context reset.

2 · Generator–Evaluator Loop (GAN-style)

GeneratorEvaluatorLoop separates generation from judgment to eliminate self-praise bias. The Evaluator first negotiates verifiable success criteria (SprintContract), then scores the Generator's output in each round. The loop continues until PASS or max_sprints is exhausted.

from loom.orchestration import GeneratorEvaluatorLoop, SprintContract

loop = GeneratorEvaluatorLoop(
    generator=gen_manager,
    evaluator=eval_manager,
    event_bus=bus,          # optional — publishes sprint.passed / sprint.failed
)

results = await loop.run("Build a REST API for user authentication", max_sprints=5)

for r in results:
    print(f"Sprint {r.sprint}: {'PASS' if r.passed else 'FAIL'}")
    print(f"  Criteria: {r.contract.criteria}")
    print(f"  Critique: {r.critique}")

Each SprintResult carries:

contract — the SprintContract with criteria agreed before this sprint
output — what the Generator produced
critique — the Evaluator's judgment (fed into the next sprint's prompt on FAIL)
passed — whether this sprint cleared the bar

3 · Sprint Contract — Negotiated Success Criteria

Before each sprint, the Evaluator generates explicit, verifiable criteria. This prevents the Generator from gaming the evaluation, and makes quality gates inspectable and auditable.

from loom.orchestration import SprintContract

contract = SprintContract(
    sprint=1,
    goal="Build a REST API for user auth",
    criteria=[
        "POST /register returns 201 with a user ID",
        "POST /login returns a signed JWT on success",
        "Invalid credentials return 401, not 500",
    ],
    eval_tools=["pytest", "httpx"],
)

AgentHarness — One-Stop Entry Point

AgentHarness wires all three mechanisms into a single call: an optional Planner expands the brief into a spec, then the Generator–Evaluator loop refines the output.

from loom.orchestration import AgentHarness, HarnessResult

harness = AgentHarness(
    generator=gen_manager,
    evaluator=eval_manager,   # omit for single-shot mode
    planner=plan_manager,     # omit to skip spec expansion
    max_sprints=5,
    event_bus=bus,
)

result: HarnessResult = await harness.run(
    "Build a CLI tool that converts CSV to JSON with streaming support"
)

print(result.spec)          # planner-expanded specification
print(result.output)        # final generator output
print(result.passed)        # did the evaluator approve?
print(result.sprints)       # how many rounds were needed
print(result.critique)      # last evaluator feedback

HarnessResult fields:

Field	Type	Description
`spec`	`str`	Planner-expanded brief, or original brief if no planner
`output`	`str`	Final Generator output
`passed`	`bool`	True if Evaluator approved the last sprint
`sprints`	`int`	Total sprints executed
`critique`	`str`	Last Evaluator feedback
`sprint_results`	`list[SprintResult]`	Full per-sprint history

Extensible Configuration

Loom keeps configuration extensible through stable config objects on the public API:

AgentConfig: top-level stable entry for one agent
knowledge: reusable knowledge sources for evidence and retrieval
policy: tool access controls, context-specific governance, rate limits
memory: session-level memory options
heartbeat: watch sources, interval, entropy threshold
safety_rules: veto rules for dangerous operations
runtime: engine-level limits and features

Example:

agent = create_agent(
    AgentConfig(
        model=ModelRef.anthropic("claude-sonnet-4"),
        instructions="You are a deployment assistant",
        knowledge=[
            KnowledgeSource.inline(
                "deployment-docs",
                [
                    KnowledgeDocument(content="Staging deploys are automatic.", title="Staging"),
                    KnowledgeDocument(content="Production deploys require approval.", title="Production"),
                ],
                description="Internal deployment notes",
            )
        ],
        policy=PolicyConfig(
            context=PolicyContext.named("deployment"),
            tools=ToolPolicy(
                access=ToolAccessPolicy(allow=["deploy"]),
                rate_limits=ToolRateLimitPolicy(max_calls_per_minute=10),
            ),
        ),
        memory=MemoryConfig(backend=MemoryBackend.in_memory()),
        heartbeat=HeartbeatConfig(
            interval=5.0,
            interrupt_policy=HeartbeatInterruptPolicy(),
            watch_sources=[
                WatchConfig.filesystem(
                    paths=["./src"],
                    method=FilesystemWatchMethod.HASH,
                ),
                WatchConfig.resource(
                    thresholds=ResourceThresholds(cpu_pct=80.0),
                ),
            ],
        ),
        runtime=RuntimeConfig(
            limits=RuntimeLimits(max_iterations=24, max_context_tokens=120000),
        ),
        safety_rules=[
            SafetyRule.when_argument_equals(
                name="no_prod_deploy",
                reason="Production deployment is blocked",
                tool_name="deploy",
                argument="env",
                value="production",
            )
        ],
    )
)

Architecture

loom/agent.py           ← Public agent API
loom/runtime/           ← Sessions, runs, loop (Reason→Act→Observe→Δ), heartbeat
loom/context/           ← Context partitions, compression, renewal + HandoffArtifact
loom/memory/            ← Session, working, semantic, persistent memory
loom/tools/             ← Tool registry, executor, governance pipeline
loom/orchestration/     ← Task planning, multi-agent coordination,
│                         GeneratorEvaluatorLoop, AgentHarness, SprintContract
loom/safety/            ← Permissions, hooks, veto authority
loom/ecosystem/         ← Skills, plugins, MCP bridge, activation
loom/evolution/         ← Self-improvement strategies
loom/providers/         ← Anthropic, OpenAI, Gemini, Qwen, Ollama
loom/types/             ← Core types incl. HandoffArtifact, SprintContract

Capabilities

Category	What Loom provides
Execution loop	Structured Reason → Act → Observe → Δ with automatic state transitions
Context management	Five-partition context, pressure-based compression (snip / micro / collapse / auto), forced renewal at ρ ≥ 1.0
Structured handoff	`HandoffArtifact` carries goal, progress, open tasks, and context snapshot across context resets
Quality iteration	`GeneratorEvaluatorLoop` runs GAN-style sprints with negotiated `SprintContract` criteria
Harness	`AgentHarness` wires Planner → Generator ⇌ Evaluator into one `await harness.run(brief)` call
Multi-agent	`SubAgentManager`, `Coordinator`, `TaskPlanner` for parallel and sequential task graphs
Event bus	`CoordinationEventBus` with entropy-gated publish, sprint events, topic subscriptions
Safety	Veto authority, permission guards, pre/post tool hooks, `safety_rules`
Heartbeat	Background filesystem, resource, and MF-events monitoring with urgency classification
Knowledge	Evidence packs, semantic retrieval, citation tracking across context resets
Sessions	Scoped state, streaming events, artifact collection
Providers	Anthropic, OpenAI, Gemini, Qwen, Ollama with shared client pooling
Ecosystem	Skills, plugins, MCP server bridge

Runtime Reliability

Hierarchical errors make failures easier to classify and handle:
- ProviderError → ProviderUnavailableError / RateLimitError
- ToolError → ToolNotFoundError / ToolPermissionError / ToolExecutionError
- ContextError → ContextOverflowError
Runtime engine emits tool_result events; evolution feedback subscribes via FeedbackLoop.subscribe_to_engine(...) for decoupled reliability tracking.
OpenAI, Anthropic, and Gemini providers support shared client pooling to reuse SDK clients under concurrent load.

License

Apache 2.0 with Commons Clause. See LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.8.2

Apr 30, 2026

0.8.1

Apr 28, 2026

0.8.0

Apr 27, 2026

This version

0.7.4

Apr 19, 2026

0.7.3

Apr 11, 2026

0.7.2

Apr 8, 2026

0.7.1

Apr 6, 2026

0.7.0

Mar 26, 2026

0.6.6

Mar 26, 2026

0.6.5

Mar 5, 2026

0.6.4

Mar 2, 2026

0.6.3

Feb 27, 2026

0.6.2

Feb 24, 2026

0.6.1

Feb 24, 2026

0.6.0

Feb 21, 2026

0.5.7

Feb 13, 2026

0.5.6

Feb 12, 2026

0.5.5

Feb 10, 2026

0.5.4

Feb 10, 2026

0.5.3

Feb 6, 2026

0.5.2

Feb 6, 2026

0.5.1

Feb 5, 2026

0.5.0

Feb 2, 2026

0.4.6

Feb 2, 2026

0.4.5

Jan 29, 2026

0.4.4

Jan 28, 2026

0.4.3

Jan 27, 2026

0.4.2

Jan 25, 2026

0.4.1

Jan 20, 2026

0.4.0

Jan 19, 2026

0.4.0a0 pre-release

Jan 19, 2026

0.3.9

Jan 16, 2026

0.3.8

Jan 14, 2026

0.3.7

Jan 14, 2026

0.3.6

Jan 5, 2026

0.3.4

Dec 27, 2025

0.3.3

Dec 24, 2025

0.3.2

Dec 23, 2025

0.3.1

Dec 23, 2025

0.3.0

Dec 23, 2025

0.2.1

Dec 22, 2025

0.2.0

Dec 20, 2025

0.1.10

Dec 15, 2025

0.1.9

Dec 15, 2025

0.1.8

Dec 15, 2025

0.1.6

Dec 14, 2025

0.1.1

Dec 12, 2025

0.1.0

Dec 10, 2025

0.0.9

Dec 9, 2025

0.0.8

Dec 8, 2025

0.0.7

Nov 23, 2025

0.0.6

Nov 23, 2025

0.0.5

Oct 31, 2025

0.0.4

Oct 27, 2025

0.0.3

Oct 27, 2025

0.0.2

Oct 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

loom_agent-0.7.4.tar.gz (128.8 kB view details)

Uploaded Apr 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

loom_agent-0.7.4-py3-none-any.whl (167.5 kB view details)

Uploaded Apr 19, 2026 Python 3

File details

Details for the file loom_agent-0.7.4.tar.gz.

File metadata

Download URL: loom_agent-0.7.4.tar.gz
Upload date: Apr 19, 2026
Size: 128.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for loom_agent-0.7.4.tar.gz
Algorithm	Hash digest
SHA256	`5523d102d19107f787e14d0a1a68bf814c93850e8c6415c1c58d91aa4e9229f2`
MD5	`854b3070a0b585495dbe67d7c1a5f36b`
BLAKE2b-256	`977745bb4693e85b415f7c6cb727c33af179ea3f1d7f9b637160077ca00434a4`

See more details on using hashes here.

File details

Details for the file loom_agent-0.7.4-py3-none-any.whl.

File metadata

Download URL: loom_agent-0.7.4-py3-none-any.whl
Upload date: Apr 19, 2026
Size: 167.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for loom_agent-0.7.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e0d174d2aeab9d13fe4efe881cd59aaa46a8a074b1d1d801c46f735af44cc86c`
MD5	`8a870655f54b9d1b120c34f0c81f24f7`
BLAKE2b-256	`db7a589fa88f8e561cb282a30dd586114de110ae2e2c557aaa636093a2e9259b`

See more details on using hashes here.

loom-agent 0.7.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Loom

Quick Start

Sessions

Knowledge Evidence

Streaming, Events, and Artifacts

Harness — Long-Running Agent Orchestration

1 · Context Reset with Structured Handoff

2 · Generator–Evaluator Loop (GAN-style)

3 · Sprint Contract — Negotiated Success Criteria

AgentHarness — One-Stop Entry Point

Extensible Configuration

Architecture

Capabilities

Runtime Reliability

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes