SDK and CLI for coordinating LLM agent teams with content-addressed context capsules and reproducible run reports.
Project description
coordpy
A Python-first agent development kit (ADK) for building LLM agents and
agent teams with a familiar Agent / Tool / Runner / Session /
State / Memory / Artifacts mental model — that gives you
content-addressed capsule audit, provenance, and replay for free
underneath. You write import-and-code ADK; every model call, tool call, and
handoff automatically seals into a hash-chained capsule you can re-verify
from bytes and replay against another model. The CLI is a secondary runtime
surface.
PyPI: coordpy-ai · import: coordpy · library front door: coordpy.adk
Why CoordPy
Multi-agent stacks usually pass context around as raw prompts and JSON. That scales until the prompt grows past the model's useful window and the run silently devolves into token cramming — and then when something breaks you can't reconstruct what each agent saw.
CoordPy attacks both edges of that frontier:
- Bounded context, not token cramming. Each agent sees the team
instructions plus the latest N visible handoffs (default
N=4), not the full transcript. Every team run reports the bounded-context savings in real tokens, not vibes. - Auditable handoffs. Every agent output seals into a
TEAM_HANDOFFcapsule with a content-derived ID, declared parents, a per-handoff byte/token budget, and witness fields (prompt_sha256,model_tag) that prove which prompt produced which output. - Replayable runs. A
team_result.jsonmanifest records each turn's prompt, generation params, and capsule CID.coordpy-team replayre-runs the same prompts on a different backend/model at the original sampling settings, so the audit story holds across models.
Install
Requires Python 3.10 or newer.
pip install coordpy-ai
Verify:
coordpy --version # coordpy 1.2.1 (coordpy.sdk.v3.43)
python -c "import coordpy; print(coordpy.__version__)"
The first parenthetical (coordpy.sdk.v3.43) is the
research-line tag exposed at coordpy.SDK_VERSION. It tracks
the underlying research programme and is independent of the
PyPI version.
The only required dependency is NumPy. Optional extras:
| Extra | Pulls in | When you want it |
|---|---|---|
[scientific] |
scipy, networkx |
numerical / graph helpers |
[crypto] |
cryptography |
optional signed-capsule paths |
[dl] |
torch, peft |
the deep-learning research path |
[heavy] |
hnswlib, transformers, RestrictedPython |
full research stack (heavy) |
[docker] |
docker |
Docker-backed sandbox |
[dev] |
ruff, black, mypy, pytest, build, twine |
contributing |
Quickstart — your first agent (Python)
The front door is import-and-code. Tools are plain typed functions (a
trailing tool_context is auto-injected and hidden from the model); a turn
is a stream of events; the answer is the one that satisfies
is_final_response():
from coordpy.adk import Agent, InMemoryRunner, ToolContext
from coordpy.llm_backend import backend_from_env
def lookup_population(city: str, tool_context: ToolContext) -> dict:
"""Return the population of a city."""
table = {"tokyo": 37_400_000, "paris": 11_100_000}
tool_context.state["last_city"] = city
pop = table.get(city.lower())
return ({"status": "success", "city": city, "population": pop}
if pop else {"status": "error", "error": f"unknown city {city!r}"})
assistant = Agent(
name="geo_assistant",
model=backend_from_env(), # any Ollama / OpenAI-compatible backend
instruction="Answer population questions; use the lookup tool.",
tools=[lookup_population],
output_key="answer",
)
runner = InMemoryRunner(agent=assistant, app_name="quickstart")
runner.session_service.create_session(
app_name="quickstart", user_id="u1", session_id="s1")
for event in runner.run(user_id="u1", session_id="s1",
new_message="How big is Tokyo?"):
if event.is_final_response():
print(event.text)
Set a backend with the COORDPY_* environment variables (local Ollama or any
OpenAI-compatible provider; see the env-var blocks below). For a
no-network version that needs no model, run
examples/adk_quickstart.py; for a small
multi-agent app (researcher → writer, with tools + artifacts + memory),
run python -m coordpy.adk.examples.research_assistant
(source).
Capsule audit, provenance & replay — for free
You wrote plain ADK code. Underneath, every step sealed into a content-addressed, hash-chained capsule trail you can verify from bytes:
view = runner.session_capsule_view("s1") # the coordpy.capsule_view.v1 chain
assert runner.verify_session("s1") # re-verified from bytes; tamper-evident
print(runner.session_root_cid("s1")) # deterministic for identical inputs
That is CoordPy's distinctive layer: an ADK that is auditable and replayable
by construction. coordpy.adk.Agent is the primary surface; the
higher-level preset teams and the CLI below build on the same capsule
machinery. (coordpy.adk.Agent is additive and separate from the legacy
coordpy.Agent / create_team surface, which is unchanged.)
Command-line interface (secondary)
The CLI is a secondary runtime surface. The bundled coordpy-team command
drives a curated preset against a local Ollama endpoint or any
OpenAI-compatible provider:
# Local Ollama (no API key needed)
export COORDPY_BACKEND=ollama
export COORDPY_MODEL=qwen2.5:14b
export COORDPY_OLLAMA_URL=http://localhost:11434
coordpy-team run \
--preset quant_desk \
--task examples/scenario_bullish.txt \
--out-dir /tmp/desk-run
That writes four files into /tmp/desk-run:
| file | purpose |
|---|---|
final_output.txt |
the final agent's plain-text answer |
team_capsule_view.json |
the sealed coordpy.capsule_view.v1 chain |
team_result.json |
the coordpy.team_result.v1 manifest used for replay |
team_report.md |
a polished Markdown summary (telemetry + savings + audit) |
Re-verify the chain from bytes alone:
coordpy-capsule verify-view --view /tmp/desk-run/team_capsule_view.json
Replay the same prompts on a different model and compare:
coordpy-team compare \
--preset quant_desk \
--task examples/scenario_bullish.txt \
--backend ollama --model qwen2.5:14b \
--replay-backend ollama --replay-model gemma2:9b \
--out-dir /tmp/desk-compare
The compare report shows whether the per-turn prompt SHAs match
and whether the synthesizer's parsed ACTION agrees across models.
Structured profile runs (RunSpec → RunReport)
The original structured-research path is still stable and useful for reproducible, profile-driven evaluation runs:
import coordpy
report = coordpy.run(coordpy.RunSpec(
profile="local_smoke",
out_dir="/tmp/cp-smoke",
))
assert report["readiness"]["ready"]
assert report["provenance"]["schema"] == "coordpy.provenance.v1"
assert report["capsules"]["chain_ok"]
print(report["capsules"]["root_cid"])
coordpy.run writes seven files into out_dir. The two you
will reach for most are product_report.json (the same shape
as the returned dict) and capsule_view.json (the sealed
capsule chain that coordpy-capsule verify re-hashes); the
others (provenance.json, meta_manifest.json,
readiness_verdict.json, product_summary.txt,
sweep_result.json) are always written and are useful for
audit. The root_cid is the SHA-256 of the run's RUN_REPORT
capsule; it is stable for a given input but differs between
runs because provenance includes a wall-clock timestamp.
Console scripts
| Command | Purpose |
|---|---|
coordpy-team run / replay / sweep / compare |
Drive an AgentTeam preset from the CLI; dump or replay a sealed bundle. The recommended usage front door for new users. |
coordpy-subject report / check / registry / tiers |
Orientation front door: print what CoordPy is, the stable-vs-experimental-vs-historical tier map, and run the hermetic stable-contract harness in one shot (also python -m coordpy.subject). |
coordpy-capsule view / verify / verify-view / audit |
Summarise / re-hash a sealed capsule chain (works on both team and RunSpec runs). |
coordpy --profile <name> --out-dir <dir> |
Run a research profile end to end and write the seven artefacts. |
coordpy-ci --report <product_report.json> |
Apply the CI pass/fail gate to a finished report. |
coordpy-import --jsonl <file> |
Audit a SWE-bench-Lite-style JSONL for compatibility. |
New to this repo? coordpy-subject is the fastest way to learn what is
stable (the SDK/CLI), what is canonical-experimental (the architecture
north-star lineage + the most recent research chain), and what is
historical — without reading the milestone archive. See
docs/reference/W144_COORDPY_SUBJECT_REGISTRY.json.
The research-profile chain is still useful for the structured
RunSpec → RunReport path:
coordpy --profile local_smoke --out-dir /tmp/cp-smoke
coordpy-ci --report /tmp/cp-smoke/product_report.json --min-pass-at-1 1.0
coordpy-capsule view --report /tmp/cp-smoke/product_report.json
coordpy-capsule verify --report /tmp/cp-smoke/product_report.json
To exercise coordpy-import against the bundled mini fixture
(no external file required):
FIXTURE=$(python -c 'import coordpy, os; print(os.path.join(os.path.dirname(coordpy.__file__), "_internal/tasks/data/swe_real_shape_mini.jsonl"))')
coordpy-import --jsonl "$FIXTURE" --out /tmp/audit.json
Agent teams in Python
AgentTeam.from_env reads its backend from COORDPY_*
environment variables and requires a configured backend to
run — either a reachable Ollama server or an OpenAI-compatible
API key. To run a team without a network, see the
SyntheticLLMClient example below.
from coordpy import AgentTeam, agent
team = AgentTeam.from_env(
[
agent("planner", "Break the task into 2-3 concrete steps."),
agent("researcher", "Gather the facts that matter."),
agent("writer", "Write the final answer for the user."),
],
model="gpt-4o-mini",
backend_name="openai",
team_instructions=(
"Reuse visible handoffs instead of restating the task."
),
max_visible_handoffs=3,
task_summary="Answer briefly using only the prior handoffs.",
)
result = team.run("Explain what coordpy does.")
print(result.final_output)
# Bounded-context savings vs naive token cramming.
print(result.cramming_estimate())
# Dump a four-file replayable bundle.
result.dump("/tmp/team-run")
For curated multi-role teams (quant desk, code review, research
writer) skip the role-prompt typing and use the bundled
coordpy.presets:
from coordpy import presets
team = presets.quant_desk_team()
result = team.run(open("examples/scenario_bullish.txt").read())
Local Ollama:
export COORDPY_BACKEND=ollama
export COORDPY_MODEL=qwen2.5:0.5b
export COORDPY_OLLAMA_URL=http://localhost:11434
OpenAI-compatible provider:
export COORDPY_BACKEND=openai
export COORDPY_MODEL=gpt-4o-mini
export COORDPY_API_KEY=...
# Optional, for non-default providers:
# export COORDPY_API_BASE_URL=https://your-provider.example/v1
To run a team without a network or an API key, pass a
SyntheticLLMClient directly:
from coordpy import create_team, agent
from coordpy.synthetic_llm import SyntheticLLMClient
team = create_team(
[agent("planner", "..."), agent("writer", "...")],
backend=SyntheticLLMClient(default_response="ok"),
)
print(team.run("hi").final_output)
The bundled examples/ ladder (01_quickstart.py,
02_quant_desk.py, 03_replay_and_audit.py) drives the same
public surface end-to-end against a real backend.
Public surface
| Surface | Stability |
|---|---|
coordpy.adk ADK (library front door): Agent/LlmAgent, Runner, InMemoryRunner, Session, State, BaseSessionService/InMemorySessionService, Artifact/InMemoryArtifactService, BaseMemoryService/InMemoryMemoryService, FunctionTool, ToolContext/CallbackContext, SequentialAgent/ParallelAgent/LoopAgent, Event/EventActions, ADK_SURFACE_SCHEMA |
Stable (v1) |
coordpy SDK: RunSpec, run, RunReport, SweepSpec, run_sweep, CoordPyConfig, Agent, AgentTurn, ActionDecision, AgentTeam, TeamResult, agent, create_team, replay_team_result, presets, TEAM_RESULT_SCHEMA, profiles, report, ci_gate, import_data, extensions, capsule primitives, schema constants, OpenAICompatibleBackend, OllamaBackend, backend_from_env |
Stable |
Console scripts: coordpy-team, coordpy-capsule, coordpy, coordpy-import, coordpy-ci |
Stable |
On-disk schemas: coordpy.capsule_view.v1, coordpy.team_result.v1, coordpy.provenance.v1, phase45.product_report.v2 |
Stable |
coordpy.__experimental__ (a tuple of names exported under that attribute): research-grade trust-adjudication primitives and the multi-agent coordination ladder behind the research papers |
Experimental, may move or disappear between releases |
The experimental surface ships in the same wheel for
reproducibility and audit. Pin against
coordpy.__experimental__ if you depend on it.
Limitations
coordpyworks at the capsule layer. It does not provide transformer-internal trust transfer or hidden-state access.- The bundled cross-host evidence comes from the small two-node lab where it was generated. Behaviour at larger scales has not been measured.
- Not peer-reviewed. The code, tests, results notes, and theorem registry are public so they can be challenged.
Where to go next
- Contributing:
CONTRIBUTING.md - Releasing to PyPI:
RELEASING.md - Security policy:
SECURITY.md - Changelog:
CHANGELOG.md
License
MIT. See LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file coordpy_ai-1.2.1.tar.gz.
File metadata
- Download URL: coordpy_ai-1.2.1.tar.gz
- Upload date:
- Size: 4.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ee2f7c8074407c8556406cd553add774e9c37c77729301017e663c09d36b98d9
|
|
| MD5 |
35c9eea9808c55a20652a4d933abc8f0
|
|
| BLAKE2b-256 |
025091020c97806f12aa69d8a12aa81e00466b37b4b8a65724e346c71c7c61f2
|
Provenance
The following attestation bundles were made for coordpy_ai-1.2.1.tar.gz:
Publisher:
release.yml on adotdong29/CoordPy
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
coordpy_ai-1.2.1.tar.gz -
Subject digest:
ee2f7c8074407c8556406cd553add774e9c37c77729301017e663c09d36b98d9 - Sigstore transparency entry: 1872936087
- Sigstore integration time:
-
Permalink:
adotdong29/CoordPy@4d3b5d2409af5b4473e4f514485d3d163649ef47 -
Branch / Tag:
refs/tags/v1.2.1 - Owner: https://github.com/adotdong29
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@4d3b5d2409af5b4473e4f514485d3d163649ef47 -
Trigger Event:
push
-
Statement type:
File details
Details for the file coordpy_ai-1.2.1-py3-none-any.whl.
File metadata
- Download URL: coordpy_ai-1.2.1-py3-none-any.whl
- Upload date:
- Size: 4.7 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b376303d43dad45c7b65b2e899c19bb29ea7f9df96f10cf822265ebf834b62bf
|
|
| MD5 |
c1f9c9d550561dcd1dc88ca278f5c0d8
|
|
| BLAKE2b-256 |
2e6b7004c69adaea1caffc1e56418af4bbd6717cb6b578ef8555e359533a2322
|
Provenance
The following attestation bundles were made for coordpy_ai-1.2.1-py3-none-any.whl:
Publisher:
release.yml on adotdong29/CoordPy
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
coordpy_ai-1.2.1-py3-none-any.whl -
Subject digest:
b376303d43dad45c7b65b2e899c19bb29ea7f9df96f10cf822265ebf834b62bf - Sigstore transparency entry: 1872936247
- Sigstore integration time:
-
Permalink:
adotdong29/CoordPy@4d3b5d2409af5b4473e4f514485d3d163649ef47 -
Branch / Tag:
refs/tags/v1.2.1 - Owner: https://github.com/adotdong29
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@4d3b5d2409af5b4473e4f514485d3d163649ef47 -
Trigger Event:
push
-
Statement type: