
Choreo-Mini

A lightweight Python meta-framework for building, experimenting with, and orchestrating LLM-based agents.

Modern agent frameworks — LangGraph, CrewAI, AutoGen — each solve similar orchestration problems but introduce fragmented abstractions and steep learning curves. Choreo-Mini provides a Python-native developer experience that lets you prototype agent workflows quickly, run genuine multi-agent experiments, and compile to a supported target runtime (LangGraph, CrewAI, or AutoGen) when you're ready.

Instead of forcing you to commit to a single framework, Choreo-Mini acts as an orchestration meta-layer: express your workflow once in plain Python, and compile it to your target runtime if needed.


What makes Choreo-Mini different

Feature                              LangGraph  CrewAI  AutoGen  Choreo-Mini
Subclass-based workflow definition       ✗         ✗       ✗         ✓
Epistemic belief state per agent         ✗         ✗       ✗         ✓
Built-in MARL episode loop               ✗         ✗       ✗         ✓
Nash convergence detection               ✗         ✗       ✗         ✓
Export to other frameworks               ✗         ✗       ✗         ✓
Any OpenAI-compatible endpoint           ✗         ✗       ✗         ✓

How it works

Your Python Workflow  (subclass Workflow, define AgentNodes)
        │
        ├── Run directly with any OpenAI-compatible endpoint
        │
        ├── Run MARL experiments (Episode loop, HUF/reward, Nash convergence)
        │
        └── Compile to another runtime (optional)
                    │
              AST Parser + Jinja2
                    │
           ┌────────┼────────┐
           ▼        ▼        ▼
       LangGraph  CrewAI  AutoGen

Core concepts

Workflow subclass pattern

Define agents as instance attributes — no manual state wiring:

from choreo_mini import Workflow, AgentNode, LLM

class TradingAdvisor(Workflow):
    def __init__(self, llm):
        super().__init__("trading-advisor")
        self.analyst  = AgentNode(self, "Analyst",  role="Trade data analyst", llm=llm)
        self.advisor  = AgentNode(self, "Advisor",  role="Senior policy advisor", llm=llm)

    def advise(self, question: str) -> str:
        analysis = self.send("Analyst", question)
        self.beliefs.observe("last_question", question, confidence=1.0)
        reply = self.send("Advisor", analysis.content)
        return reply.content
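
Instantiation follows the same pattern as the quick start further down; the endpoint, model, and question below are placeholders:

llm = LLM(api_key="sk-...", endpoint="https://api.openai.com", model="gpt-4o")
wf  = TradingAdvisor(llm)
print(wf.advise("Should we expect aluminium tariffs to change this quarter?"))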

Epistemic belief state

Every agent and workflow has a confidence-weighted belief map — a capability not found in LangGraph, CrewAI, or AutoGen:

# workflow-level world beliefs
wf.beliefs.observe("tariff_rate", 0.12, confidence=0.9, step=round_)

# per-agent theory-of-mind beliefs
wf.get_agent_belief("Analyst").observe_agent("Canada", "stance", "defensive", confidence=0.7)

# decay beliefs at end of each round (model passage of time)
wf.decay_all_beliefs(factor=0.95)

# snapshot for logging or passing as context to the LLM
print(wf.beliefs.snapshot())

Any OpenAI-compatible endpoint

One class, any provider — no SDK lock-in:

from choreo_mini import LLM, CustomLLM

# OpenAI
llm = LLM(api_key="sk-...", endpoint="https://api.openai.com", model="gpt-4o")

# Anthropic (via OpenAI-compat headers)
llm = LLM(
    endpoint="https://api.anthropic.com",
    model="claude-opus-4-5",
    headers={"x-api-key": "sk-ant-...", "anthropic-version": "2023-06-01"},
)

# Local Ollama (no auth)
llm = LLM(endpoint="http://localhost:11434", model="llama3.2")

# Groq, Together, vLLM, LM Studio, ...
llm = LLM(api_key="...", endpoint="https://api.groq.com/openai", model="llama-3.3-70b-versatile")

# Any callable — great for tests, local models, or rule-based stubs
llm = CustomLLM(lambda prompt, context=None, **kw: f"echo: {prompt}")

The LLM class normalises endpoint URLs (strips accidental /v1 suffixes), omits the Authorization header when no API key is set, enforces a configurable timeout (default 60 s), and surfaces the full API error body on failures.
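
Given that normalisation, for example, these two configurations resolve to the same endpoint:

# Equivalent: the accidental /v1 suffix on the second is stripped
llm_a = LLM(api_key="sk-...", endpoint="https://api.openai.com", model="gpt-4o")
llm_b = LLM(api_key="sk-...", endpoint="https://api.openai.com/v1", model="gpt-4o")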

MARL episode loop

Run multi-agent reinforcement learning experiments with reward functions and Nash convergence detection baked in:

from choreo_mini import Episode, nash_convergence_detector

ep = Episode(
    agents={
        "USA":    usa_workflow.propose,
        "Canada": canada_workflow.propose,
        "Mexico": mexico_workflow.propose,
    },
    env={"tariff_rate": 0.12, "market_access": 0.65, ...},
    reward_fn=huf_reward,           # your Human Utility Function
    env_update_fn=negotiation_step, # how proposals combine into new state
    termination_fn=nash_convergence_detector(window=5, reward_threshold=0.005),
    max_rounds=40,
)

trajectory = ep.run()
print(ep.summary())
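
The reward_fn and env_update_fn are user-supplied callables. As a minimal sketch only (the call signature and the weights below are assumptions for illustration, not the shipped experiment's values), a HUF reward could be a preference-weighted sum over the env parameters:

# Illustrative only: weights and the (agent, env) signature are assumptions.
PREFS = {
    "USA":    {"tariff_rate": -1.0, "market_access": 1.0},
    "Canada": {"market_access": 1.0, "env_score": 1.0},
    "Mexico": {"tariff_rate": -1.0, "labor_compliance": 0.5},
}

def huf_reward(agent: str, env: dict) -> float:
    weights = PREFS[agent]
    return sum(w * env.get(param, 0.0) for param, w in weights.items())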

Each EpisodeStep records the env snapshot, every agent's action, and every agent's reward — ready to feed a policy-update step.
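
A rough sketch of consuming that trajectory (the step attribute names env, actions, and rewards are assumed from the description above, not a confirmed API):

for step in trajectory:
    state = step.env                          # env snapshot for this round
    for name, action in step.actions.items():
        reward = step.rewards[name]
        # feed (state, action, reward) into your policy update here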


MARL experiment — CUSMA/USMCA HUF maximisation

examples/marl_huf_experiment.py ships a ready-to-run multi-agent experiment where USA, Canada, and Mexico negotiate five trade parameters to maximise their national Human Utility Function scores:

Each country holds a directional preference (raise, lower, or stay neutral) on each of the five parameters: tariff_rate, market_access, labor_compliance, rules_of_origin, and env_score.

python examples/marl_huf_experiment.py

Sample output:

======================================================================
  CUSMA/USMCA MARL — HUF Maximisation Experiment
======================================================================
  Round      USA   Canada   Mexico   GlobalHUF
  --------------------------------------------
   init   0.6700   0.6830   0.5290      1.8820
      1   0.6700   0.6830   0.5290      1.8820
      ...
     40   0.7840   1.0000   0.6650      2.4490
  --------------------------------------------
  Converged after 40 rounds

  Final trade parameters
  tariff_rate           0.1200  →  0.0000  (▼ 0.1200)
  market_access         0.6500  →  1.0000  (▲ 0.3500)
  labor_compliance      0.5500  →  0.7500  (▲ 0.2000)
  rules_of_origin       0.6200  →  0.6200  (─ 0.0000)
  env_score             0.5800  →  1.0000  (▲ 0.4200)

  Global HUF: 1.8820  →  2.4500  (+0.5680)

Swap in a real LLM to have agents reason over the parameters in natural language:

llm = LLM(api_key="sk-...", endpoint="https://api.openai.com", model="gpt-4o")
usa = USAWorkflow(llm=llm)

Installation

pip install choreo-mini

Or from source:

git clone https://github.com/Sivasathivel/Choreo-mini
cd Choreo-mini
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
pytest

Quick start — single agent

from choreo_mini import Workflow, AgentNode, LLM

class Advisor(Workflow):
    def __init__(self, llm):
        super().__init__("advisor")
        self.agent = AgentNode(self, "Advisor", role="Senior trade policy analyst", llm=llm)

    def ask(self, question: str) -> str:
        return self.send("Advisor", question).content

llm = LLM(api_key="sk-...", endpoint="https://api.openai.com", model="gpt-4o-mini")
wf  = Advisor(llm)
print(wf.ask("What are the three main pillars of CUSMA/USMCA?"))

Compile to another runtime (optional)

When you're ready to migrate to LangGraph, CrewAI, or AutoGen, compile your workflow with the CLI:

choreo_mini -f examples/foo2.py -b langgraph -o output/foo2_langgraph.py
choreo_mini -f examples/foo2.py -b crewai    -o output/foo2_crewai.py
choreo_mini -f examples/foo2.py -b autogen   -o output/foo2_autogen.py

OpenAI-compatible remote endpoint example:

choreo_mini -f examples/customer_ops_url.py -b langgraph -o output/customer_ops_langgraph.py

Observability

When enable_profiling=True, every agent call is instrumented automatically:

Metric          Description
calls           Number of times the agent was invoked
total_latency   Cumulative wall-clock inference time (seconds)
total_memory    Cumulative memory delta across calls (bytes)
history         Full conversation history per agent

wf = MyWorkflow(enable_profiling=True, llm=llm)
wf.run("some input")
print(wf.get_profile("Analyst"))
# {"Analyst": {"calls": 3, "total_latency": 1.42, "total_memory": 8192}}

Status

Choreo-Mini is an actively developed prototype.

  • Core runtime (Workflow, AgentNode, LLM, BeliefState, Episode) — stable, 140 tests passing
  • LangGraph backend — most mature; supports branching, loop budgets, service node dispatch
  • CrewAI / AutoGen backends — structurally correct scaffolding; runtime fidelity in progress

Roadmap

  • LLM-driven MARL strategies — replace fixed-delta strategies with agents that reason over belief states to propose actions
  • Policy update hooks — plug gradient or tabular RL updates directly into the Episode trajectory
  • MCP server support — expose workflows as tools/resources/prompts via Model Context Protocol
  • A2A (agent-to-agent) — explicit handoffs, delegation, and structured cross-agent messaging
  • CrewAI / AutoGen parity — deeper runtime fidelity for routing, loops, and state transitions
  • CLI improvements — conversion diagnostics, IR inspection output

Author

Sivasathivel Kandasamy


License

This project is released under the Choreo-Mini Source License.

What is allowed:

  • Use choreo-mini as a library or dependency inside any project, including commercial applications and internal enterprise deployments.
  • Modify the source and contribute back.
  • Keep your larger application closed source when it only depends on choreo-mini and is not itself a derivative of choreo-mini.
  • Ship a proprietary larger product that uses unmodified choreo-mini as a component, with license notices preserved.

What is not allowed:

  • Building and selling a product, plugin, extension, or SaaS where choreo-mini is the core value being offered by a third party.
  • Distributing or hosting a modified derivative of choreo-mini without releasing the derivative source under the same license.
  • Selling paid access to choreo-mini or a derivative API/service, even when bundled with other paid features.

Other terms:

  • Citation is required in public materials and user-facing interfaces (docs, demos, public repos, benchmark reports, websites, or service UI).
  • Contributors grant the author relicensing rights.
  • The author reserves the right to publish enterprise/commercial editions.

See CONTRIBUTING.md for contribution terms.
