Skip to main content

Quartermaster — modular AI agent orchestration framework. Install this to get all packages.

Project description

Quartermaster SDK

Modular AI agent orchestration framework by MindMade.

Quartermaster lets you build AI agent workflows as directed graphs — define nodes (LLM calls, decisions, user input, tools), connect them with edges, and execute them with a pluggable engine.

What's new in v0.4.0

  • Application timeouts -- qm.configure(timeout=, connect_timeout=, read_timeout=) + per-call overrides.
  • Stream cancellation -- with qm.run.stream(...) as stream: context-manager; qm.Cancelled + ctx.cancelled.
  • Native Ollama /api/chat for tool calls (auto-detect) + ollama_tool_protocol= config.
  • Per-node tool scoping -- agent(tools=[...]) strictly enforced; tool_scope="permissive" escape hatch.
  • Inline @tool callables -- agent(tools=[my_func]) accepts bare callables.
  • instruction_form -- Gemma preamble robustness + dict-schema support.
  • qm.configure(telemetry=True) -- sugar for qm.telemetry.instrument().
  • qm.configure(auto_redact_pii=True) -- automatic PII redaction policy.
  • Trace.from_jsonl() / assert_traces_equal() -- round-trip trace serialisation for golden-file tests.
  • SessionStore protocol -- qm.run(graph, input, session=store, session_id=...) + InMemorySessionStore.
  • TypedEvent -- Pydantic base class for typed custom events.
  • python -m quartermaster_sdk.lint check -- static graph linter (QM001--QM005).
  • CircuitBreaker -- CircuitBreaker(failure_threshold=, recovery_timeout=) + CircuitOpenError.
  • Local-GPU cost tracker -- duration_seconds + local_gpu_cost_per_hour support.

Quick Install

# Core framework (graph + providers + tools + nodes + engine)
pip install quartermaster-sdk

# With OpenAI
pip install quartermaster-sdk[openai]

# With everything (all providers, all tools, MCP client, code runner)
pip install quartermaster-sdk[all]

Quick Start (local Ollama, zero config)

ollama pull gemma4:26b      # or any model you've pulled
import quartermaster_sdk as qm

qm.configure(
    provider="ollama",
    base_url="http://localhost:11434",   # or set $OLLAMA_HOST
    default_model="gemma4:26b",
)

# Graph() auto-creates Start; .end() / .build() are both optional when running via qm.run().
result = qm.run(qm.Graph("chat").user().agent(), "Pozdravljen, koliko je ura?")
print(result.text)

Single-shot helpers (no graph visible)

# prompt → str
reply = qm.instruction(system="Respond in Slovenian.", user="Pozdravljen!")

# prompt → Pydantic model (typed JSON extraction)
from pydantic import BaseModel

class Classification(BaseModel):
    category: str
    priority: str

data = qm.instruction_form(Classification, system="Classify.", user=email_body)

Reading specific node outputs with capture_as=

graph = (
    qm.Graph("enrich")
    .agent("Research", tools=[...], capture_as="notes")
    .instruction_form(CustomerData, system="Extract.", capture_as="data")
)
result = qm.run(graph, "VT-Treyd Slovenija")
result["notes"].output_text    # agent's free-text research
result["data"].output_text     # extracted JSON

Streaming (v0.3.0 filtered iterators)

qm.run.stream(...) returns a wrapper you can iterate raw or pipe through a filter — one helper per chunk family:

Filter Yields Use for
.tokens() str (the token text) Typewriter UI — just the text
.tool_calls() ToolCallChunk Dashboard cards: call.tool, call.args
.progress() ProgressChunk prog.message, prog.percent, prog.data
.custom(name=...) CustomChunk Application-defined milestones
(raw for chunk in ...) Chunk union Debugging, pass-through consumers
# Typewriter effect -- tokens only.
for token in qm.run.stream(graph, "Tell me a story").tokens():
    print(token, end="", flush=True)

# Dashboard view -- just the tool calls.
for call in qm.run.stream(graph, "Research Slovenia").tool_calls():
    ui.tool_card(call.tool, call.args)

# Progress cards interleaved with model tokens.
for prog in qm.run.stream(graph, "Crunch the dataset").progress():
    ui.status(prog.message, prog.percent)

# Subscribe to one milestone name only.
for evt in qm.run.stream(graph, "Research").custom(name="source_found"):
    ui.add_source(evt.payload["url"])

Streams are single-pass — the wrapper owns its underlying generator, so picking a second filter (or raw-iterating after a filter) raises RuntimeError("stream already consumed"). Pick one consumer per stream.

The async analogue is available via qm.arun.stream(...) with the same four filter helpers, returning AsyncIterator[...].

Post-mortem Result.trace

Every Result (sync or the terminal DoneChunk.result of a stream) carries a structured Trace built from the full FlowEvent stream:

result = qm.run(graph, "Hello!")

result.trace.text                        # concatenated model output
result.trace.tool_calls                  # list[dict] across every agent node
result.trace.progress                    # list[ProgressEvent]
result.trace.custom(name="source_found") # filtered CustomEvent list
result.trace.by_node["Researcher"].text  # tokens for a single node
print(result.trace.as_jsonl())           # JSONL export for logs / fixtures

Progress events from inside tools

Long-running tools reach the flow's ExecutionContext via qm.current_context() and emit structured events that stream back to the UI alongside model tokens:

from quartermaster_tools import tool

@tool()
def slow_research(topic: str) -> dict:
    ctx = qm.current_context()      # None when called outside a flow -- safe
    if ctx is not None:
        ctx.emit_progress("Gathering sources", percent=0.25, topic=topic)
        ctx.emit_custom("source_found", {"url": "https://example.com"})
    # ... do real work ...
    return {"summary": "..."}

OpenTelemetry instrumentation

pip install 'quartermaster-sdk[telemetry]'
from quartermaster_sdk import telemetry

telemetry.instrument()     # uses the global tracer provider
qm.run(graph, "Hello!")    # every node + tool call is now a span

Spans follow the OpenTelemetry GenAI semantic conventions (gen_ai.system, gen_ai.operation.name, gen_ai.tool.name, gen_ai.usage.input_tokens, …). Point your exporter at Jaeger, Tempo, Honeycomb, Logfire, Phoenix, or any OTLP collector.

Quick Start (cloud provider)

agent = (
    qm.Graph("My Agent")
    .user("What can I help you with?")
    .instruction("Respond", model="gpt-4o", system_instruction="You are a helpful assistant.")
)
result = qm.run(agent, "How does photosynthesis work?")

Sync chat shim (no graph needed)

For one-shot LLM calls from sync code (Celery workers, Django views, CLI scripts) — no asgiref.async_to_sync wrapper required:

from quartermaster_providers.providers.local import OllamaProvider

provider = OllamaProvider(default_model="gemma4:26b")
result = provider.chat(
    messages=[{"role": "user", "content": "Pozdravljen!"}],
    max_output_tokens=128,
    thinking_level="off",
)
print(result.content)        # promoted from `reasoning` if `content` is empty
print(result.usage)          # {prompt_tokens, completion_tokens, total_tokens}

Packages

Package Description
quartermaster-graph Graph schema, builder API, validation
quartermaster-providers LLM provider abstraction (OpenAI, Anthropic, Google, Groq, local)
quartermaster-tools Tool definition, registry, built-in tools
quartermaster-nodes Node execution protocols and implementations
quartermaster-engine Flow execution, traversal, memory, streaming
quartermaster-mcp-client MCP protocol client (standalone)
quartermaster-code-runner Docker sandboxed code execution (standalone)

Documentation

See the docs/ directory:

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quartermaster_sdk-0.4.2.tar.gz (98.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

quartermaster_sdk-0.4.2-py3-none-any.whl (70.1 kB view details)

Uploaded Python 3

File details

Details for the file quartermaster_sdk-0.4.2.tar.gz.

File metadata

  • Download URL: quartermaster_sdk-0.4.2.tar.gz
  • Upload date:
  • Size: 98.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for quartermaster_sdk-0.4.2.tar.gz
Algorithm Hash digest
SHA256 2825701d6444b8743c46645c26b56c6fef6142e8b94c67a8be49312ef5cef373
MD5 90977667d62b40e5dc3e42405e5e508b
BLAKE2b-256 a77223f4dbc5e20816af5ebfdcbc8198709514a1768f4d803e337ae7d45ffaba

See more details on using hashes here.

File details

Details for the file quartermaster_sdk-0.4.2-py3-none-any.whl.

File metadata

File hashes

Hashes for quartermaster_sdk-0.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 dcb527d2f1a6ce1dcdb02540da06099f3dd3be651013b54d611e680d37e01b1d
MD5 4534ce4b7169e6b3cfbafa5ca8624e21
BLAKE2b-256 91abb8e0c8a1100c5d807f444bbafe7553ee25e7e699ab30f3f18a036d9cbd5c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page