A governed dynamic-graph runtime: a stable Supervisor plans, validates, and runs transient LangGraph workflows from a bounded node registry.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

IanT09

These details have not been verified by PyPI

Project description

Dynamic Subgraphs

A governed runtime for LLM-generated workflows. A stable Supervisor turns a prompt into a validated plan — data, never executable code — then compiles and runs it as a transient LangGraph workflow over a bounded, allowlisted vocabulary of node kinds. Recording is opt-in: when enabled, each run is captured as a replayable, diffable, cost-attributed artifact under runs/<run_id>/ — the public SDK defaults to in-memory execution and writes no files.

What that buys you over a free-form agent loop: the model proposes the workflow but never executes arbitrary code; the compiler only instantiates registry-approved node kinds; recursion is depth- and budget-capped; and any run — success or failure — can be recorded for inspection and replay. LangGraph types stay behind the compiler/ and runtime/ boundaries. An optional thin FastAPI layer exposes the supervisor over HTTP.

What can it build?

From one prompt, the planner assembles a transient graph out of a small set of governed node kinds — llm_call, tool_call, branch, reduce, parallel_map, spawn_subagent, spawn_subgraph, wait_for_event, emit_artifact. A few of the shapes it produces (these render in the same Mermaid the engine writes to graph.mmd for every run):

Parallel research → recommend — fan out to independent workers, then reduce:

graph TD
    START([START])
    END([END])
    extract_a["extract_a<br/>tool_call"]
    extract_b["extract_b<br/>tool_call"]
    summarize_a["summarize_a<br/>llm_call"]
    summarize_b["summarize_b<br/>llm_call"]
    compare["compare_and_recommend<br/>reduce"]
    START --> extract_a
    START --> extract_b
    extract_a --> summarize_a
    extract_b --> summarize_b
    summarize_a --> compare
    summarize_b --> compare
    compare --> END

Tool-grounded answer — search the web, answer from it, write a report:

graph TD
    START([START])
    END([END])
    search["web_search<br/>tool_call"]
    answer["answer<br/>llm_call"]
    report["report<br/>emit_artifact"]
    START --> search
    search --> answer
    answer --> report
    report --> END

Dynamic routing — classify the request, then take only the path it warrants:

graph TD
    START([START])
    END([END])
    classify["classify_intent<br/>llm_call"]
    route{"route<br/>branch"}
    answer["answer<br/>llm_call"]
    search["web_search<br/>tool_call"]
    investigate["investigate<br/>spawn_subgraph"]
    START --> classify
    classify --> route
    route -->|simple| answer
    route -->|needs data| search
    route -->|complex| investigate
    search --> answer
    investigate --> answer
    answer --> END

Human-in-the-loop — pause for an external event, then resume:

graph TD
    START([START])
    END([END])
    draft["draft_proposal<br/>llm_call"]
    review["await_approval<br/>wait_for_event"]
    finalize["finalize<br/>llm_call"]
    START --> draft
    draft --> review
    review --> finalize
    finalize --> END

Nested composition — a node plans and runs a child graph on a fresh, isolated state envelope under enforced depth and spend ceilings (recursion that can't run away):

graph TD
    START([START])
    END([END])
    plan["plan<br/>llm_call"]
    investigate["investigate<br/>spawn_subgraph"]
    synthesize["synthesize<br/>reduce"]
    START --> plan
    plan --> investigate
    investigate --> synthesize
    synthesize --> END

How it works

Those graphs are transient — planned, validated, run, and recorded per request. The only thing that stays fixed is the Supervisor, a stable host graph that governs every run:

graph LR
    START([prompt]) --> plan
    plan["plan<br/>(GraphSpec)"] --> validate
    validate["validate<br/>(registry + budgets)"] --> run
    run["compile & run<br/>(transient graph)"] --> record
    record["record<br/>(spec·trace·mermaid·cost)"] --> replay
    replay["replay / diff / audit"] --> respond
    respond([result]) --> END([END])

The planner emits a plan, never code; the compiler only instantiates registry-approved node kinds; and every run — success or failure — is recorded as a replayable, diffable, cost-attributed artifact. That audit trail is the point: it's what a free-form agent loop can't give you.

When (not) to reach for it

Dynamic Subgraphs earns its keep when the shape of the work varies per input and you need the run governed and auditable. Use it when:

The workflow can't be enumerated ahead of time — the right nodes/edges depend on the request (heterogeneous intake, branching investigations, data-dependent recursion whose depth isn't known until runtime).
You need an audit trail: a validated plan, a per-node trace, deterministic replay, and cost attribution — for compliance, debugging, or reproducibility.
The model should propose the workflow but must not execute arbitrary code, and its tool/capability surface must stay allowlisted and budget-capped.

Reach for something simpler when:

The shape is known. If you can draw the DAG ahead of time, hand-author a fixed LangGraph graph — it's cheaper and more predictable. (If you can write the orchestration as a script, you don't need a planner generating it.)
The task is small. A frontier model in a plain tool loop already decomposes a one-to-three-step task in-context; a planning round-trip is pure overhead there.
Your hard problem is global consistency, not orchestration. Isolated child envelopes are great for blast-radius but work against shared canonical state — pair DS with a retrieval/consistency layer rather than expecting it to enforce one.

Install

With uv (recommended):

uv add dynamic-subgraphs                 # slim core (engine only)
uv add "dynamic-subgraphs[openai]"       # + OpenAI provider
uv add "dynamic-subgraphs[anthropic]"    # + Anthropic provider
uv add "dynamic-subgraphs[ollama]"       # + local Ollama provider
uv add "dynamic-subgraphs[api]"          # + FastAPI HTTP surface
uv add "dynamic-subgraphs[cost]"         # + automatic result.cost (LiteLLM prices)
uv add "dynamic-subgraphs[all]"          # everything

Or with pip:

pip install dynamic-subgraphs
pip install "dynamic-subgraphs[openai]"  # same extras: anthropic, ollama, api, cost, all

The core install is intentionally light (langgraph, langchain-core, pydantic, python-dotenv); provider SDKs and the API server are optional extras so you only pull what you use.

Quickstart (development)

# Set up the dev environment (all extras + dev tooling)
uv sync --all-extras

# Run the offline mock demo (free, no tokens)
uv run python -m app.main "compare A and B"

# Run the HTTP API (boots in mock mode by default; needs the `api` extra)
uv run python -m app.api

By default everything runs in mock mode — free and offline. Set DS_PLANNER=llm and DS_PROVIDER=<provider> to use the real planner and grounded tools. The legacy DS_PLANNER=openai value still maps to planner=llm with provider=openai.

Built-in providers (default_model_providers()):

`DS_PROVIDER`	Package	Credentials
`openai`	`langchain-openai`	`OPENAI_API_KEY`
`anthropic`	`langchain-anthropic`	`ANTHROPIC_API_KEY`
`ollama`	`langchain-ollama`	none (local server; `OLLAMA_BASE_URL` optional)

Each role (planner, worker, reducer, subagent, judge) can target a different provider/model through RunConfig's role-specific ModelRef fields; unset roles fall back to the worker model, then to the base provider+model.

SDK usage

The dynamic_subgraphs package is the importable facade — build an EngineConfig, hand it to the engine, then call run():

from dynamic_subgraphs import DynamicSubgraphs, EngineConfig, Model

# Cloud (key from env)
engine = DynamicSubgraphs(EngineConfig(model=Model("openai", "gpt-5.4-nano")))

# ...or a local LM Studio / Ollama server (bring your own URL/key/model)
engine = DynamicSubgraphs(EngineConfig(model=Model.lmstudio("google/gemma-3-27b")))
engine = DynamicSubgraphs(EngineConfig(model=Model.ollama("llama3.1")))

result = engine.run("Compare two sources on X and recommend one.")
result.response      # synthesized answer text
result.values        # {output_key: value, ...}
result.plan          # the generated GraphSpec
result.artifacts     # {filename: Path} (populated only when recording is on)
result.usage         # exact TokenUsage: input/output/total + per-model breakdown
result.cost          # USD (None unless a pricing book is configured — see below)
result.effective_budget  # the host-granted budget (planner request capped by policy)
result.plan_attempts     # planner attempts (>1 if a rejected plan was repaired)

Token usage & cost

result.usage is always populated with the providers' own reported token counts (via LangChain's usage callback — exact, all providers, no estimation).

Cost works automatically with the cost extra (it uses LiteLLM's maintained price map — you don't specify prices, and we don't ship a table that goes stale):

pip install "dynamic-subgraphs[openai,cost]"

engine = DynamicSubgraphs(EngineConfig(model=Model("openai", "gpt-5.4-nano")))
r = engine.run("...")
r.usage.total_tokens   # e.g. 3233   (exact, free, always)
r.cost                 # e.g. 0.0021 (USD — auto-computed)

Without the extra, result.cost is None (tokens are still exact). You can also pass a manual pricing book on EngineConfig to override prices or to cover local / custom-endpoint models LiteLLM doesn't know:

EngineConfig(model=..., pricing={"my-model": {"input_per_1m": 0.5, "output_per_1m": 1.0}})

(If you use LangSmith, it computes cost server-side as well.)

All engine configuration lives on EngineConfig: the per-role models (model, planner_model, worker_model, reducer_model, subagent_model, judge_model), the recording policy, planner mode, runs_dir, providers, checkpointer, the host-owned policy (ExecutionPolicy), and max_plan_attempts (the plan-repair loop).

Governance: host-owned limits & plan repair

The planner proposes a workflow; the host owns the limits. An ExecutionPolicy is the contract — and it's enforced, not advisory:

from dynamic_subgraphs import DynamicSubgraphs, EngineConfig, ExecutionPolicy, Model

engine = DynamicSubgraphs(EngineConfig(
    model=Model("openai", "gpt-5.4-nano"),
    policy=ExecutionPolicy(
        max_nodes=6, max_llm_calls=4, max_depth=2, max_fanout=16,
        allowed_tools=frozenset({"web_search"}),   # host ∩ registry
    ),
))
r = engine.run("...")
r.effective_budget   # the granted budget = min(host, planner request)

Enforced at validation (root and every nested spawn_subgraph child):

Budgets — node/LLM-call counts capped at min(host, planner request); a plan can't grant itself a larger budget.
Allow-sets — tool / subagent / node-kind use is the host ∩ registry intersection; a plan naming a disallowed capability is rejected.
Fan-out — a parallel_map over more items than max_fanout halts before any work fires.
Nesting — a child's budget is the parent's remaining allowance, so a nest can't outspend the root; depth is capped at the tighter of the host and the rail.
Wall-clock — a run that outruns max_wall_seconds is abandoned (a hung runner can't block forever).

When a plan is rejected for a recoverable reason, the supervisor feeds the issues + host limits back into a re-plan, up to max_plan_attempts (default 2 — repair once; set 1 for strict block-and-report). So a too-ambitious plan is re-planned within the limits instead of just failing. Defaults are permissive-but-bounded, so a typical plan is unaffected.

⚠️ Use a capable model for the planner. The planner must emit a valid GraphSpec; small local models (7B-class, and in practice anything below ~20–30B) frequently produce invalid plans and fail. Run small/local models as the worker_model with a stronger planner_model.

Recording (opt-in)

By default the engine writes no files — embedding it never clutters your working tree. Suggestion: set a recording policy while developing or debugging to capture the trace under runs/<run_id>/, then leave it at the default in production / library use:

from dynamic_subgraphs import Recording, Artifact

engine = DynamicSubgraphs(EngineConfig(
    model=Model("openai", "gpt-5.4-nano"),
    recording=Recording.debug(),      # capture everything
    runs_dir="runs",
))

Recording is granular — choose exactly which artifacts to write with the Artifact enum (its values are the filenames) and the Recording policy:

recording=Recording.visual_only()           # just graph.mmd (the diagram)
recording=Recording.all() - {Artifact.SPEC}  # everything except spec.json
recording={Artifact.MERMAID, Artifact.TRACE}  # a raw set works too

Preset	Writes	Use for
`Recording.none()` (default)	nothing	embedding / production
`Recording.all()`	every artifact	full capture
`Recording.debug()`	every artifact	debugging a run
`Recording.visual_only()`	`graph.mmd`	a picture of the graph
`Recording.replayable()`	`spec.json` + `output.json`	enabling `resume`/`replay`

Coding agents can enumerate every valid option via DynamicSubgraphs.capabilities(). See docs/recipes.md for copy-pasteable patterns.

Engine model defaults can be overridden per run() call, so each run picks the models for its own node calls (e.g. a cheap cloud planner with local workers):

result = engine.run(
    "Investigate this task.",
    planner_model=Model("openai", "gpt-5.4-nano"),
    worker_model=Model.lmstudio("openai/gpt-oss-20b"),
)

Documentation

examples/ — runnable, standalone SDK integration examples (one file per pattern).
docs/recipes.md — copy-pasteable SDK patterns (local models, hybrid, recording presets, debugging) + tested-model and latency tables.
docs/api-stability.md — API stability & change policy: what counts as public, SemVer, deprecation, and how we keep changes non-breaking.
docs/evals/ — eval reports (e.g. the gpt-5.4-nano vs claude-haiku-4-5 e2e comparison: latency / tokens / cost / quality, traced via LangSmith).
docs/api.md — the HTTP surface over the supervisor (endpoints, modes, auth, examples).
docs/dynamic-graphs-canonical-design-v1.md — canonical project design and source of truth.
docs/index.md — full documentation index.
AGENTS.md — agent-facing package map and MVP sequence.

Contributing & support

Contributions are welcome — see CONTRIBUTING.md for the dev setup, test, and formatting workflow, and CODE_OF_CONDUCT.md. Found a bug or have a request? Open an issue. For security reports, see SECURITY.md.

Status

Pre-1.0 (0.x) — the public SDK surface is usable and tested, but the API may change between minor versions until 1.0. See CHANGELOG.md.

License

Licensed under the Apache License 2.0. See NOTICE for attribution.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

IanT09

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.5.0

Jun 12, 2026

0.4.0

Jun 5, 2026

0.3.0

Jun 5, 2026

0.2.0

Jun 4, 2026

0.1.0

Jun 3, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dynamic_subgraphs-0.5.0.tar.gz (185.2 kB view details)

Uploaded Jun 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

dynamic_subgraphs-0.5.0-py3-none-any.whl (163.5 kB view details)

Uploaded Jun 12, 2026 Python 3

File details

Details for the file dynamic_subgraphs-0.5.0.tar.gz.

File metadata

Download URL: dynamic_subgraphs-0.5.0.tar.gz
Upload date: Jun 12, 2026
Size: 185.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dynamic_subgraphs-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`2c3931fc3e12dedd7dacb47932cab68658b7127d5bd3c2b138b6c84e411117f2`
MD5	`e1f26dabf5d60e6e2d86730c36280b6c`
BLAKE2b-256	`b6d1ec1419287d26d1e5ba63b13aed2c6d4632eff1c119b9c55373b9c0d1699b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for dynamic_subgraphs-0.5.0.tar.gz:

Publisher: publish.yml on Ian-Tharp/Dynamic-Subgraphs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dynamic_subgraphs-0.5.0.tar.gz
- Subject digest: 2c3931fc3e12dedd7dacb47932cab68658b7127d5bd3c2b138b6c84e411117f2
- Sigstore transparency entry: 1797760684
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: Ian-Tharp/Dynamic-Subgraphs@f5fef13075db867c44e373afa2aaaf22063a4663
- Branch / Tag: refs/tags/v0.5.0
- Owner: https://github.com/Ian-Tharp
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f5fef13075db867c44e373afa2aaaf22063a4663
- Trigger Event: push

File details

Details for the file dynamic_subgraphs-0.5.0-py3-none-any.whl.

File metadata

Download URL: dynamic_subgraphs-0.5.0-py3-none-any.whl
Upload date: Jun 12, 2026
Size: 163.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dynamic_subgraphs-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`01bffae41d6bfca77fc41f2f24da59b359a7251bb30168472608a2f6d0655958`
MD5	`e073a01d5797444b22c05c0e3383fae8`
BLAKE2b-256	`88414feece86f8dc077261838aacabd16d4b759645f0d47931c3692e5537b8c4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for dynamic_subgraphs-0.5.0-py3-none-any.whl:

Publisher: publish.yml on Ian-Tharp/Dynamic-Subgraphs

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dynamic_subgraphs-0.5.0-py3-none-any.whl
- Subject digest: 01bffae41d6bfca77fc41f2f24da59b359a7251bb30168472608a2f6d0655958
- Sigstore transparency entry: 1797760771
- Sigstore integration time: Jun 12, 2026
Source repository:
- Permalink: Ian-Tharp/Dynamic-Subgraphs@f5fef13075db867c44e373afa2aaaf22063a4663
- Branch / Tag: refs/tags/v0.5.0
- Owner: https://github.com/Ian-Tharp
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f5fef13075db867c44e373afa2aaaf22063a4663
- Trigger Event: push

dynamic-subgraphs 0.5.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Dynamic Subgraphs

What can it build?

How it works

When (not) to reach for it

Install

Quickstart (development)

SDK usage

Token usage & cost

Governance: host-owned limits & plan repair

Recording (opt-in)

Documentation

Contributing & support

Status

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance