A DAG-planning agent with deterministic tier routing, fresh context per node, and plan-as-document semantics.

These details have not been verified by PyPI

Project description

Glassrail

A DAG-planning agent with deterministic tier routing, fresh context per node, and plan-as-document semantics.

Every task becomes a validated graph of nodes instead of a ReAct loop: the planner emits a DAG, the validator checks its invariants, and the executor runs it topologically. Each node sees only the upstream outputs it declared it needs, and model selection falls through an ordered set of tiers (local → cloud) by fixed rules rather than by the model's discretion.

Status

Early development — the engine runs end to end (plan → validate → execute over tier routing, with persistence, a typed event stream, and a REST gateway), and the eval framework measures it. The Phase 1 eval gate is met and the first PyPI release is being prepared. Treat APIs as unstable while Glassrail is in 0.x. See CHANGELOG.md for what's landed and the roadmap for what's next.

Principles

DAG planning — every task is a validated graph of nodes, not a ReAct loop.
Fresh context per node — each node sees only the upstream outputs it declared in context_needed.
Plan as document — plans are inspectable, replayable, and visualizable.
Tiered model routing — deterministic fallthrough from local → cloud, with per-tier timeouts.

Requirements

Python 3.12+
uv for dependency management
A model backend (see below) — the agent does nothing without one.

Quickstart

From PyPI, once the first release is published:

uvx glassrail --help
uvx glassrail run "summarise the CAP theorem in three bullets"

From source:

uv sync --all-extras

The agent needs at least one reachable LLM tier. By default tier 0 points at a local OpenAI-compatible server (http://localhost:8080/v1, model qwen3.6-35b-moe) and tiers 1–3 point at OpenRouter. Pick one:

# Option A — a local server on :8080 (e.g. an MLX or llama.cpp OpenAI-compatible
# endpoint). Nothing else to configure; tier 0 is the default.

# Option B — use OpenRouter for the cloud tiers:
export GLASSRAIL_TIER1__API_KEY=sk-or-...

Then run a task:

uv run glassrail run "summarise the CAP theorem in three bullets"

If no tier is reachable, the router walks tier 0 → tier 3 and then fails with All providers exhausted; last error: … — that almost always means no model backend is wired up, not a bug.

Ways to run it

Headless, one-shot — the full engine in one process; prints the result:

uv run glassrail run "<task>"
uv run glassrail run "<task>" --json            # machine-readable result envelope
uv run glassrail run "<task>" --model <name>    # override tier 0's model
uv run glassrail run "<task>" --timeout 120     # wall-clock budget in seconds

The --json envelope includes the accepted plan when planning succeeds and planning_attempts for every planner try, including raw model output plus parse or validation errors. This makes failed plans inspectable from headless runs and eval artefacts.

Gateway + live viewer — start the REST gateway, then watch a task stream:

uv run uvicorn glassrail.gateways.rest:app      # serves on :8000
uv run glassrail tui "<task>"                   # POSTs the task, renders the live DAG + stream

The viewer draws the plan as colour-coded node boxes connected by edges (grouped into dependency layers, each box showing a short summary, recoloured as they run) above a per-node table; --no-dag shows the table alone. See Terminal UI.

Editor / agent clients (ACP) — speak the Agent Client Protocol as a JSON-RPC 2.0 server over stdio, so an ACP client (the in-repo Rust TUI, or Zed) can spawn the agent as a subprocess, submit tasks, and watch the plan and nodes stream back:

uv run glassrail acp                            # JSON-RPC over stdin/stdout; logs to stderr

The in-repo Rust terminal client speaks this protocol — submit a task, watch the plan stream, approve or revise it, all in the terminal. See clients/tui:

cd clients/tui && cargo run -- uv run glassrail acp

It implements initialize, session/new, session/prompt, and session/cancel; the plan and per-node execution arrive as session/update notifications. Before executing, the agent pauses at a plan gate and asks the client to approve via session/request_permission — a client may approve, or reject with free-text feedback to trigger a guided replan. (fs/* and terminal/* are intentionally unsupported — the agent runs its own tools server-side.)

REST API directly — POST /task returns a task_id; follow it over Server-Sent Events or a WebSocket at /task/{id}/events, or poll GET /task/{id}. See Streaming events.

Configuration

Twelve-factor: environment variables (and an optional .env / config.toml), parsed by pydantic-settings. Tiers are nested, so use the __ delimiter:

Setting	Env var	Default
Tier 0 model	`GLASSRAIL_TIER0__MODEL`	`qwen3.6-35b-moe`
Tier 0 endpoint	`GLASSRAIL_TIER0__BASE_URL`	`http://localhost:8080/v1`
Tier 0 timeout (s)	`GLASSRAIL_TIER0__TIMEOUT_S`	`10.0`
Tier 1 API key	`GLASSRAIL_TIER1__API_KEY`	(empty)
HITL plan gate	`GLASSRAIL_CONFIRM_PLANS`	`false`
Tool approval mode	`GLASSRAIL_TOOL_APPROVAL__MODE`	`interactive`
Planner stall char multiplier	`GLASSRAIL_PLANNER_STALL_CHAR_MULTIPLIER`	`4`
Load tool plugins	`GLASSRAIL_LOAD_TOOL_PLUGINS`	`false`

Tiers 1–3 default to OpenRouter models; override any field the same way. With a local model as your only tier, raise GLASSRAIL_TIER0__TIMEOUT_S (e.g. to 120) — a large local model can take longer than the 10 s default, and a timeout is treated as the tier being unavailable.

Generation ceiling

max_generation_tokens (default 20000) is a hard cap on max_tokens sent to any tier for any single request, applied by the router before the request leaves the process. Per-node budgets (below) are the goal; this is the safety backstop that prevents a single generation from consuming unbounded memory on a local model across long multi-step runs. Set it in config.toml or via GLASSRAIL_MAX_GENERATION_TOKENS.

Per-node token budgets

Each node runs with a fresh context; these cap how many tokens it may generate (output), so reasoning and summaries get room while structured micro-calls stay small. Override any field under [budgets] in config.toml (or GLASSRAIL_BUDGETS__<FIELD>):

Budget	Default	Used by
`planner`	16384	the full plan JSON
`think`	8192	multi-step reasoning
`summary`	8192	high-fidelity document/webpage summaries
`synthesis`	4096	combining prior outputs
`result`	4096	the final answer
`decision`	256	a branch label
`extract_args`	512	a tool-args object
`shape_check`	128	a yes/no output gate

These are output caps. How much a node can read is bounded by your served model's context window, not by these.

Planner output that is not valid JSON and exceeds budgets.planner * planner_stall_char_multiplier characters is classified as a stall; the next retry sees a truncated copy of that raw output and is told not to repeat it.

Node prompts

Each node role (planner, decision, think, synthesis, summary, result, and the tool-output shape check) has a system prompt you can override without editing source — under [prompts] in config.toml or GLASSRAIL_PROMPTS__<FIELD>. The defaults live in glassrail.config.prompts. A custom prompt must keep instructing the model to emit the JSON shape its node expects (e.g. a summary prompt must still ask for {"summary": ..., "confidence": ...}).

Tools

Built-in tools (file_read, plus calendar_get / memory_search stubs) always register. Add a first-party tool by decorating a function with @harness.tool(name=..., description=..., parameters=<JSON Schema>).

First-party integrations are bundled but opt-in, configured under [tools.*]. The web integration needs the web extra and is off by default:

web_fetch(url) — fetch a page and extract its main text (for reading or summarising webpages).
web_search(query) — search the web behind a pluggable provider: duckduckgo (no setup) or searxng (point at a self-hosted instance).

The image integration wraps the mflux-generate CLI on macOS and is also off by default:

image_generate(prompt, output_path) — generate a PNG from text using Flux.
image_generate(..., image_path=..., image_strength=...) — image-to-image generation/editing from an existing source image.

Install mflux separately, then either put mflux-generate on PATH or set mflux_bin. The tool is declared write risk, so it asks for approval in interactive mode unless explicitly allowed.

uv sync --extra web                      # installs trafilatura + lxml

[tools]
fs_roots = ["~/work", "/tmp/glassrail-eval"] # optional path confinement for file tools

[tools.web]
fetch = true
search = "duckduckgo"                    # or "searxng" (+ searxng_url)

[tools.image]
enabled = true
mflux_bin = "~/.venvs/mflux/bin/mflux-generate" # optional; empty = auto/PATH

tools.fs_roots confines first-party filesystem paths after ~ expansion and symlink resolution. When unset, file tools keep the current unconfined behavior and log a warning the first time they resolve a path.

Tool Approval

Per-tool approval is configured under [tool_approval]. Policies are: allow (run), ask (prompt an interactive client), and deny (never run). Explicit per-tool overrides win. Without an override, tools declared as write or execute risk default to ask; read and network tools use the configured default. mode = "auto" treats ask as allow for unattended/headless execution, but keeps deny as deny.

[tool_approval]
default = "allow"
mode = "interactive"                     # or "auto"

[tool_approval.overrides]
file_write = "ask"
shell_exec = "deny"
image_generate = "allow"                 # explicit override for a write-risk tool

Third-party plugins advertised through the glassrail.tools entry-point group are a separate opt-in: set GLASSRAIL_LOAD_TOOL_PLUGINS=true (or load_tool_plugins = true) and the runtime discovers and registers them at startup.

Security notes

Glassrail is early 0.x software run by its operator, not a hardened service. Current posture (hardening is tracked in Security baseline):

The REST gateway has no authentication — keep it bound to localhost and do not expose it to untrusted networks.
file_read and image_generate output paths are confined when tools.fs_roots is set. When it is unset, file tools can access any path the process can access and log a one-time warning.
The web tools fetch model-chosen URLs — enabling them means the agent has outbound network access it chooses how to use.
Tool risk levels participate in approval: without an explicit override, write and execute tools ask in interactive mode. mode = "auto" still treats ask as allow, so use explicit deny overrides for tools that must never run unattended.

Evals

Model-quality evals (multi-trial pass@k capability vs pass^k reliability against the real agent) live in the standalone eval-framework/:

cd eval-framework && python3 run.py suite suites/glassrail

The glassrail-cli backend drives the real planner and executor over the agent's own tier routing via glassrail run --json. See Evals.

Development

uv sync --all-extras --group dev
uv run pre-commit install
uv run pytest

See CONTRIBUTING.md for the full check sweep and PR guidelines, CLAUDE.md for the package layout and conventions, and the docs site for the architecture, streaming, observability, and deployment references.

License

Apache-2.0. See LICENSE.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.2

Jun 13, 2026

0.1.1

Jun 13, 2026

This version

0.1.0

Jun 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glassrail-0.1.0.tar.gz (204.7 kB view details)

Uploaded Jun 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

glassrail-0.1.0-py3-none-any.whl (133.4 kB view details)

Uploaded Jun 11, 2026 Python 3

File details

Details for the file glassrail-0.1.0.tar.gz.

File metadata

Download URL: glassrail-0.1.0.tar.gz
Upload date: Jun 11, 2026
Size: 204.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for glassrail-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`57ab1941da48c2d6e1fd7cd69b726b79330325d9b4500bf491b317f64852571a`
MD5	`b6c1fb66742b3ec108fc78a8d5304ff9`
BLAKE2b-256	`cd4b17eb454dddf9eb5637482f09c163080c07af4a9cb1a8e2419750a44ed558`

See more details on using hashes here.

Provenance

The following attestation bundles were made for glassrail-0.1.0.tar.gz:

Publisher: publish.yml on andrew-ellis-engineering/glassrail

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: glassrail-0.1.0.tar.gz
- Subject digest: 57ab1941da48c2d6e1fd7cd69b726b79330325d9b4500bf491b317f64852571a
- Sigstore transparency entry: 1791711328
- Sigstore integration time: Jun 11, 2026
Source repository:
- Permalink: andrew-ellis-engineering/glassrail@5c4b2ab6d65e3dc4ca53eea396134c37134e3b0b
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/andrew-ellis-engineering
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5c4b2ab6d65e3dc4ca53eea396134c37134e3b0b
- Trigger Event: release

File details

Details for the file glassrail-0.1.0-py3-none-any.whl.

File metadata

Download URL: glassrail-0.1.0-py3-none-any.whl
Upload date: Jun 11, 2026
Size: 133.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for glassrail-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0d24dd9e799b9c555470a390c3b4b6f0c0914f1bb44938e2fdd073664903b2c7`
MD5	`dc4dec23dd4e20fa8401287cc4e134e2`
BLAKE2b-256	`203319eb851bc995ff968401a90c0853960fef403b720148877b1f18691f8633`

See more details on using hashes here.

Provenance

The following attestation bundles were made for glassrail-0.1.0-py3-none-any.whl:

Publisher: publish.yml on andrew-ellis-engineering/glassrail

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: glassrail-0.1.0-py3-none-any.whl
- Subject digest: 0d24dd9e799b9c555470a390c3b4b6f0c0914f1bb44938e2fdd073664903b2c7
- Sigstore transparency entry: 1791711428
- Sigstore integration time: Jun 11, 2026
Source repository:
- Permalink: andrew-ellis-engineering/glassrail@5c4b2ab6d65e3dc4ca53eea396134c37134e3b0b
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/andrew-ellis-engineering
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5c4b2ab6d65e3dc4ca53eea396134c37134e3b0b
- Trigger Event: release

glassrail 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Glassrail

Status

Principles

Requirements

Quickstart

Ways to run it

Configuration

Generation ceiling

Per-node token budgets

Node prompts

Tools

Tool Approval

Security notes

Evals

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance