Reliable, trustworthy, trackable AI workflows for science. A Claude Code-native research assistant with a provenance-tracked knowledge graph.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

maxwellsdm

These details have not been verified by PyPI

Project description

WHEELER

Reliable, trustworthy, trackable AI workflows for science.

v0.9.15 Status: Beta

Wheeler is a thinking partner for scientists, built natively on Claude Code. It gives you slash commands for each stage of research: discuss the question, plan the investigation, execute analyses, write up results. Every action is wrapped in a knowledge graph that tracks how research artifacts (papers, code, data, findings, drafts) depend on each other, making every AI-produced result traceable back to the exact script, data, and parameters that produced it.

Runs 100% locally. No API keys, no cloud services. Your data never leaves your machine.

Named after great physicist John Archibald Wheeler, Niels Bohr's longtime collaborator. Wheeler and Bohr worked by talking. Bohr would pace, thinking out loud. Wheeler would push back, sharpen the question, sketch the math. The best ideas emerged from the conversation, not from either person alone. That's the model here.

Quick Start

uvx wheeler init my-research-project
cd my-research-project && claude
/wh:start

That's it. The first command scaffolds the project (.plans/, .wheeler/, wheeler.yaml, .mcp.json) and installs slash commands and agents to ~/.claude/. The second drops you into Claude Code with Wheeler's MCP servers wired up. The third routes you to the right /wh:* command for what you want to do.

For long-lived use install Wheeler globally (faster startup, stable paths in .mcp.json):

uv tool install wheeler
wheeler init my-research-project

Run wheeler doctor any time to verify your setup (Python version, deps, Claude Code, Neo4j connectivity).

Prerequisites: Python 3.11+, uv, Claude Code (Max subscription), and Neo4j Desktop (free). New to all this? Walk through the Getting Started Guide.

From source

git clone https://github.com/maxwellsdm1867/wheeler.git
cd wheeler
uv sync --extra dev              # editable install + tests + ruff + mypy + build
uv run wheeler init ~/my-research-project

bin/setup.sh is still around for the full bootstrap (Neo4j in Docker, schema init, git hooks, zsh completions).

Why Wheeler

Science requires reproducibility. As AI gets embedded in research workflows, the gap between "AI helped me" and "here's the auditable chain of how this result was produced" becomes a credibility problem.

Wheeler is built on four pillars:

Traceable results. When Wheeler creates a finding, it automatically records what script ran, what data it consumed, what papers informed the approach, and when it happened. One tool call builds the full provenance chain. The agent focuses on science; infrastructure handles bookkeeping.

Change propagation. When a script changes or data is updated, Wheeler flags every downstream finding as stale and reduces its stability score. You always know what to trust and what needs re-verification.

Context management. All components read from and write to the same graph, so a finding from data analysis immediately informs subsequent literature searches, experimental design, and manuscript preparation. Information is progressively disclosed and retrieved only when relevant.

Executable research artifact. The knowledge graph moves beyond the static PDF. It is an executable map of discovery: any scientist can inherit the full experimental context of a project, explore how results connect, and build directly on top of prior work.

The Workflow

Wheeler gives you a fluid cycle, not a rigid pipeline. Enter at any point, skip stages, repeat them.

 TOGETHER         you + wheeler, thinking out loud
 discuss  plan  chat  pair  write  note  ask
                         |
                         v  remaining work is grinding
 HANDOFF          propose independent tasks
 handoff          you approve, modify, or keep talking
                         |
                         v
 INDEPENDENT      wheeler works alone
 wh queue "..."   logged, stops at decision points
                         |
                         v
 RECONVENE        results + flags + surprises
 reconvene        back to TOGETHER

Every plan and execution renders a self-contained visual brief: the question and sub-questions, figure mockups (pre-registered sketches) paired with the real result figures, a pipeline flow chart, and the data sources. /wh:discuss reads that brief to interpret the results with you like a colleague, referencing figures by number and running quick checks against the data to strengthen or disprove a point.

A typical session

The flow we design for, end to end:

/wh:discuss — talk through the question until it is sharp. Wheeler asks like a colleague, grounds the conversation in what the graph already knows, and locks the decisions.
/wh:plan — Wheeler structures the investigation into waves of tasks and, before any data is touched, pre-registers the figures: what each one plots and how competing hypotheses would look different in it. On approval it renders a visual brief (question, mockups, pipeline, data sources) so you react to a picture, not prose. Seeing the mockup often sends one more round of sharpening back into the plan.
/wh:execute — Wheeler runs the WHEELER-assigned tasks, logs findings with full provenance, then regenerates the brief as a report: each pre-registered mockup now sits beside its real result figure, success criteria are marked, and result tables tuck into dropdowns.
/wh:discuss (again, on the results) — hand Wheeler the brief and interpret together: what holds, what is fragile, what the next question is. Wheeler references figures by number, pulls related findings from the graph, and can run a quick check against the data to settle a contested point, registering whatever you endorse back into the graph.
/wh:write drafts from the endorsed findings with strict citations, or /wh:plan opens the follow-up investigation. /wh:close sweeps the session into a synthesis.

You can enter at any step, skip stages, or loop steps 2 to 4 as the work demands.

Commands

Command	What it does
`/wh:start`	Route to the right command (or type your task)
`/wh:discuss`	Think like a colleague: sharpen the question, or interpret a plan's results from its brief (runs checks against the data, cites figures by number)
`/wh:plan`	Structure tasks with waves, assignees, checkpoints; render a visual brief with figure mockups
`/wh:execute`	Run analyses, log findings with provenance; pair mockups with the real result figures in a report
`/wh:write`	Draft text with strict citation enforcement
`/wh:ingest`	Bootstrap graph from existing code, data, papers
`/wh:add`	General-purpose ingest: text, DOI, file, URL
`/wh:note`	Quick-capture an insight, observation, or idea
`/wh:compile`	Compile graph into synthesis documents with citations
`/wh:dream`	Consolidate: promote tiers, detect communities, link orphans
`/wh:pair`	Live co-work: scientist drives, Wheeler assists
`/wh:ask`	Query the graph, trace provenance chains
`/wh:status`	Show progress, suggest next action
`/wh:handoff`	Propose tasks for independent execution
`/wh:reconvene`	Review results from independent work

More commands

Command	What it does
`/wh:chat`	Quick discussion, no execution
`/wh:triage`	Triage GitHub issues against planned work
`/wh:report`	Generate work log from graph (time period)
`/wh:close`	End-of-session provenance sweep
`/wh:pause` / `/wh:resume`	Save and restore investigation state
`/wh:update`	Check for Wheeler updates
`/wh:dev-feedback`	File bugs from inside your session

Headless mode

Wheeler can run tasks without you present:

wh queue "search for papers on SRM models"   # sonnet, 10 turns, logged
wh quick "check graph status"                 # haiku, 3 turns, fast
wh dream                                      # graph consolidation

The wh launcher is a bash script in bin/wh that ships only with the source tree, not the PyPI wheel. To enable it after a uv tool install, clone the repo and symlink it: sudo ln -sf $PWD/bin/wh /usr/local/bin/wh. A native wheeler queue / quick / dream is on the roadmap.

Wheeler never does your thinking. Every task gets tagged: SCIENTIST (judgment calls), WHEELER (grinding), or PAIR (collaborative). Decision points are flagged as checkpoints, not guessed at.

How It Works

Provenance-completing tool calls

The core primitive: one tool call creates a finding AND its full W3C PROV-DM provenance chain. You never write this directly; slash commands handle it. But under the hood, this is what happens:

add_finding(
    description="Midget and parasol cells have similar clusters of fitted SRM parameters",
    confidence=0.85,
    execution_kind="script",                    # auto-creates Execution activity
    used_entities="D-abc123,S-def456",          # auto-links inputs
)

Wheeler internally creates the Finding, an Execution activity node, links inputs (Dataset, Script) via USED, links the output via WAS_GENERATED_BY, sets a stability score, and dual-writes to Neo4j and JSON. The provenance chain is always complete because the agent never had to remember to create it.

Stability and invalidation

Every entity carries a stability score (0.0-1.0) encoding epistemic trust: primary data = 1.0, published papers = 0.9, validated scripts = 0.7, LLM-generated findings = 0.3. When an upstream entity changes, stability decays downstream: new = source * (0.8 ^ hops). Changed scripts propagate stale flags through the entire dependency chain.

The knowledge graph

The graph is an index over files, not a document store. Each node stores an ID, type, tier, title, path, and timestamps. Full content lives in knowledge/{id}.json. Human-browsable rendering lives in synthesis/{id}.md (Obsidian-compatible with YAML frontmatter and [[backlinks]]). When you need connections, ask the graph. When you need content, read the file.

11 entity types: Finding, Hypothesis, OpenQuestion, Dataset, Paper, Script, Execution, Document, ResearchNote, Plan, Ledger.

14 relationship types: 6 W3C PROV standard (USED, WAS_GENERATED_BY, WAS_DERIVED_FROM, WAS_INFORMED_BY, WAS_ATTRIBUTED_TO, WAS_ASSOCIATED_WITH) + 8 Wheeler semantic (SUPPORTS, CONTRADICTS, CITES, APPEARS_IN, RELEVANT_TO, AROSE_FROM, DEPENDS_ON, CONTAINS).

50 MCP tools across 5 servers (mutations, queries, search, ops, legacy monolith).

See ARCHITECTURE.md for the complete technical spec: module dependency map, PROV schema, MCP tool listing, hardening patterns, design decisions.

Service integrations

External research tools land in the graph as provenance-tracked nodes. The model is a sandwich: an act reads graph context and shapes the request, the tool's own CLI runs (owning its auth and retries), and one deterministic Python ingest writes the result back through the triple-write. Every call is one Execution whose status is truthful: a failed or incomplete job is recorded as failed with no fabricated outputs (the external-call failsafe), never masquerading as a clean run. Four Ai2 Asta services ship today (Paper Finder, Semantic Scholar, Theorizer, Literature Reports), routed by /wh:asta.

Adding a new service is its own loop: the wheeler-service-creator skill scaffolds the adapter (registry contract, ingest, act, and test) with the failsafe baked in, then a bundled auditor checks data-safety, provenance, and conventions before it lands. See ARCHITECTURE.md "Service Integrations".

What's New

v0.9.15 (2026-06-15): Asta router, three ways in

Name a service, give an intent, or be asked: /wh:asta now takes three routes in: name a service directly (/wh:asta paper-finder) and it dispatches straightaway, hand it a task and it matches the right adapter, or invoke it bare and it asks what you want before doing anything.
It asks to nail down the right service: when more than one adapter could fit a request, the router uses AskUserQuestion to offer the candidate services (each labeled with its description and cost) instead of silently guessing.
Intent first, graph second: with no intent it asks you before touching the graph (the graph cannot tell it what you want), then reads the graph only once it knows the task, so it never grounds on the wrong thing.
Plan and execute route through it: a /wh:plan or /wh:execute step can call the router and forward its plan id, so the dispatched run anchors AROSE_FROM the right plan, and the service descriptions it routes on now match the shipped adapters exactly.

v0.9.14 (2026-06-15): the update badge actually clears

The update badge reliably clears: /wh:update now updates every install the badge checker tracks (the SessionStart hook probes ~/.local/bin/wheeler and the uv-tools path, not just the session's install), then clears the cache, so the ⬆ /wh:update indicator disappears after updating and stays gone on a multi-install machine.
Integrations, framed: a dedicated README "Integrations" section and a forward-looking roadmap entry describe the Asta integration and where it is headed (external research services reading and acting on Wheeler's graph and context).
Resilient e2e tests: the live-Neo4j test suite now degrades to clean skips on a transient Neo4j outage instead of erroring, so a database hiccup no longer blocks a commit.

v0.9.13 (2026-06-15): update fixes + integration docs

/wh:update resolves split-server installs: the update flow looked for the obsolete single wheeler MCP key and failed on modern installs; it now resolves any Wheeler MCP server (split or legacy), preferring the install serving the session.
The update badge clears after updating: wheeler update reliably drops the ⬆ /wh:update statusline badge by clearing the version-check cache in a way that works for every install type, including uv tool.
Service integrations documented: ARCHITECTURE.md, the README, CLAUDE.md, the tech stack, and the getting-started guide now cover the Asta adapters, the external-call failsafe, and how to add a new service with the wheeler-service-creator skill.

Architecture

Claude Code (interactive)
    ├── /wh:* slash commands (.claude/commands/wh/*.md)
    │       ├── /wh:start: intent router (invokes other commands)
    │       ├── YAML frontmatter: tool restrictions per mode
    │       └── System prompt: workflow + provenance protocol
    │
    ├── MCP Servers (50 tools)
    │       ├── wheeler_core (12): health, status, context, search, cypher
    │       ├── wheeler_query (10): read-only query_* tools
    │       ├── wheeler_mutations (18): add_*, link, delete, update, merge
    │       ├── wheeler_ops (10): staleness, citations, consistency
    │       └── wheeler (legacy monolith): same 50 tools, one server
    │
bin/wh (headless)
    └── claude -p with structured logging → .logs/*.json

Code structure

wheeler/
├── models.py                # Pydantic v2: 11 node types, prefix mappings
├── config.py                # YAML loader, Pydantic config models
├── provenance.py            # Stability scoring, invalidation propagation
├── consistency.py           # Cross-layer drift detection and repair
├── mcp_server.py            # Legacy monolith: all 50 tools
├── mcp_core.py              # Split server: health, context, search (12)
├── mcp_query.py             # Split server: query_* read-only (10)
├── mcp_mutations.py         # Split server: add_*, link, delete, update (18)
├── mcp_ops.py               # Split server: staleness, citations (10)
├── mcp_shared.py            # Shared: trace IDs, decorators, config
├── knowledge/               # File I/O: read, write, list, render, migrate
├── graph/                   # Neo4j backend, circuit breaker, schema, context
├── search/                  # Embeddings, RRF fusion, graph-expanded search
├── validation/              # Citation validation, ledger quality metrics
├── tools/graph_tools/       # Provenance-completing mutations + queries
└── workspace.py             # Project file scanner

tests/                        # 1929 tests
docs/                         # Getting started, architecture, project spec

Contributing

Bug reports: Use /wh:dev-feedback from inside a session to file structured issues, or report at GitHub Issues.

Tests: python -m pytest tests/ -v (1929 tests). E2E tests require a running Neo4j: python -m pytest tests/e2e/ -v.

Architecture: See ARCHITECTURE.md for the full technical spec (module dependency map, PROV schema, MCP tool listing, hardening patterns).

Project docs:

Mission — four pillars, target audience, design north star
Tech stack — components, infrastructure patterns, current gaps
Roadmap — shipped versions, v0.9.0 phases, v1.0 criteria
Getting started — install walkthrough with Neo4j Desktop
Project spec — original design specification

Citation

If you use Wheeler in your research, please cite it:

@software{hong_wheeler_2026,
  author    = {Hong, Arthur and Rieke, Fred},
  title     = {{Wheeler: Reliable, trustworthy, trackable AI workflows for science}},
  year      = {2026},
  publisher = {Zenodo},
  doi       = {10.5281/zenodo.20498885},
  url       = {https://doi.org/10.5281/zenodo.20498885}
}

Integrations

Wheeler integrates with external research services so their output lands in the knowledge graph as provenance-tracked nodes, and so those services can act on Wheeler's own work and context. The first integration is AllenAI Asta: Wheeler ships tools (adapters) for four Asta services, Paper Finder, Semantic Scholar, Theorizer, and Literature Reports, routed by /wh:asta. Each call reads the current graph to shape the request, runs the Asta service, and writes the result back with full provenance (what it USED, what it WAS_GENERATED_BY, and how the new results connect to the existing graph). A failed call is recorded as failed rather than silently lost.

The integration layer is provider-agnostic and growing. Adding a new external tool is its own workflow: the wheeler-service-creator skill scaffolds the adapter, bakes in the provenance and failsafe wiring, and audits it before it lands. See ARCHITECTURE.md "Service Integrations" for the design, and the roadmap for where this is headed.

Acknowledgments

Wheeler's Asta integration shells out to the Asta toolkit from the Allen Institute for AI (Ai2). The Paper Finder, Semantic Scholar, Theorizer, and Literature Reports services are Ai2's work (asta.allen.ai); Wheeler does not vendor or reimplement them, it invokes the upstream asta CLI and marshals the results into the knowledge graph with provenance. Credit and thanks to the Ai2 Asta team.

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

maxwellsdm

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.9.15

Jun 16, 2026

0.9.14

Jun 16, 2026

0.9.13

Jun 16, 2026

0.9.12

Jun 16, 2026

0.9.11

Jun 11, 2026

0.9.10

Jun 11, 2026

0.9.9

Jun 11, 2026

0.9.8

Jun 10, 2026

0.9.7

Jun 10, 2026

0.9.6

Jun 1, 2026

0.9.5

May 25, 2026

0.9.4

May 21, 2026

0.9.3

May 18, 2026

0.9.2

May 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wheeler-0.9.15.tar.gz (387.1 kB view details)

Uploaded Jun 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

wheeler-0.9.15-py3-none-any.whl (448.2 kB view details)

Uploaded Jun 16, 2026 Python 3

File details

Details for the file wheeler-0.9.15.tar.gz.

File metadata

Download URL: wheeler-0.9.15.tar.gz
Upload date: Jun 16, 2026
Size: 387.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for wheeler-0.9.15.tar.gz
Algorithm	Hash digest
SHA256	`846faa19e54583527cd49c67765982a45400df9b28f6475c392ae078babc89cc`
MD5	`98ca80ab7778234a3124227a08f7fd89`
BLAKE2b-256	`8f9642cf8a2721afca0c58dabb338eb1cfe15ec4a51f8ed43d2e5b59c2dd8d8d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for wheeler-0.9.15.tar.gz:

Publisher: publish.yml on maxwellsdm1867/wheeler

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: wheeler-0.9.15.tar.gz
- Subject digest: 846faa19e54583527cd49c67765982a45400df9b28f6475c392ae078babc89cc
- Sigstore transparency entry: 1832272015
- Sigstore integration time: Jun 16, 2026
Source repository:
- Permalink: maxwellsdm1867/wheeler@69698797af1d972519949e64d2b2c73fd2dacbbf
- Branch / Tag: refs/tags/v0.9.15
- Owner: https://github.com/maxwellsdm1867
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@69698797af1d972519949e64d2b2c73fd2dacbbf
- Trigger Event: release

File details

Details for the file wheeler-0.9.15-py3-none-any.whl.

File metadata

Download URL: wheeler-0.9.15-py3-none-any.whl
Upload date: Jun 16, 2026
Size: 448.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for wheeler-0.9.15-py3-none-any.whl
Algorithm	Hash digest
SHA256	`da849ed3f8989ab75fb289dc24336faedfb2c5f5e465f33b8edb8ef8f6f3d1e1`
MD5	`6582c04f3fae0694853449769b99b3e6`
BLAKE2b-256	`e04dd425059572a9c7755a69c22bb57f05e25a18b9857c0aafeae009dacda5ff`

See more details on using hashes here.

Provenance

The following attestation bundles were made for wheeler-0.9.15-py3-none-any.whl:

Publisher: publish.yml on maxwellsdm1867/wheeler

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: wheeler-0.9.15-py3-none-any.whl
- Subject digest: da849ed3f8989ab75fb289dc24336faedfb2c5f5e465f33b8edb8ef8f6f3d1e1
- Sigstore transparency entry: 1832272222
- Sigstore integration time: Jun 16, 2026
Source repository:
- Permalink: maxwellsdm1867/wheeler@69698797af1d972519949e64d2b2c73fd2dacbbf
- Branch / Tag: refs/tags/v0.9.15
- Owner: https://github.com/maxwellsdm1867
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@69698797af1d972519949e64d2b2c73fd2dacbbf
- Trigger Event: release

wheeler 0.9.15

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

WHEELER

Quick Start

From source

Why Wheeler

The Workflow

A typical session

Commands

Headless mode

How It Works

Provenance-completing tool calls

Stability and invalidation

The knowledge graph

Service integrations

What's New

Architecture

Contributing

Citation

Integrations

Acknowledgments

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance