Local-first Proof Receipt — prove which AI model, prompt, or workflow is worth trusting.

These details have not been verified by PyPI

Project links

Project description

Orionfold Proof Receipt

Prove what your AI can do before you trust it.

A local-first, hybrid-capable Proof Receipt product. It runs private proof tests across local and cloud AI workflows, compares quality, speed, cost, failure cases, and privacy boundaries, then exports a repeatable Proof Receipt you can keep, rerun, or share.

The central artifact is the Proof Receipt. The user is not here to watch AI run — they are here to decide what AI to trust.

Status: Gate 7 — ship candidate

The full v0 loop works end-to-end. Keyless out of the box: pick the bundled sample dataset, run two deterministic mock candidates, see the leaderboard, inspect failure cases (including a surfaced provider error), and export a Proof Receipt in Markdown, HTML, and JSON — each stamped with a config hash, timestamp, and schema version (currently v3).

Real providers when you configure them: the same loop runs against local and cloud models — Ollama and LM Studio (local), plus OpenAI, OpenRouter, Google Gemini, and Anthropic (cloud) — behind one uniform result boundary. Cloud candidates appear only when their API key resolves; the keyless mock path stays the instant default. See Configure providers below.

Run it: bash scripts/build.sh && uv run orionfold up, open the cockpit, click Run proof, then export a receipt. See a sample under samples/receipts/, a guided walkthrough in docs/demo-script.md, and the release history in CHANGELOG.md.

Quickstart

Prerequisites: Python 3.12+, uv, and pnpm (build-time only).

Develop (API with reload + Vite dev server with /api proxy):

uv sync                       # backend env
pnpm --dir web install        # cockpit deps
uv run orionfold dev          # API at http://127.0.0.1:8787 (reload)
pnpm --dir web dev            # cockpit at http://localhost:5173 (proxies /api)

Run the embedded build (cockpit served by FastAPI — the install-time experience):

bash scripts/build.sh         # build cockpit -> embed -> build wheel
uv run orionfold up           # open http://localhost:8787

Test:

uv run pytest                 # backend (unit + integration; keyless)
pnpm --dir web test           # cockpit (Vitest)
pnpm --dir web exec playwright install chromium  # one-time, for e2e
pnpm --dir web e2e            # Playwright happy-path (boots the embedded build)

Target install: uv tool install orionfold-proof && orionfold up.

Configure providers

The mock candidates need no setup. To prove real models, make their credentials resolvable. Orionfold reads keys from two places, in order:

The system environment (preferred for CI / 12-factor).
A repo-root .env.local file (git-ignored; convenient for local dev).

The system environment wins when both are set; empty or whitespace-only values are treated as absent. A cloud candidate is offered only when its key resolves, so the cockpit never lists a model that can't run. Keys are never logged, printed, or written into any receipt or screenshot.

Create .env.local at the repo root (an example, never commit real keys):

# .env.local — git-ignored. Set only the providers you want to prove.
OPENAI_API_KEY=sk-...
OPENROUTER_API_KEY=sk-or-...
GEMINI_API_KEY=AIza...
ANTHROPIC_API_KEY=sk-ant-...

Local providers need no key — just a reachable server: Ollama (ollama serve, models pulled) and LM Studio (lms server start, a model loaded). Both are always offered.

Defaults and overrides — each profile ships a sensible default model and every knob is env-overridable (set in .env.local or the environment; no code change):

Profile	Default model	Model override	Endpoint override
Ollama (local)	`llama3.2`	`ORIONFOLD_OLLAMA_MODEL`	`OLLAMA_HOST`
LM Studio (local)	`local-model`	`ORIONFOLD_LMSTUDIO_MODEL`	`LMSTUDIO_BASE_URL`
OpenAI	`gpt-4o-mini`	`ORIONFOLD_OPENAI_MODEL`	`OPENAI_BASE_URL`
OpenRouter	`openai/gpt-4o-mini`	`ORIONFOLD_OPENROUTER_MODEL`	`OPENROUTER_BASE_URL`
Gemini	`gemini-2.5-flash`	`ORIONFOLD_GEMINI_MODEL`	—
Anthropic	`claude-haiku-4-5`	`ORIONFOLD_ANTHROPIC_MODEL`	—

The model is part of a candidate's identity and feeds the run's config_hash, so changing it produces a distinct, traceable receipt.

Two cross-cutting knobs apply to every provider:

ORIONFOLD_MAX_TOKENS (default 2048) — per-completion output cap. Raise it for local reasoning models (qwen3, deepseek-r1, gpt-oss), which spend the budget thinking and return empty content at a low cap.
ORIONFOLD_TIMEOUT_S — per-cell idle budget (how long one example may run before it fails as a timed out after … row, never crashing the run). Defaults are per provider class: local 300s (generous — slow local generation), cloud 90s (tighter). Set this to override both at once; raise it for heavy local reasoning models. The connection itself is always capped at ~10s so an unreachable host fails fast.

Other knobs: ORIONFOLD_ENV_FILE (point at a non-default env file) and ORIONFOLD_DB (override the SQLite path; default ~/.orionfold/proof.db).

Estimated costs use a small built-in price table for the default models. An unknown model (e.g. OpenRouter's namespaced ids) shows $0.00 — costs are labeled estimated, never authoritative.

How development is structured

Work proceeds through operator-approved gates (see CLAUDE.md → Release gates):

⏸ Product brief → docs/product-brief.md
⏸ Release charter → docs/release-charter.md
⏸ Architecture → docs/adr/0001-local-first-proof-receipt-architecture.md
Skeleton → 5. Vertical slice → 6. Provider integration → 7. Ship candidate

The ⏸ gates stop for your approval.

Start here (operator)

In Claude Code, kick off product discovery:

Read docs/opportunity.md. Do not code yet.

Use the product-release-interview skill: interview me with AskUserQuestion to clarify the
first release, challenge assumptions, and produce docs/product-brief.md and
docs/release-charter.md for a v0 a solo founder can build quickly.

Bias toward a local-first Proof Receipt product, not a broad cockpit, SaaS platform, or
generic local model runner.

Then, before activating tighter permissions, review and rename .claude/settings.json.example → .claude/settings.json.

What's in this scaffold

CLAUDE.md                         Lean always-on operating guide (< 200 lines)
.claude/
  settings.json.example           Reviewable permissions/model template (rename to activate)
  rules/                          Path-scoped constraints (providers, receipts, storage)
  skills/                         9 procedural skills (interview, vertical slice, reviews, gates)
  agents/                         diff-reviewer · codebase-investigator · security-reviewer
docs/
  opportunity.md                  Market/product source (read once, summarize into brief)
  claude-context-and-ux-addendum.md  Context engineering + UX quality bar
  tech/                           reference-index · docs-update-log · dependency-policy
  ux/                             design system · usability/a11y/visual checklists · copy-deck
  adr/                            architecture decision records (template seeded)
  worklog/                        per-session summaries (template seeded)
src/orionfold/                    Backend: CLI, FastAPI server, domain, providers, storage
web/                              Vite/React cockpit (built + embedded into the wheel)
samples/                          Bundled demo dataset + sample receipts (MD/HTML/JSON)
tests/  scripts/                  pytest (unit + integration), Playwright e2e, build script

Stack (boring on purpose)

Backend: Python 3.12+, uv, FastAPI, Pydantic, Typer, SQLite, httpx, pytest, ruff, pyright.
Frontend: Vite, React, TypeScript, Tailwind, shadcn/Radix, TanStack Query, Zod, React Hook Form, Recharts.
Testing: pytest, Vitest, Playwright (visual + e2e), deterministic mock providers.

Install: uv tool install orionfold-proof && orionfold up → http://localhost:8787. Dev: uv sync && pnpm --dir web install && uv run orionfold dev (see Quickstart above).

PyPI distribution name: orionfold-proof (CLI command orionfold). The brand names orionfold and orionfold-arena are reserved as placeholders for future products.

See CLAUDE.md for full operating guidance and docs/opportunity.md for the strategy.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.3

Jun 28, 2026

0.2.2

Jun 28, 2026

0.2.1

Jun 28, 2026

0.2.0

Jun 28, 2026

0.1.3

Jun 27, 2026

0.1.2

Jun 26, 2026

0.1.1

Jun 26, 2026

This version

0.1.0

Jun 25, 2026

0.0.0

Jun 20, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orionfold_proof-0.1.0.tar.gz (517.3 kB view details)

Uploaded Jun 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

orionfold_proof-0.1.0-py3-none-any.whl (428.6 kB view details)

Uploaded Jun 25, 2026 Python 3

File details

Details for the file orionfold_proof-0.1.0.tar.gz.

File metadata

Download URL: orionfold_proof-0.1.0.tar.gz
Upload date: Jun 25, 2026
Size: 517.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.0

File hashes

Hashes for orionfold_proof-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`e4d5ade532751cbf9f27d7baa9d382d509655fbdeb32b21fd19a7388b03796b6`
MD5	`4dc5bef9e443bdceb410e61efa812643`
BLAKE2b-256	`62ee923b7efe349b86e811b20b800060fb2f3b6324e144239439acc1085e362b`

See more details on using hashes here.

File details

Details for the file orionfold_proof-0.1.0-py3-none-any.whl.

File metadata

Download URL: orionfold_proof-0.1.0-py3-none-any.whl
Upload date: Jun 25, 2026
Size: 428.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.0

File hashes

Hashes for orionfold_proof-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ab08295516842c78637f1df265d9faf7f7a79fd001bb44b1792a1f864c5f4d9c`
MD5	`7fb79fc8582bb22dc1fcb36967223f3d`
BLAKE2b-256	`c1d70193167171d2dd971a9936967ecb547a9fe0180785ea071c1f9b001f97a9`

See more details on using hashes here.

orionfold-proof 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Orionfold Proof Receipt

Status: Gate 7 — ship candidate

Quickstart

Configure providers

How development is structured

Start here (operator)

What's in this scaffold

Stack (boring on purpose)

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes