Forecasting as a harness for decision-making

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

MaxGhenis

These details have not been verified by PyPI

Project links

Project description

Farness

Forecasting as a harness for decision-making.

Instead of asking "Is X good?" or "Should I do Y?", farness helps you:

Define what success looks like (KPIs)
Expand your options (including ones you didn't consider)
Make explicit forecasts (with confidence intervals)
Track outcomes to improve calibration over time

Installation

python -m pip install -e /path/to/farness

Quick Start

As a Python package

from farness import Decision, KPI, Option, Forecast, DecisionStore
from datetime import datetime, timedelta

# Create a decision
decision = Decision(
    question="Should I take the new job offer?",
    kpis=[
        KPI(name="income", description="Total comp after 2 years", unit="$"),
        KPI(name="satisfaction", description="Job satisfaction 1-10"),
    ],
    options=[
        Option(
            name="Take new job",
            description="Accept the offer at Company X",
            forecasts={
                "income": Forecast(
                    point_estimate=300000,
                    confidence_interval=(250000, 400000),
                    reasoning="Base + equity, assuming normal vesting",
                ),
                "satisfaction": Forecast(
                    point_estimate=7.5,
                    confidence_interval=(6, 9),
                    reasoning="Interesting work, but unknown team",
                ),
            }
        ),
        Option(
            name="Stay at current job",
            description="Decline and stay",
            forecasts={
                "income": Forecast(
                    point_estimate=250000,
                    confidence_interval=(230000, 280000),
                    reasoning="Known trajectory, likely promotion",
                ),
                "satisfaction": Forecast(
                    point_estimate=6.5,
                    confidence_interval=(6, 7),
                    reasoning="Comfortable but plateauing",
                ),
            }
        ),
    ],
    review_date=datetime.now() + timedelta(days=180),
)

# Save it
store = DecisionStore()
store.save(decision)

Command Line

# List decisions
farness list

# Show a specific decision
farness show abc123

# Check calibration
farness calibration

# See what needs review
farness pending

AI Agent Workflows

farness is not tied to Claude. The Claude Code plugin is the most integrated path today, but the framework also works with Codex and other coding agents that can follow structured instructions or run shell commands.

For agent-agnostic setup and prompt guidance, see docs/agent-workflows.md.

Codex and other coding agents

The CLI is a local decision store and calibration tool. It does not call an LLM or require an API key by itself.

To use the current repo version from source:

python -m pip install -e /path/to/farness
farness new "Should we rewrite the auth layer?" --context "3 incidents this quarter; CTO prefers Rust; team is strongest in Node."

Then give the agent a farness instruction block:

Use the farness workflow for this decision.
1. Define the KPI or outcome that would make the decision successful.
2. Expand the option set beyond the choices already mentioned.
3. Anchor on a relevant reference class or base rate before using the inside view.
4. Show the main mechanism or decomposition that drives the forecast.
5. List the strongest disconfirming evidence, failure modes, or decision traps.
6. Give point estimates with 80% confidence intervals for each option on each KPI.
7. Recommend a review date and say what would be logged later for calibration.

MCP server

If you want a native tool interface instead of prompt copy-paste, run the MCP server from the repo:

python -m pip install -e '/path/to/farness[mcp]'
farness-mcp

It exposes tools for creating, listing, retrieving, saving, and scoring decisions, plus resources/prompts for the farness workflow.

To register it in Codex as a local MCP server:

codex mcp add farness -- uv run --project /path/to/farness --extra mcp farness-mcp

To install the Codex skill, copy or symlink skills/farness into $CODEX_HOME/skills (default ~/.codex/skills) and restart Codex.

Claude Code local skill + MCP

Claude Code can use the same local MCP server and a local skill wrapper:

python -m pip install -e '/path/to/farness[mcp]'
claude mcp add farness -- uv run --project /path/to/farness --extra mcp farness-mcp
mkdir -p ~/.claude/skills
ln -s /path/to/farness/.claude/skills/farness ~/.claude/skills/farness

The plugin path still works if you prefer the slash-command workflow:

claude plugin marketplace add MaxGhenis/farness
claude plugin install farness@maxghenis-plugins

Then either use the local farness skill or /farness:decide if you installed the plugin.

The Framework

Farness implements a structured decision process:

KPI Definition - What outcomes actually matter? Make them measurable.
Option Expansion - Don't just compare A vs B. What about C? What about waiting? What about hybrid approaches?
Reference Class - Start with a relevant outside view or base rate before adjusting for specifics.
Mechanism / Decomposition - Break forecasts into estimable components and causal drivers.
Disconfirming Evidence - Surface the strongest failure modes, traps, and reasons the leading option could be wrong.
Confidence Intervals - Point estimates aren't enough. How uncertain are you?
Tracking - Log decisions and review outcomes to calibrate over time.

Why This Works

Reduces sycophancy - Harder to just agree when making numeric predictions
Forces mechanism thinking - Must reason about cause and effect
Creates accountability - Predictions can be scored later
Separates values from facts - You pick KPIs (values), forecasts are facts
Builds calibration - Track predictions over time to improve

Development

git clone https://github.com/MaxGhenis/farness
cd farness
pip install -e ".[dev,experiments]"
pytest

Paper build:

python3 paper/render_paper.py  # Regenerates figures, HTML, Markdown, and site/public/paper-raw
python3 paper/run_strongest_validation.py  # Runs the strongest reviewer-facing validation on Claude Opus 4.6 and GPT-5.2
python3 paper/run_study1_rerun.py --models gpt-5.4  # Reruns the original Study 1 design with legacy prompt wording
python3 -m farness.experiments stability --strongest-validation --model gpt-5.2  # Single-model equivalent

Publishing to PyPI

The package is published to PyPI from GitHub Releases using PyPI Trusted Publishing.

Setup (one-time):

In PyPI, open the farness project publishing settings:
- https://pypi.org/manage/project/farness/settings/publishing/
Add a GitHub Actions trusted publisher with:
- Owner: MaxGhenis
- Repository name: farness
- Workflow name: publish.yml
- Environment name: leave blank unless you later add a GitHub environment

To publish a new version:

Update version in pyproject.toml
Create a new release on GitHub with a tag (e.g., v0.2.0)
The GitHub Actions workflow will automatically build and publish to PyPI

The repo no longer needs a stored PYPI_API_TOKEN once Trusted Publishing is configured.

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

MaxGhenis

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.4

Mar 25, 2026

0.2.3

Mar 24, 2026

0.2.2

Mar 24, 2026

0.2.1

Mar 24, 2026

This version

0.2.0

Mar 24, 2026

0.1.0

Dec 12, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

farness-0.2.0.tar.gz (77.9 kB view details)

Uploaded Mar 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

farness-0.2.0-py3-none-any.whl (66.9 kB view details)

Uploaded Mar 24, 2026 Python 3

File details

Details for the file farness-0.2.0.tar.gz.

File metadata

Download URL: farness-0.2.0.tar.gz
Upload date: Mar 24, 2026
Size: 77.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for farness-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`8f45f91b2410d48397fb854d2174a6bcc99645a1eb748e542eeab265ffd21c38`
MD5	`ef8751091c4ba8d79c83aa41573ba663`
BLAKE2b-256	`4fbcf05cf6792bf89d2098032535a1ad4be1def5ad170b296124df7ba60f99dc`

See more details on using hashes here.

Provenance

The following attestation bundles were made for farness-0.2.0.tar.gz:

Publisher: publish.yml on MaxGhenis/farness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: farness-0.2.0.tar.gz
- Subject digest: 8f45f91b2410d48397fb854d2174a6bcc99645a1eb748e542eeab265ffd21c38
- Sigstore transparency entry: 1175063348
- Sigstore integration time: Mar 24, 2026
Source repository:
- Permalink: MaxGhenis/farness@a7caec306f3bf76d07660567ff8b4c9c7d89c476
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/MaxGhenis
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a7caec306f3bf76d07660567ff8b4c9c7d89c476
- Trigger Event: release

File details

Details for the file farness-0.2.0-py3-none-any.whl.

File metadata

Download URL: farness-0.2.0-py3-none-any.whl
Upload date: Mar 24, 2026
Size: 66.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for farness-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7db13b4063f74941d484da26b3b3657e91c2c228104a9b14a790c97e50112c1c`
MD5	`19ba4c795d7f866bf8153dd046c41365`
BLAKE2b-256	`a90da5668693a2166ec488e102e4ed3a5529327250812b06133a49533089a37e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for farness-0.2.0-py3-none-any.whl:

Publisher: publish.yml on MaxGhenis/farness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: farness-0.2.0-py3-none-any.whl
- Subject digest: 7db13b4063f74941d484da26b3b3657e91c2c228104a9b14a790c97e50112c1c
- Sigstore transparency entry: 1175063384
- Sigstore integration time: Mar 24, 2026
Source repository:
- Permalink: MaxGhenis/farness@a7caec306f3bf76d07660567ff8b4c9c7d89c476
- Branch / Tag: refs/tags/v0.2.0
- Owner: https://github.com/MaxGhenis
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a7caec306f3bf76d07660567ff8b4c9c7d89c476
- Trigger Event: release

farness 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Farness

Installation

Quick Start

As a Python package

Command Line

AI Agent Workflows

Codex and other coding agents

MCP server

Claude Code local skill + MCP

The Framework

Why This Works

Development

Publishing to PyPI

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance