Automated harness optimization for AI agents — make your agent evolve.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

PolyHarness

  _____      _        _    _                                   
 |  __ \    | |      | |  | |                                  
 | |__) |__ | |_   _ | |__| | __ _ _ __ _ __   ___  ___ ___    
 |  ___/ _ \| | | | ||  __  |/ _` | '__| '_ \ / _ \/ __/ __|   
 | |  | (_) | | |_| || |  | | (_| | |  | | | |  __/\__ \__ \   
 |_|   \___/|_|\__, ||_|  |_|\__,_|_|  |_| |_|\___||___/___/   
                __/ |                                          
               |___/

Make your AI Agent evolve automatically.

What is a "harness"? A harness is the code that wraps your AI agent's interaction with a task — including the prompt template, tool configuration, output parsing logic, and any pre/post-processing steps. It's the how your agent solves a problem, not the model itself. PolyHarness iteratively searches for better harness configurations so you don't have to tune them by hand.

Your AI agent runs the same harness every time. Same prompts, same tool config, same strategy — no matter how many times it fails.

PolyHarness addresses that. It records each iteration, evaluates candidate harness changes, and uses the accumulated history to search for better-scoring configurations. You run one command to start the loop.


Self-Evolution	Iteratively searches over harness changes and keeps the full evaluation history in one workspace.
6 Agent Backends	Claude Code · Claw Code · Codex · OpenCode · API direct · Local — plug in any CLI agent.
Full History	Every iteration's code, scores, and traces preserved. The Meta-Harness paper reports that non-Markovian search outperforms blind retries.
Search Tree	Visualize the optimization path. Compare any two candidates with per-task diffs.
One-Command Setup	`ph init --base-harness ... --task-dir ...` — copies files, configures workspace, done.
Closed Loop	init → run → inspect → apply. You choose when to write the best-scoring candidate back to your project.

Backstory

Stanford's Meta-Harness paper (IRIS Lab, 2026) proved a surprising result: harness design is the #1 lever for agent performance — more impactful than model choice, prompt engineering, or fine-tuning.

The key insight? When you give an AI agent access to full diagnostic history — not just the latest score, but every past attempt's code, traces, and failure modes — it can systematically evolve its own harness configuration. The paper called this "non-Markovian search" and showed it outperforms simple best-of-N sampling by a wide margin.

But the paper only released the final optimized artifact (agent.py). The search framework itself was never open-sourced.

PolyHarness fills that gap. It's the open-source engine that makes Meta-Harness search available to everyone — for any agent, any task, any evaluation pipeline.

Think of it this way:

Memory tools (like Supermemory) give agents persistent memory across conversations.

PolyHarness gives agents persistent self-evolution — you get a repeatable way to refine how they work over time.

What PolyHarness Is

PolyHarness is the open-source engine for iteratively searching over an agent's harness.

It builds on ideas from the Meta-Harness paper and the TBench2 results reported there, while focusing this repository on the optimization workflow itself — how harness variants are proposed, evaluated, and revised over repeated runs.

If tools like ForgeCode help you code, PolyHarness helps you search for task-specific harness improvements by iterating on prompts, tool use, and harness logic.

Use PolyHarness

I use AI coding agents

You have Claude Code, Codex, or another agent. You want to tune it for your specific tasks — without manually tweaking prompts.

pip install polyharness
ph init --agent claude-code --template text-classification
ph run
ph apply

You now have a repeatable optimization workspace. Inspect the results, then apply the best-scoring candidate if it improves your evaluation.

→ Jump to Quick Start

I'm building agent frameworks

You're developing an AI agent or tool and want to integrate automated optimization as a feature.

PolyHarness provides a pluggable adapter API — implement 3 methods and your agent can participate in the same search loop.

class MyAgentAdapter(CLIAdapter):
    def build_command(self, prompt, cwd):
        return ["my-agent", "--prompt", prompt]
    def parse_output(self, stdout, stderr, code):
        return CLIResult(...)

→ Jump to Architecture

Quick Start

1. Install

pip install polyharness         # Python >= 3.12
# or
npm install -g polyharness      # Node.js wrapper, auto-installs Python package

2. Check your environment

ph doctor

This auto-detects which agent backends (Claude Code, Codex, etc.) are installed and shows their status.

3. Initialize a workspace

ph init sets up two things:

Who optimizes (--agent) — which AI does the thinking: a CLI tool like claude-code, or an API like api / openai.
What to optimize (--template or --base-harness + --task-dir) — your harness code, test cases, and evaluation script. These three are always needed for ph run to work.

Option A: Use a bundled template (recommended for first run)

PolyHarness ships with ready-to-run templates. One command sets up everything:

ph init --agent api --template text-classification

This copies a complete set of harness + tasks + evaluate script into the workspace automatically:

.ph_workspace/
├── base_harness/
│   └── harness.py          # starting code to optimize
├── tasks/
│   └── test_cases.json     # test inputs + expected outputs
├── evaluate.py             # scoring script
└── config.yaml             # auto-generated

That's it — skip to step 4.

Available templates: text-classification, math-word-problems, code-generation, rag-qa, api-calling.

Option B: Use your own project

You need three files: harness.py (code to optimize), tasks/test_cases.json (test data), and evaluate.py (scoring script). Generate them all with one command:

ph new my-project

This creates:

my-project/
├── base_harness/
│   └── harness.py          # ← edit: your starting logic
├── tasks/
│   └── test_cases.json     # ← edit: your test inputs + expected outputs
└── evaluate.py             # ← edit if needed: scoring logic

Edit the generated files for your task. For example, if you're building a text classifier:

# my-project/base_harness/harness.py
def solve(input_data: str) -> str:
    # A simple starting point — the agent will improve this
    if "good" in input_data.lower():
        return "positive"
    return "negative"

// my-project/tasks/test_cases.json
[
  {"input": "This product is good", "expected": "positive"},
  {"input": "Terrible experience",  "expected": "negative"},
  {"input": "The meeting is at 3pm", "expected": "neutral"}
]

evaluate.py works out of the box — it calls harness.solve(case["input"]), compares with case["expected"], and reports accuracy. Only edit it if your scoring needs custom logic.

Then initialize:

ph init \
  --agent claude-code \
  --base-harness ./my-project/base_harness \
  --task-dir ./my-project

Flag	What to pass	Required?
`--agent`	Who optimizes: `claude-code`, `codex`, `api`, `openai`, etc.	Yes (default `api`)
`--base-harness`	Directory with your starting harness code (at least `harness.py`)	Yes*
`--task-dir`	Directory with `tasks/test_cases.json` and optionally `evaluate.py`	Yes*
`--eval-script`	Path to `evaluate.py`, if it lives outside `--task-dir`	Only if not in task-dir
`--workspace`	Where to create the workspace (default `.ph_workspace`)	No

* Technically optional at init time, but ph run will fail without harness code and test data.

ph init copies everything into an isolated optimization workspace — your original code is never modified.

Configure Your Agent

PolyHarness automatically sandboxes your agent inside this workspace, ensuring it only edits candidate copies and safely reads history traces.

Scenario	How to configure
Supported CLI Tools	Run `ph init --agent <name>`. PolyHarness auto-injects required instructions (e.g., `CLAUDE.md`). (Supported: claude-code, claw-code, codex, opencode)
Anthropic API	Run `ph init --agent api`. Set `export ANTHROPIC_API_KEY="sk-ant-..."` before `ph run`.
OpenAI / Local Models	Run `ph init --agent openai`. Then configure the endpoint — see Local Model Setup below.
Custom CLI path	If your CLI agent uses a non-standard command, edit `config.yaml` in the workspace before running: `proposer: { cli_path: "npx @anthropic-ai/claude-code" }`

4. Run the optimization loop

ph run

The orchestrator: copies your harness → asks the Proposer agent for a candidate change → evaluates the result → stores everything → repeats.

5. Inspect and apply

ph status                      # progress table + elapsed + improvement rate
ph log                         # search tree with delta (Δ) column
ph best                        # best candidate details
ph leaderboard                 # ranked table of all candidates (--tasks for drilldown)
ph compare 0 5                 # diff two iterations (scores + code)
ph diff 5                      # shorthand for: compare 0 5
ph trace 3                     # view stdout/stderr/metrics for iter_3
ph report                      # generate a full markdown report

ph apply                       # write best harness back to base_harness/
ph export ./my-optimized       # or export to any directory
ph clean --keep-best           # remove candidates to free disk space

Try it now (no API key needed)

ph init --agent local --template math-word-problems
ph run --max-iterations 5
ph log

# Search Tree
# └── iter_0  0.3500
#     └── iter_1  0.5000
#         └── iter_2  0.6500
#             └── iter_3  0.9000 ★

The score path above is the current measured result of the bundled math-word-problems example with the repository's local backend, rounded for readability. It is not a paper benchmark or an external project result. The local backend is deterministic; no fixed score uplift is claimed here for Claude Code, Codex, or other real agent backends.

How It Works

PolyHarness runs a Meta-Harness-style search loop — an iterative process where an AI agent proposes, evaluates, and stores harness changes:

┌──────────────────────────────────────────────────────────────┐
│                                                              │
│   You                          PolyHarness                   │
│    │                              │                          │
│    ├── ph init ──────────────────→│ Creates workspace        │
│    │   (harness + tasks + eval)   │ Copies files             │
│    │                              │ Injects CLAUDE.md        │
│    │                              │                          │
│    ├── ph run ───────────────────→│ Starts search loop:      │
│    │                              │                          │
│    │   ┌──────────────────────────┤                          │
│    │   │  Step 1: SELECT parent   │ Best or Tournament       │
│    │   │  Step 2: COPY harness    │ From parent → candidate  │
│    │   │  Step 3: PROPOSE changes │ Agent reads all history  │
│    │   │  Step 4: EVALUATE        │ Run tasks, get scores    │
│    │   │  Step 5: STORE results   │ Code + scores + traces   │
│    │   │  Step 6: CHECK stopping  │ Improved? Patience left? │
│    │   └──────────┬───────────────┤                          │
│    │              └── loop ───────┘                          │
│    │                              │                          │
│    ├── ph log ───────────────────→│ Shows search tree        │
│    ├── ph compare 0 5  ──────────→│ Score deltas + code diff │
│    └── ph apply ─────────────────→│ Writes best back         │
│                                                              │
└──────────────────────────────────────────────────────────────┘

Why it works: non-Markovian search

Traditional approaches: run the agent → check the score → retry. Each attempt is independent.

PolyHarness is different. Every iteration stores:

The complete candidate source code
Per-task scores (not just the overall number)
Full execution traces (stdout, stderr, exit codes)
Metadata (parent candidate, proposer model, changes summary)

The Proposer reads all of this before generating the next candidate. It can see why a previous attempt failed, which specific tasks regressed, and what code changes caused it. This is why the Meta-Harness paper found that full-context search outperforms scores-only search by 15+ percentage points.

Supported Agent Backends

Backend	Command	Use case
`api`	—	Default. Anthropic API direct, just needs `ANTHROPIC_API_KEY`
`openai`	—	OpenAI-compatible API (Ollama, vLLM, LM Studio, etc). Needs `OPENAI_API_KEY`
`claude-code`	`claude -p`	Official Claude Code CLI (Pro/Teams subscription)
`claw-code`	`claw -p`	Open-source Claw Code CLI
`codex`	`codex --quiet`	OpenAI Codex CLI
`opencode`	`opencode -p`	OpenCode CLI
`local`	—	Offline rule-based engine for development & testing

ph doctor auto-detects all available backends and shows their status.

When you run ph init --agent claude-code, PolyHarness automatically generates a CLAUDE.md instruction file in the workspace, telling the agent how to behave as an optimization Proposer. Same for CLAW.md, CODEX.md, OPENCODE.md — each agent's native instruction format.

Local Model Setup

If you're running a local model (Ollama, vLLM, LM Studio, or any OpenAI-compatible server), use the openai backend:

# 1. Initialize (use a template, or --base-harness + --task-dir for your own project)
ph init --agent openai --template text-classification

# 2. Configure your local endpoint
ph config set proposer.model llama3.3
ph config set proposer.base_url http://localhost:11434/v1
ph config set proposer.api_key sk-dummy

# 3. Run
ph run

Or edit .ph_workspace/config.yaml directly:

proposer:
  backend: openai
  model: llama3.3                          # your local model name
  base_url: http://localhost:11434/v1      # Ollama default
  api_key: sk-dummy                        # local models don't need a real key
  max_tokens: 16384
  temperature: 0.7

Common local endpoints:

Tool	`base_url`
Ollama	`http://localhost:11434/v1`
vLLM	`http://localhost:8000/v1`
LM Studio	`http://localhost:1234/v1`
LocalAI	`http://localhost:8080/v1`

Configuration Reference

After ph init, the workspace has a config.yaml with these sections:

search:
  max_iterations: 20          # Maximum search iterations
  early_stop_patience: 5      # Stop after N iterations with no improvement
  parent_selection: best       # Strategy: best | tournament | all

proposer:
  backend: api                 # api | openai | claude-code | claw-code | codex | opencode | local
  model: claude-sonnet-4-20250514  # Model name (for api/openai backends)
  base_url: null               # Custom API endpoint (for openai backend)
  api_key: null                # API key override (null = use env var)
  max_tokens: 16384            # Max output tokens per proposer turn
  temperature: 0.7             # Sampling temperature (0.0 – 2.0)
  cli_path: null               # Custom CLI executable path (auto-detect if null)

evaluator:
  type: python                 # python | docker | custom
  entry: evaluate.py           # Evaluator script entrypoint
  timeout: 300                 # Per-task timeout in seconds

harness:
  language: python             # Harness code language
  entry: harness.py            # Harness entrypoint file
  editable_files:              # Files the Proposer is allowed to modify
    - harness.py
    - prompt_template.txt

You can modify values via CLI: ph config set search.max_iterations 30

Installation

pip (recommended)

pip install polyharness      # Requires Python >= 3.12
ph --version

npm / npx

npm install -g polyharness   # postinstall auto-installs Python package
npx polyharness doctor       # or run without global install

The npm package is a thin Node.js wrapper (bin/ph.mjs) that finds and invokes the Python CLI. It checks: ph on PATH → python -m polyharness → auto-discovers .venv in parent directories.

From source

git clone https://github.com/weijt606/polyharness.git
cd polyharness

python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
# or: pip install anthropic click pydantic pyyaml rich && export PYTHONPATH="$PWD/src"

python -m polyharness --version

CLI Reference

Command	Description
`ph doctor`	Detect installed agents and environment status
`ph new [dir]`	Scaffold a new harness project (generates harness.py + tasks + evaluate.py)
`ph init`	Initialize workspace with auto-copy of harness, tasks, eval script
`ph run`	Start the optimization search loop
`ph status`	Progress table with elapsed time, improvement rate, and delta
`ph log`	Search tree with delta (Δ) column (or `--flat` for table)
`ph best`	Show best candidate: score, per-task breakdown, changes summary
`ph compare A B`	Compare two iterations: score deltas + unified code diff
`ph diff <N>`	Shorthand for `compare 0 <N>`
`ph leaderboard`	Ranked table of all candidates (`--top N`, `--tasks` drilldown)
`ph trace <N>`	View stdout, stderr, metrics, exit code for an iteration
`ph report`	Generate a full markdown report with score trends and per-task table
`ph apply`	Copy best harness back to `base_harness/` (or `--target` dir)
`ph export <dir>`	Export candidate to any directory (with optional `--include-meta`)
`ph clean`	Remove candidate dirs to free disk space (`--keep-best`, `-y`)
`ph config show`	Display the current workspace configuration
`ph config set K V`	Modify a config value via dot-notation (with validation)
`ph upgrade`	Upgrade PolyHarness to the latest version
`ph uninstall`	Uninstall PolyHarness from the current environment (`-y` to skip confirm)

Global flags

-v, --verbose        Show detailed output
-q, --quiet          Suppress non-essential output

`ph init` options

--agent <name>       Backend: claude-code | claw-code | codex | opencode | api | local
--workspace <dir>    Workspace directory (default: current dir)
--base-harness <dir> Copy starting harness code into workspace
--task-dir <dir>     Copy tasks/ folder and evaluate.py into workspace
--eval-script <path> Copy a specific evaluate.py into workspace

`ph run` options

--max-iterations N   Override max iterations
--dry-run            Only evaluate the base harness, skip search
--resume             Continue an interrupted search from where it left off
--backend <name>     Override proposer backend without editing config
--strategy <name>    Override parent selection: best | tournament | all

Examples

The score trajectories below are measured from the bundled examples using the current local backend and are rounded for readability. They are not borrowed from the Meta-Harness paper or from external benchmarks.

Text Classification (sentiment analysis)

ph init --agent local --template text-classification
ph run --max-iterations 3

# iter_0: 0.65 → iter_1: 1.00 ★  (naive word list → expanded lexicon)

Math Word Problems (numerical reasoning)

ph init --agent local --template math-word-problems
ph run --max-iterations 5

# iter_0: 0.35 → iter_1: 0.50 → iter_2: 0.65 → iter_3: 0.90 ★
# (naive multiply → operation detection → averages/% → multi-step reasoning)

Code Generation (function synthesis)

ph init --agent local --template code-generation
ph run --max-iterations 5

# iter_0: 0.27 → iter_1: 0.50 → iter_2: 0.68 → iter_3: 0.95 ★
# (5 keywords → 10 patterns → composite logic → comprehensive coverage)

API Calling (endpoint routing + parameter extraction)

ph init --agent local --template api-calling
ph run --max-iterations 5

# iter_0: 0.19 → iter_1: 0.55 → iter_2: 0.77 → iter_3: 0.87 ★
# (keyword matching → broad routing → param helpers → full regex extraction)

RAG Question Answering (retrieval + answer extraction)

ph init --agent local --template rag-qa
ph run --max-iterations 5

# iter_0: 0.51 → iter_1: 0.79 ★
# (word overlap → stopword-filtered retrieval + sentence scoring)

Project Structure

src/polyharness/
├── cli.py                   # Click CLI — 16 commands/subcommands
├── config.py                # Pydantic config models
├── orchestrator.py          # Meta-Harness search loop + progress bar + error recovery
├── workspace.py             # Filesystem workspace + agent instruction injection
├── search_log.py            # JSONL append-only search log
├── doctor.py                # Environment detection for all backends
├── evaluator/
│   └── evaluator.py         # PythonEvaluator (subprocess)
├── proposer/
│   ├── api_proposer.py      # Anthropic API direct + tool-use loop
│   ├── cli_proposer.py      # CLIProposer — unified subprocess management
│   ├── local_proposer.py    # Offline rule-based (5 task types)
│   └── adapters/            # Per-agent CLI adapters
│       ├── claude_code.py   # claude -p
│       ├── claw_code.py     # claw -p
│       ├── codex.py         # codex --quiet --auto-edit
│       └── opencode.py      # opencode -p

bin/
├── ph.mjs                   # npm wrapper
└── postinstall.mjs          # npm postinstall

tests/                       # 128 tests (pytest)

Local Development

git clone https://github.com/weijt606/polyharness.git && cd polyharness
python -m venv .venv && source .venv/bin/activate
pip install anthropic click pydantic pyyaml rich pytest pytest-cov ruff
export PYTHONPATH="$PWD/src"

python -m pytest tests/      # run tests
ruff check src/ tests/       # lint

Documentation

Product Development — roadmap, user scenarios, success metrics
Technical Architecture — system design & data flow
Meta-Harness Paper — theoretical foundation and paper-reported reference results

Give your agent self-evolution. It's about time.

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

weijt606

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.4

May 26, 2026

0.2.3

May 25, 2026

0.2.2

May 24, 2026

0.2.1

Apr 8, 2026

0.2.0

Apr 8, 2026

This version

0.1.3

Apr 8, 2026

0.1.2

Apr 8, 2026

0.1.1

Apr 4, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

polyharness-0.1.3.tar.gz (80.2 kB view details)

Uploaded Apr 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

polyharness-0.1.3-py3-none-any.whl (74.6 kB view details)

Uploaded Apr 8, 2026 Python 3

File details

Details for the file polyharness-0.1.3.tar.gz.

File metadata

Download URL: polyharness-0.1.3.tar.gz
Upload date: Apr 8, 2026
Size: 80.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for polyharness-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`17128f4c21bff4bf222a764b1e7faf4aa4a8ce8fa276202dac7b743012747fc6`
MD5	`8712d0cf125b9e5a045f9d8f02d2dc24`
BLAKE2b-256	`00c3877960c97b4ae04da0fbab25cdba632689a9d85fc801e7405fe611699702`

See more details on using hashes here.

Provenance

The following attestation bundles were made for polyharness-0.1.3.tar.gz:

Publisher: publish-pypi.yml on weijt606/polyharness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: polyharness-0.1.3.tar.gz
- Subject digest: 17128f4c21bff4bf222a764b1e7faf4aa4a8ce8fa276202dac7b743012747fc6
- Sigstore transparency entry: 1256442442
- Sigstore integration time: Apr 8, 2026
Source repository:
- Permalink: weijt606/polyharness@f96c836f9d0f88c8d63c50511221f87f336098b4
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/weijt606
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@f96c836f9d0f88c8d63c50511221f87f336098b4
- Trigger Event: push

File details

Details for the file polyharness-0.1.3-py3-none-any.whl.

File metadata

Download URL: polyharness-0.1.3-py3-none-any.whl
Upload date: Apr 8, 2026
Size: 74.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for polyharness-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`859375a6a10fb2b692ced64b56ba779e893d183b73adb446dc4287e944c81aef`
MD5	`a689b53e346dbb00175ca21c068e866d`
BLAKE2b-256	`64d415c1c23cac7c533797160d36e33877d4971986abfd5b079582065871a377`

See more details on using hashes here.

Provenance

The following attestation bundles were made for polyharness-0.1.3-py3-none-any.whl:

Publisher: publish-pypi.yml on weijt606/polyharness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: polyharness-0.1.3-py3-none-any.whl
- Subject digest: 859375a6a10fb2b692ced64b56ba779e893d183b73adb446dc4287e944c81aef
- Sigstore transparency entry: 1256442617
- Sigstore integration time: Apr 8, 2026
Source repository:
- Permalink: weijt606/polyharness@f96c836f9d0f88c8d63c50511221f87f336098b4
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/weijt606
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@f96c836f9d0f88c8d63c50511221f87f336098b4
- Trigger Event: push

polyharness 0.1.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

PolyHarness

Backstory

What PolyHarness Is

Use PolyHarness

I use AI coding agents

I'm building agent frameworks

Quick Start

1. Install

2. Check your environment

3. Initialize a workspace

Option A: Use a bundled template (recommended for first run)

Option B: Use your own project

4. Run the optimization loop

5. Inspect and apply

Try it now (no API key needed)

How It Works

Why it works: non-Markovian search

Supported Agent Backends

Local Model Setup

Configuration Reference

Installation

pip (recommended)

npm / npx

From source

CLI Reference

Global flags

ph init options

ph run options

Examples

Text Classification (sentiment analysis)

Math Word Problems (numerical reasoning)

Code Generation (function synthesis)

API Calling (endpoint routing + parameter extraction)

RAG Question Answering (retrieval + answer extraction)

Project Structure

Local Development

Documentation

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`ph init` options

`ph run` options