Skip to main content

Run Harbor tasks in the cloud with scheduling, monitoring, and persistent state

Project description

Oddish CLI

Run Harbor tasks on local or hosted Oddish infrastructure.

oddish is a Python CLI for submitting Harbor tasks, running multi-trial sweeps, monitoring experiments, and pulling logs and artifacts back to disk. If you already use harbor run, Oddish adds persistent state, retries, queueing, and better operational tooling around the same task format.

Python 3.12+ is required.

Quick Start

uv pip install oddish

# Hosted Oddish
export ODDISH_API_KEY="ok_..."

# For local/self-hosted Oddish instead:
# export ODDISH_API_URL="http://localhost:8000"

# Submit a run
oddish run -d swebench@1.0 -a codex -m openai/gpt-5.2 --n-trials 3

# Watch progress
oddish status

# Pull logs and artifacts locally
oddish pull <task_id> --watch

For hosted usage, the CLI targets Oddish Cloud by default and expects ODDISH_API_KEY. For local/self-hosted usage, point the CLI at your API with ODDISH_API_URL; localhost does not require auth by default.

Installation

uv pip install oddish

Common environment variables:

# Hosted Oddish
export ODDISH_API_KEY="ok_..."

# Local or self-hosted Oddish
export ODDISH_API_URL="http://localhost:8000"

# Optional dashboard override
export ODDISH_DASHBOARD_URL="https://www.oddish.app"

Need to deploy your own stack? See ../SELF_HOSTING.md. Need package internals, architecture, or development notes? See AGENTS.md.

Commands

The installed console script is:

oddish --help

Available commands:

  • oddish run submits a task, dataset, or sweep config
  • oddish status shows system, task, or experiment status
  • oddish cancel stops all in-flight runs for a task
  • oddish pull downloads logs and artifact files locally
  • oddish delete deletes task data or resets local infrastructure

oddish run

Use oddish run for:

  • a single local Harbor task directory
  • a local dataset directory containing multiple tasks
  • a Harbor registry dataset via --dataset
  • a YAML or JSON sweep config via --config

Examples:

# Local task
oddish run ./my-task -a claude-code -m anthropic/claude-sonnet-4-5

# Local dataset
oddish run ./my-dataset -a codex -m openai/gpt-5.2 --n-trials 3

# Harbor registry dataset
oddish run -d swebench@1.0 -a codex -m openai/gpt-5.2 --n-trials 3

# Filter a dataset
oddish run -d swebench@1.0 -t "django__*" -l 10 -a claude-code

# Submit in the background
oddish run ./my-task -a claude-code --background

Common flags:

  • -a, --agent selects the agent
  • -m, --model selects the model
  • --n-trials runs multiple trials per task
  • -d, --dataset pulls tasks from the Harbor registry
  • -c, --config loads a YAML or JSON sweep config
  • -t, --task-name and -x, --exclude-task-name filter tasks by glob
  • -l, --n-tasks limits how many tasks run
  • -e, --env selects the execution environment
  • --experiment groups runs into an explicit experiment
  • -w, --watch / --no-watch watches task progress until completion
  • --background submits and returns immediately
  • -q, --quiet suppresses output
  • --run-analysis runs post-trial analysis and verdict generation
  • --publish publishes experiment for public read-only access
  • --priority sets priority (low or high)
  • --disable-verification skips running task tests

Supported --env values:

  • docker
  • daytona
  • e2b
  • modal
  • runloop
  • gke

When --env is omitted:

  • local API URLs default to docker
  • hosted Oddish (*.modal.run) defaults to modal
  • other remote APIs default to docker

Sweep Configs

oddish run -c sweep.yaml accepts YAML or JSON. A minimal config:

agents:
  - name: claude-code
    model_name: anthropic/claude-sonnet-4-5
    n_trials: 3
  - name: codex
    model_name: openai/gpt-5.2
    n_trials: 3

dataset: swebench@1.0
n_tasks: 10
priority: low

Per-agent overrides such as environment variables, kwargs, and timeouts are passed through Harbor agent config fields.

oddish status

Examples:

# System overview
oddish status

# Watch a task
oddish status <task_id> --watch

# Watch an experiment
oddish status --experiment <experiment_id> --watch

oddish cancel

Cancel all in-flight runs for a task without deleting any data. Queued jobs are removed, running trials are marked as failed, and Modal worker containers are terminated. Completed trials and their results are preserved.

oddish cancel <task_id>
oddish cancel <task_id> --force   # skip confirmation

oddish pull

Examples:

# Pull one trial
oddish pull <trial_id>

# Keep syncing a task while it runs
oddish pull <task_id> --watch --interval 5

# Pull an entire experiment, including task files
oddish pull <experiment_id> --include-task-files

By default, pull output is written to ./.oddish/<target>. You can also configure what to download using --no-logs, --no-files, and --structured (for structured logs), or change the output directory with --out.

oddish delete

Examples:

# Delete a task and its trials
oddish delete <task_id>

# Delete an entire experiment
oddish delete --experiment <experiment_id>

# Stop local infrastructure but keep data
oddish delete --stop-only

Typical Workflow

# 1. Submit a run
oddish run -d swebench@1.0 -a claude-code -m anthropic/claude-sonnet-4-5

# 2. Inspect or watch it
oddish status
oddish status <task_id> --watch

# 3. Pull outputs when you want them locally
oddish pull <task_id> --watch

More Technical Docs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oddish-0.1.4.tar.gz (17.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

oddish-0.1.4-py3-none-any.whl (19.9 kB view details)

Uploaded Python 3

File details

Details for the file oddish-0.1.4.tar.gz.

File metadata

  • Download URL: oddish-0.1.4.tar.gz
  • Upload date:
  • Size: 17.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.11 {"installer":{"name":"uv","version":"0.9.11"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for oddish-0.1.4.tar.gz
Algorithm Hash digest
SHA256 0bed2082714ab952db6373732c69965ad6da1276c2d6b73ae8b2c9090e790270
MD5 ecb346b852e96aca1d5fa6f4a895b11b
BLAKE2b-256 ac11b1279129c5c19892053c103e8e99eccc4a101eb87b703a68bbcb7b329d2f

See more details on using hashes here.

File details

Details for the file oddish-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: oddish-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 19.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.11 {"installer":{"name":"uv","version":"0.9.11"},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for oddish-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 2490e70acdf90a251339debd6c7772a0e5a68e9e7e631db497a8942106e54969
MD5 034453b576e61f93ffe813a497b6f7b7
BLAKE2b-256 d51ef99404ebaeaedc583dcbb4a3aff3c55044f130d7dcc49a212a5df3dce78f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page