Skip to main content

Postgres-backed eval scheduler for Harbor agent tasks — queuing, retries, and monitoring

Project description

Oddish

Python License

Oddish

Run evals on Harbor tasks at scale with queuing, retries, and monitoring.

Overview

Oddish extends Harbor with:

  • Provider-aware queuing and automatic retries for LLM providers
  • Real-time monitoring via dashboard or CLI
  • Postgres-backed state plus S3 for artifacts

Harbor compatibility: replace harbor run with oddish run.

Quick Start

1. Install

uv pip install -e .

2. Generate an Oddish API key here

export ODDISH_API_KEY="ok_..."

3. Submit a job

# Run a single agent
oddish run -d terminal-bench@2.0 -a codex -m gpt-5.2-codex --n-trials 3
# Or sweep multiple agents
oddish run -d terminal-bench@2.0 -c sweep.yaml
Example sweep.yaml
agents:
  - name: claude-code
    model_name: anthropic/claude-sonnet-4-5
    n_trials: 3
  - name: codex
    model_name: openai/gpt-5.2-codex
    n_trials: 3
  - name: gemini-cli
    model_name: google/gemini-3-flash-preview
    n_trials: 3

4. Monitor Progress

oddish status

Commands

  • oddish run — submit a job
  • oddish status — monitor progress
  • oddish clean — cleanup jobs

Documentation

Technical documentation lives in AGENTS.md.

Instructions for self-hosting are in SELF_HOSTING.md.

License

Apache License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oddish-0.1.0.tar.gz (936.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

oddish-0.1.0-py3-none-any.whl (89.4 kB view details)

Uploaded Python 3

File details

Details for the file oddish-0.1.0.tar.gz.

File metadata

  • Download URL: oddish-0.1.0.tar.gz
  • Upload date:
  • Size: 936.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for oddish-0.1.0.tar.gz
Algorithm Hash digest
SHA256 0723c75028fe208450e12c45be42f8de2e8ac4402fe4d21aab84ce21aa6982cc
MD5 558eaa906d785568e608637cca6afd5c
BLAKE2b-256 ff6cf33b7a21e70a58f2b4c682cecc83f894f47b8edbd9e8fdb3ebee9d07cc1a

See more details on using hashes here.

File details

Details for the file oddish-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: oddish-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 89.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for oddish-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f99489c1cbd1fa3cbe418f67c0fc5ce98e5df86ef1e734c82b8ccb50ceaaeea2
MD5 80988952267a2538562e45b5150140c0
BLAKE2b-256 b64754630b6d2039e7d0a5aa481168b3ac15838f1eb13517e42ef7f00d44aea5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page