Postgres-backed eval scheduler for Harbor agent tasks — queuing, retries, and monitoring
Oddish
Run evals on Harbor tasks at scale with queuing, retries, and monitoring.
Overview
Oddish extends Harbor with:
- Provider-aware queuing and automatic retries for LLM providers
- Real-time monitoring via dashboard or CLI
- Postgres-backed state plus S3 for artifacts
Harbor compatibility: replace harbor run with oddish run.
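To make "provider-aware queuing and automatic retries" concrete, here is a minimal in-memory sketch of the idea. It is not Oddish's implementation, which keeps its state in Postgres. The per-provider limits, `MAX_ATTEMPTS`, and the `schedule` and `run_trial` names are all hypothetical, and concurrency is simulated with sequential rounds rather than real workers.

```python
from collections import defaultdict, deque

# Hypothetical per-provider concurrency caps; not Oddish's actual settings.
PROVIDER_LIMITS = {"anthropic": 2, "openai": 1}
MAX_ATTEMPTS = 3  # hypothetical retry budget per trial


def provider_of(model_name: str) -> str:
    """Take the provider prefix from a name like 'openai/gpt-5.2-codex'."""
    return model_name.split("/", 1)[0]


def schedule(trials, run_trial):
    """Run trials in rounds, taking at most each provider's cap per round.

    `trials` is a list of (trial_id, model_name) pairs and `run_trial` is a
    callable returning True on success. Failed trials are requeued until
    MAX_ATTEMPTS is reached. Returns {trial_id: (status, attempts)}.
    """
    # One FIFO queue per provider, so a slow provider cannot starve the others.
    queues = defaultdict(deque)
    for trial_id, model_name in trials:
        queues[provider_of(model_name)].append((trial_id, model_name, 1))

    results = {}
    while any(queues.values()):
        for provider, queue in queues.items():
            limit = PROVIDER_LIMITS.get(provider, 1)
            batch = [queue.popleft() for _ in range(min(limit, len(queue)))]
            for trial_id, model_name, attempt in batch:
                if run_trial(trial_id, model_name):
                    results[trial_id] = ("success", attempt)
                elif attempt < MAX_ATTEMPTS:
                    # Retry later, behind anything already queued for this provider.
                    queue.append((trial_id, model_name, attempt + 1))
                else:
                    results[trial_id] = ("failed", attempt)
    return results
```

A real scheduler would also back off between attempts and persist each state change so that a crashed worker does not lose queued trials.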
Quick Start
1. Install
uv pip install -e .
2. Generate an Oddish API key here
export ODDISH_API_KEY="ok_..."
3. Submit a job
# Run a single agent
oddish run -d terminal-bench@2.0 -a codex -m gpt-5.2-codex --n-trials 3
# Or sweep multiple agents
oddish run -d terminal-bench@2.0 -c sweep.yaml
Example sweep.yaml
agents:
  - name: claude-code
    model_name: anthropic/claude-sonnet-4-5
    n_trials: 3
  - name: codex
    model_name: openai/gpt-5.2-codex
    n_trials: 3
  - name: gemini-cli
    model_name: google/gemini-3-flash-preview
    n_trials: 3
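As a rough illustration of what a sweep like this implies, the sketch below expands the config into individual trials for a single task. It assumes each agent entry runs `n_trials` times per task, which this README does not state explicitly. The `Trial` class and `expand_sweep` function are hypothetical names, and the config is written as a Python dict to avoid a YAML dependency.

```python
from dataclasses import dataclass

# The sweep.yaml above, written out as a plain dict.
SWEEP = {
    "agents": [
        {"name": "claude-code", "model_name": "anthropic/claude-sonnet-4-5", "n_trials": 3},
        {"name": "codex", "model_name": "openai/gpt-5.2-codex", "n_trials": 3},
        {"name": "gemini-cli", "model_name": "google/gemini-3-flash-preview", "n_trials": 3},
    ]
}


@dataclass(frozen=True)
class Trial:
    """One run of one agent/model pair (hypothetical structure)."""
    agent: str
    model_name: str
    index: int


def expand_sweep(sweep: dict) -> list[Trial]:
    """Produce one Trial per agent per repetition, in config order."""
    return [
        Trial(agent["name"], agent["model_name"], i)
        for agent in sweep["agents"]
        for i in range(agent["n_trials"])
    ]


trials = expand_sweep(SWEEP)
print(len(trials))  # 3 agents x 3 trials = 9
```

Under this assumption, the total number of runs for a dataset would be this count multiplied by the number of tasks in the dataset.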
4. Monitor progress
oddish status
Commands
- oddish run: submit a job
- oddish status: monitor progress
- oddish clean: clean up jobs
Documentation
Technical documentation lives in AGENTS.md.
Instructions for self-hosting are in SELF_HOSTING.md.
License