A framework for building and using sandboxed environments for agentic evals and rollouts.
Project description
Harbor
Harbor is a framework for evals, post-training, and prompt optimization using agentic environments.
Installation
uv tool install harbor
pip install harbor
Getting started
Run the following command to see a list of all available commands:
harbor --help
Running an eval
The primary command is harbor run, which is used to run evals or generate rollouts.
harbor run --help
To view registered datasets, run
harbor datasets list
Running a registered dataset
To evaluate an agent and model one of these datasets, you can use the following command:
harbor run -d "<dataset@version>" -m "<model>" -a "<agent>"
Harbor will automatically download registered datasets.
Running a local dataset
Local datasets (directories of tasks) can also be run using
harbor run -p "<path/to/dataset>" -m "<model>" -a "<agent>"
Running a cloud sandbox
To run using a cloud sandbox provider like Daytona, you can use the following command:
harbor run -d "<dataset@version>" -m "<model>" -a "<agent>" --env "daytona" -n 32
If you run a cloud sandbox using an API model, trials become I/O bounded rather than compute bounded, which means you can typically parallelize far above your CPU count (the example command above runs 32 trials concurrently).
Sandboxed agent evaluations are often slow, because they can require many turns to complete and each command requires time to execute. Horizontal scaling becomes the only viable way to accelerate experimentation, so we recommend using a cloud sandbox provider like Daytona.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file harbor-0.1.2.tar.gz.
File metadata
- Download URL: harbor-0.1.2.tar.gz
- Upload date:
- Size: 114.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
35cedafa05e891a1619749d8024a10bb1d99715560968ad5930e44a609fcd721
|
|
| MD5 |
4f24bd3f848863c495362aa70c46d191
|
|
| BLAKE2b-256 |
12b34350e8e33ac51eba857df3c413336feb78f6aa77762b6663def92e3d9340
|
File details
Details for the file harbor-0.1.2-py3-none-any.whl.
File metadata
- Download URL: harbor-0.1.2-py3-none-any.whl
- Upload date:
- Size: 169.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1478347edbfcc1ced2122815a67392dada4a40d715972a2e5a280858d457b7a5
|
|
| MD5 |
08883ff2266b59797dde7cd459083344
|
|
| BLAKE2b-256 |
b9e28ec074131878477edaaa3dc7f567977568df55b857083f1e804cc30ef70d
|