Skip to main content

Benchmarks for BO on LLM Tasks

Project description

CI codecov License: MIT Python

A benchmark suite for Bayesian optimization of expensive LLM tasks. Each problem is backed by a pretrained neural-network surrogate or tabular data from real LLM experiments, so evaluations are fast and reproducible without running real LLM training.

Documentation

Full documentation is available at bolt-bench.readthedocs.io.

Installation

pip install bolt-bench

Quick Start

import torch
from bolt import HPO

# 7-dim HPO problem: returns a scalar surrogate of eval score
prob = HPO(noise_std=0.001, negate=False)

X = torch.Tensor([[0, 2, 2, 2, 0.5, 30, 2]])  # one candidate configuration
y = prob(X)  # shape: (1,)

Problems

Problem Class Dims Notes
HPO HPO 7 mixed params (continuous, discrete, categorical)
HPO multi-fidelity (token) HPOMultiFidelityToken 8 mixed params (continuous, discrete, categorical), fidelity: continuous ∈ [0, 1] (training tokens)
HPO multi-fidelity (model) HPOMultiFidelityModel 8 mixed params (continuous, discrete, categorical), fidelity: discrete ∈ {0, 1} (model size)
Data mixture DMCurriculum 6 two simplex constraints
Data mixture MO DMCurriculumMO 6 two simplex constraints, multi-objective (3)
Data mixture with heteroscedastic noise DMCurriculumHet 6 two simplex constraints, heteroscedastic noise
Prompt optimization (128-dim) PO128 128 discrete candidate set
Prompt optimization (256-dim) PO256 256 discrete candidate set
Prompt optimization (512-dim) PO512 512 discrete candidate set
Prompt optimization (768-dim) PO768 768 discrete candidate set

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bolt_bench-0.1.0.tar.gz (94.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bolt_bench-0.1.0-py3-none-any.whl (19.8 kB view details)

Uploaded Python 3

File details

Details for the file bolt_bench-0.1.0.tar.gz.

File metadata

  • Download URL: bolt_bench-0.1.0.tar.gz
  • Upload date:
  • Size: 94.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for bolt_bench-0.1.0.tar.gz
Algorithm Hash digest
SHA256 b66e04fce2240b65595901e47512533397ac1ee30d4c9bd07bbd9db062e9471d
MD5 53b1373dafd778b2fbcd84b40049bdea
BLAKE2b-256 edea88ef4da5db4b94b8aeac77680528936a9139d9a22a4ffb4b39913d46c74a

See more details on using hashes here.

Provenance

The following attestation bundles were made for bolt_bench-0.1.0.tar.gz:

Publisher: release.yml on chewwt/bolt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file bolt_bench-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: bolt_bench-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 19.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for bolt_bench-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3a5b60aaff49b6ecc2792fccd4a91e67a79801dc7833b7c5c59b65561e30dee8
MD5 ea64de77549ba1ced961dd928e17114e
BLAKE2b-256 373fc81647364486094006200a11a0b6239d18ab68ebf983cb94832f098130ba

See more details on using hashes here.

Provenance

The following attestation bundles were made for bolt_bench-0.1.0-py3-none-any.whl:

Publisher: release.yml on chewwt/bolt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page