Skip to main content

This repository contains code for the DiscoBench modular benchmark for automated algorithm discovery.

Project description

DiscoBench: An Open-Ended Benchmark For Algorithm Discovery

DiscoBench Logo

This repository contains code for the DiscoBench modular benchmark for automated algorithm discovery.

Quick Start

Install DiscoBench:

pip install discobench

List available domains:

discobench get-domains

Create a task:

discobench create-task --task-domain OnPolicyRL

See the full documentation for detailed usage. Please note that each task_domain has its own set of requirements which may need to be installed.

Every domain includes references in discobench/tasks/<task_domain>/utils/_reference.txt.

Task Domains

Task Domain Modules Datasets Description
BayesianOptimisation acq_fn, acq_optimizer, domain, next_queries, surrogate, surrogate_optimizer 11 synthetic functions (Ackley, Branin, Bukin, Cosine, DropWave, EggHolder, Griewank, Hartmann, HolderTable, Levy). Optimization of black-box functions using surrogate models to find global minima/maxima.
BrainSpeechDetection loss, networks, optim 7 LibriBrain tasks. Detecting speech features directly from brain activity data.
ComputerVisionClassification loss, networks, optim, preprocess CIFAR10, CIFAR10C, CIFAR10LT, CIFAR100, FashionMNIST, MNIST, OxfordFlowers, StanfordCars, TinyImageNet. Image classification on a range of datasets.
ContinualLearning optim, regularizer, replay, sampler, scheduler PermutedMNIST, SplitCIFAR100, TinyImageNetSplit. Training a model on continually changing data, such that it can adapt to new data without losing old capabilities.
GreenhouseGasPrediction data_processing, model 4 Mauna Loa Time-series (CO2, N2O, SF6, CH4). Time-series forecasting of atmospheric greenhouse gas concentrations.
LanguageModelling loss, networks, optimizer OPCFineWebCode, OPCFineWebMath, LMFineWeb, TinyStories. Training transformer-based models on code, mathematics, and narrative text.
ModelUnlearning loss MUSE, TOFU, WMDP_Cyber. Fine-tuning pretrained models to remove specific knowledge or data points while retaining others.
OffPolicyRL q_update, policy, networks, optim, rb, train 4 MinAtar games. Value-based RL for training an agent in MinAtar.
OnPolicyRL loss, networks, optim, train 4 MinAtar, 7 Brax, 2 Craftax. Training an RL agent in a range of different RL environments using PPO-style algorithms.
UnsupervisedEnvironmentDesign sample_levels, train_step, train_loop, config 3 Kinetix sizes, Minigrid. Generating and curating training environments/levels to improve RL agent generalization.

Development Setup

1. Set Up Your Development Environment

Install the environment and the pre-commit hooks with:

make install

This will also generate your uv.lock file.

Contributing

We welcome contributions! DiscoBench grows stronger with more tasks and domains.

See CONTRIBUTING.md for detailed development guidelines.

Citation

If you use DiscoBench in your research, please cite:

@article{goldie2025discobench,
  title={DiscoBench: An Open-Ended Benchmark For Algorithm Discovery},
  author={Alexander D. Goldie and Zilin Wang and Adrian Hayler and Deepak Nathani and Edan Toledo and Ken Thampiratwong and Aleksandra Kalisz and Michael Beukman and Alistair Letcher and Shashank Reddy and Clarisse Wibault and Theo Wolf and Charles O'Neill and Jakob N. Foerster and Shimon Whiteson and Roberta Raileanu},
  year={2025}
}

License

DiscoBench is released under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

discobench-0.1.0.tar.gz (1.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

discobench-0.1.0-py3-none-any.whl (588.9 kB view details)

Uploaded Python 3

File details

Details for the file discobench-0.1.0.tar.gz.

File metadata

  • Download URL: discobench-0.1.0.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for discobench-0.1.0.tar.gz
Algorithm Hash digest
SHA256 f183b3c6ebb0dcd7ad27b61c7e19a872941ab23195e0561fe6d74bb59687c894
MD5 f6a43875417898588a9bcc73bc3dec26
BLAKE2b-256 5e3510108c052dc7dd5dfab03133c82e6e6f5252a37b4ba2d59b2ba85454a716

See more details on using hashes here.

File details

Details for the file discobench-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: discobench-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 588.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for discobench-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 f97e4022002311bb9d6ec5d7c6f0feb11bb138fff185919cd795eb8969bce44a
MD5 8e61ebb9f207c290299796d2d984aa1f
BLAKE2b-256 2c8b848e67a6d5136f6b7bd8a105a4b978f6292a90e8c0339aa7bb75d115495d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page