Optimizers from the nucleobench package.

These details have not been verified by PyPI

Project links

Operating System
- OS Independent
Programming Language

Project description

NucleoBench: A Large-Scale Benchmark of Neural Nucleic Acid Design Algorithms

We have developed a new, large-scale benchmark to compare modern nucleic acid sequence design algorithms (NucleoBench). We also present a new hybrid design algorithm that outperforms existing designers (AdaBeam). Please see https://github.com/move37-labs/nucleobench for more details.

NucleBench is a large-scale comparison of modern sequence design algorithms across 16 biological tasks (such as transcription factor binding and gene expression) and 9 design algorithms. NucleoBench, compares design algorithms on the same tasks and start sequences across more than 400K experiments, allowing us to derive unique modeling insights on the importance of using gradient information, the role of randomness, scaling properties, and reasonable starting hyperparameters on new problems. We use these insights to present a novel hybrid design algorithm, AdaBeam, that outperforms existing algorithms on 11 of 16 tasks and demonstrates superior scaling properties on long sequences and large predictors. Our benchmark and algorithms are freely available online.

We describe NucleoBench and AdaBeam in the paper "NucleoBench: A Large-Scale Benchmark of Neural Nucleic Acid Design Algorithms", to appear at the 2025 ICML GenBio Workshop.

This repo is intended to be used in a few days:

Run any of the NucleoBench design algorithms on a new design problem.
Run AdaBeam on a new design problem.
Run a new design algorithm on NucleoBench tasks, and avoid recomputing performances for existing designers.

results

Setup
- Installation
  - PyPi
  - Source
  - Docker
Usage
Citation

Setup

NucleoBench is provided via PyPi, source, or Docker.

Installation

PyPi

pip install nucleobench  # optimizers and tasks
pip install nucleopt  # smaller, faster install for just optimizers

Then you can use it in python:

from nucleobench import optimizations
opt = optimizations.get_optimization('beam_search_unordered')  # Any optimizer name.

Source

# Clone the repo.
git clone https://github.com/move37-labs/nucleobench.git
cd nucleobench

# Create and activate the conda environment.
conda env create -f environment.yml
conda activate nucleobench

# Run all the unittests.
pytest nucleobench/

You can also run the integration tests, which require an internet connection:

pytest docker_entrypoint_test.py

Docker

To help deploy NucleoBench to the cloud, we've created a Docker container. To build it yourself, see the top of Dockerfile for instructions. One way of creating a docker file is:

docker build -t nucleobench -f Dockerfile .

Usage

Recipes

See the recipes/colab folder for examples of how to run the designers with PyPi. See the recipes/docker folder for examples of how to run the designers with Docker. See the recipes/python folder for examples of how to run the designers with the cloned github repo.

Python, commandline

An example of how to run on the commandline, using Python:

python -m docker_entrypoint \
    --model bpnet \
        --protein 'ATAC' \
    --optimization adabeam \
        --beam_size 2 \
        --n_rollouts_per_root 4 \
        --mutations_per_sequence 2 \
        --rng_seed 0 \
    --max_seconds 240 \
    --optimization_steps_per_output 5 \
    --proposals_per_round 2 \
    --output_path ./python_recipe/adabeam_atac \
    --start_sequence {YOUR START SEQUENCE}

Docker, commandline

An example of how to run on the commandline, using Docker:

readonly output="./output/docker_recipe/adabeam_atac"
mkdir -p "${output}"
readonly fullpath="$(realpath $output)"

docker build -t nucleobench-docker -f Dockerfile .
docker run \
    -v "${fullpath}":"${fullpath}"  \
    "${docker_image_name}" \
    --model bpnet \
        --protein 'ATAC' \
    --optimization adabeam \
        --beam_size 2 \
        --n_rollouts_per_root 4 \
        --mutations_per_sequence 2 \
        --rng_seed 0 \
    --max_seconds 240 \
    --optimization_steps_per_output 5 \
    --proposals_per_round 2 \
    --output_path ${fullpath} \
    --start_sequence {YOUR START SEQUENCE}

Python, code

Below is an example of how to download NucleoBench and use it:

"""Initialize the task."""
from nucleobench import models
# Design for a simple task: count the number of occurances of a particular substring.
# See `nucleobench.models.__init__` for a registry of tasks, or add your own.
model_obj = models.get_model('substring_count')

# Every task has some baseline, default arguments to initialize. We can use
# these to demonstrate, or modify them for custom behavior. We do both, to
# demonstrate.
model_init_args = model_obj.debug_init_args()
model_init_args['substring'] = 'ATGTC'
model_fn = model_obj(**model_init_args)

"""Initialize the designer."""
from nucleobench import optimizations
# Pick a design algorithm that attemps to solve the task. In this case,
# maximize the number of substrings.
opt_obj = optimizations.get_optimization('adabeam')
# Every task has some baseline, default arguments to initialize. We can use
# these to demonstrate, or modify them for custom behavior. We do both, to
# demonstrate.
opt_init_args = opt_obj.debug_init_args()
opt_init_args['model_fn'] = model_fn
opt_init_args['start_sequence'] = 'A' * 100
designer = opt_obj(**opt_init_args)

"""Run the designer and show the results."""
designer.run(n_steps=100)
ret = designer.get_samples(1)
ret_score = model_fn(ret)
print(f'Final score: {ret_score[0]}')
print(f'Final sequence: {ret[0]}')

Citation

Please cite the following publication when referencing NucleoBench or AdaBeam:

@inproceedings{nucleobench,
  author    = {Joel Shor and Erik Strand and Cory Y. McLean},
  title     = {{NucleoBench: A Large-Scale Benchmark of Neural Nucleic Acid Design Algorithms}},
  booktitle = {GenBio ICML 2025},
  year = {2025},
  publisher = {PMLR},
  url = {https://www.biorxiv.org/content/10.1101/2025.06.20.660785},
  doi = {10.1101/2025.06.20.660785},
}

Project details

These details have not been verified by PyPI

Project links

Operating System
- OS Independent
Programming Language

Release history Release notifications | RSS feed

2.0.3

Apr 12, 2026

2.0.2

Apr 12, 2026

2.0.1

Apr 12, 2026

2.0.0

Apr 10, 2026

1.0.5

Mar 13, 2026

1.0.4

Aug 30, 2025

1.0.3

Aug 7, 2025

This version

1.0.2

Jul 12, 2025

1.0.1

Jun 26, 2025

1.0.0

Jun 21, 2025

0.1.7

Apr 24, 2025

0.1.6

Apr 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nucleopt-1.0.2.tar.gz (55.7 kB view details)

Uploaded Jul 12, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

nucleopt-1.0.2-py3-none-any.whl (75.2 kB view details)

Uploaded Jul 12, 2025 Python 3

File details

Details for the file nucleopt-1.0.2.tar.gz.

File metadata

Download URL: nucleopt-1.0.2.tar.gz
Upload date: Jul 12, 2025
Size: 55.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for nucleopt-1.0.2.tar.gz
Algorithm	Hash digest
SHA256	`8aa5d51c9e8b36001f41ad2a1c9f736957bb6863822d41f9bafcc9a90ab4a44d`
MD5	`31510d71e4f2f9e450504f4bcd88ce03`
BLAKE2b-256	`9bf478bbb0b00e07c129bcb6f248e64cb797f2ecde49b2e23f806d5bb5bfdb0a`

See more details on using hashes here.

File details

Details for the file nucleopt-1.0.2-py3-none-any.whl.

File metadata

Download URL: nucleopt-1.0.2-py3-none-any.whl
Upload date: Jul 12, 2025
Size: 75.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for nucleopt-1.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cc731eec1f700dd6b4525c11b2d2a9e1c13a383c898b36e9c79bc7a3319380b3`
MD5	`98a2dc7300ff5bf5993022e637dfe49e`
BLAKE2b-256	`6665077a81b62d1641e89ae4bc0cd9a27ab5b9e556e221aa9ab6c96f5fa007fd`

See more details on using hashes here.

nucleopt 1.0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

NucleoBench: A Large-Scale Benchmark of Neural Nucleic Acid Design Algorithms

Contents

Setup

Installation

PyPi

Source

Docker

Usage

Recipes

Python, commandline

Docker, commandline

Python, code

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes