Biomolecular Emulator - Benchmarks (BioEmu-Benchmarks)

Accompanying benchmark code for the BioEmu paper. For the BioEmu sampling code, please see the main BioEmu repository.

Installation

bioemu-benchmarks is provided as a pip-installable package, requiring Python >= 3.10, < 3.13:

pip install bioemu-benchmarks

For development, clone the repository and install with dev dependencies:

pip install -e ".[dev]"

Available benchmarks

bioemu-benchmarks implements the following benchmarks for evaluating the emulation performance of models:

  • multiconf_ood60: Measures local protein conformational changes on a set that differs from the training set in terms of sequence similarity.
  • multiconf_domainmotion: Measures global protein domain motions. The main metric is global RMSD.
  • singleconf_localunfolding: Measures local protein unfolding. The main metric is a custom fraction-of-native-contacts calculation on a predefined set of residues.
  • multiconf_crypticpocket: Measures pocket backbone changes upon ligand binding. The default metric is local RMSD on a predefined set of residues in, or close to, a given binding pocket.
  • md_emulation: Measures the match between model samples and the target molecular dynamics distribution on low-dimensional free-energy surfaces. Metrics include the free-energy MAE and RMSE, as well as sample coverage.
  • folding_free_energies: Measures the ability to predict folding free energies ($\Delta G$ and $\Delta\Delta G$) from the provided samples. The main metrics are MAEs and correlation coefficients for both quantities.

For details of the different benchmarks, please refer to the SI of the accompanying publication.
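To illustrate the kind of metrics the folding_free_energies benchmark reports, the sketch below computes an MAE and Pearson correlation between hypothetical predicted and reference $\Delta G$ values. The toy data and the function name free_energy_metrics are assumptions for illustration, not part of the benchmark code:

```python
import numpy as np

def free_energy_metrics(dg_pred: np.ndarray, dg_ref: np.ndarray) -> dict:
    """MAE and Pearson correlation between predicted and reference free energies."""
    mae = float(np.mean(np.abs(dg_pred - dg_ref)))
    pearson_r = float(np.corrcoef(dg_pred, dg_ref)[0, 1])
    return {"mae": mae, "pearson_r": pearson_r}

# Toy free energy values in kcal/mol (made up for illustration).
dg_ref = np.array([-2.0, -1.0, 0.5, 1.5])
dg_pred = np.array([-1.5, -1.2, 0.8, 1.0])
print(free_energy_metrics(dg_pred, dg_ref))
```

The actual benchmark computes these quantities from sampled ensembles; see the SI of the publication for the exact procedure.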

Usage

bioemu-benchmarks provides both a Python interface and a regular CLI. The CLI is intended to provide quick access to the different benchmarks, while the Python API offers more flexibility.

Sample format

To run the benchmarks in this repository, you will need to prepare samples in .xtc format so that they can be loaded as mdtraj.Trajectory objects. Each .xtc file needs an accompanying .pdb file that defines the topology. When loading samples, for each .xtc file the code will look for a .pdb file with the same base name or, failing that, a topology.pdb file in the same directory. For example, you can store your samples like this:

my_samples/
├── foo.pdb
├── foo.xtc
├── bar.pdb
├── bar.xtc
...

or like this (the layout BioEmu itself produces):

my_samples/
├── foo/
│   ├── samples.xtc
│   └── topology.pdb
└── bar/
    ├── samples.xtc
    └── topology.pdb
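The lookup rule described above (a same-named .pdb next to the .xtc, else a topology.pdb in the same directory) can be sketched with a small helper. find_topology is a hypothetical name for illustration, not a function in the package:

```python
from pathlib import Path

def find_topology(xtc_path: str) -> Path:
    """Resolve the topology .pdb for an .xtc file: prefer a .pdb with the
    same base name, otherwise fall back to topology.pdb in the same dir."""
    xtc = Path(xtc_path)
    same_name = xtc.with_suffix(".pdb")
    if same_name.exists():
        return same_name
    fallback = xtc.parent / "topology.pdb"
    if fallback.exists():
        return fallback
    raise FileNotFoundError(f"No topology .pdb found for {xtc}")
```

A resolved pair could then be loaded with mdtraj, e.g. mdtraj.load(str(xtc), top=str(find_topology(xtc))).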

To determine which sequences to sample for each benchmark, check the testcases.csv file under the assets/<benchmark_type>/<benchmark> folder in this repository. Alternatively, either the Python API or the CLI can be used to obtain sample specifications (see the examples below).

Bash CLI

Upon installation, the bioemu-bench benchmark script is added to the PATH. This script provides a simple CLI for running benchmarks, collecting results, and getting benchmark specifications (sequences to sample and the recommended number of samples).

Loading samples and running benchmarks

To run one or more benchmarks, use the eval mode of the script:

bioemu-bench eval <output_dir> --benchmarks / -b [...] --sample_dirs / -s [...]

<output_dir> is the directory to which results will be written, and --benchmarks specifies the benchmarks to evaluate on the given samples. It accepts a single benchmark or a list of benchmarks (for the available benchmarks, see bioemu-bench eval --help); passing --benchmarks all runs all benchmarks. The --sample_dirs option takes the path to one or more directories from which samples will be loaded (in the format described above).

bioemu-bench eval will collect results in the <output_dir>, with each requested benchmark getting its own subdirectory:

<output_dir>
├── benchmark_metrics.json
├── domainmotion
│   ├── ...
│   └── results.pkl
...   ...
├── folding_free_energies
│   ├── ...
│   └── scatter_dG.png
└── md_emulation
    ├── ...
    └── results_projections.npz

The file benchmark_metrics.json collects aggregate metrics for all benchmarks in json format.
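The aggregate metrics can be read back with the standard library. Note that the exact keys in the JSON depend on which benchmarks were run; the structure shown in the comments is a made-up illustration:

```python
import json
from pathlib import Path

def load_metrics(output_dir: str) -> dict:
    """Load the aggregate metrics written by `bioemu-bench eval`."""
    path = Path(output_dir) / "benchmark_metrics.json"
    return json.loads(path.read_text())

# Example usage (keys are illustrative, not a documented schema):
# metrics = load_metrics("results")
# for benchmark, values in metrics.items():
#     print(benchmark, values)
```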

Getting sample specifications

The specs mode of bioemu-bench can be used to generate a CSV file collecting sequence information and recommended number of samples for the requested benchmarks:

bioemu-bench specs <output_csv> --benchmarks/-b [...] 

<output_csv> is the path of the CSV file generated by the script. --benchmarks again specifies the benchmarks for which this information should be generated.
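The generated CSV can then be consumed like any other CSV file, for example to drive a sampling loop. The column names used below (sequence, num_samples) are assumptions about the file layout, not a documented schema; check the generated file for the actual columns:

```python
import csv

def read_specs(csv_path: str) -> list[dict]:
    """Read sample specifications from a CSV written by `bioemu-bench specs`.
    Rows are returned as dicts keyed by the CSV header."""
    with open(csv_path, newline="") as fh:
        return list(csv.DictReader(fh))

# Hypothetical usage, assuming `sequence` and `num_samples` columns:
# for spec in read_specs("specs.csv"):
#     sample(spec["sequence"], n_samples=int(spec["num_samples"]))
```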

Python API

The Python API provides a set of tools for evaluating samples according to a benchmark, the central ones being:

  • Benchmark: an Enum that defines the available benchmarks in the repository.
  • IndexedSamples: used for loading, validating, and optionally filtering samples via the filter_unphysical_samples function available in the same module.
  • Evaluator functions: these define the evaluations to be performed and are called on an instance of IndexedSamples. evaluator_utils.py provides a function for retrieving the evaluator function for each Benchmark.
  • BenchmarkResults classes: each evaluator returns a BenchmarkResults instance as output. These classes collect the benchmark results and offer utilities for storing results (save_results), plotting (plot), and computing aggregate metrics (get_aggregate_metrics).

Loading samples and running benchmarks

An example of running the multiconf_ood60 benchmark on a set of samples looks like the following:

from bioemu_benchmarks.benchmarks import Benchmark
from bioemu_benchmarks.samples import IndexedSamples, filter_unphysical_samples, find_samples_in_dir
from bioemu_benchmarks.evaluator_utils import evaluator_from_benchmark

# Specify the benchmark you want to run (e.g., OOD60)
benchmark = Benchmark.MULTICONF_OOD60

# Discover samples in the given directory
sequence_samples = find_samples_in_dir("/path/to/your/sample_dir")

# Load and validate the samples for the chosen benchmark
samples = IndexedSamples.from_benchmark(benchmark=benchmark, sequence_samples=sequence_samples)

# Filter out unphysical-looking samples before evaluation
samples, _sample_stats = filter_unphysical_samples(samples)

# Instantiate an evaluator for the given benchmark
evaluator = evaluator_from_benchmark(benchmark=benchmark)

# `results` has methods for plotting / computing summary metrics
results = evaluator(samples)
results.plot('/path/to/result/plots')
results.save_results('/path/to/result/metrics/')

Getting sample specifications

The Python API also exposes information on the sequences to sample for a benchmark through its metadata attribute:

from bioemu_benchmarks.benchmarks import Benchmark

# Returns a pandas df with info about the Multiconf OOD60 benchmark
metadata = Benchmark.MULTICONF_OOD60.metadata

Citation

If you are using our code or model, please consider citing our work:

@article{bioemu2025,
  title={Scalable emulation of protein equilibrium ensembles with generative deep learning},
  author={Lewis, Sarah and Hempel, Tim and Jim{\'e}nez-Luna, Jos{\'e} and Gastegger, Michael and Xie, Yu and Foong, Andrew YK and Satorras, Victor Garc{\'\i}a and Abdin, Osama and Veeling, Bastiaan S and Zaporozhets, Iryna and others},
  journal={Science},
  pages={eadv9817},
  year={2025},
  publisher={American Association for the Advancement of Science}
}

Licensing

The code of this project is licensed under the MIT License. See the LICENSE file for details. The accompanying dataset is licensed under the Community Data License Agreement – Permissive, Version 2.0 (CDLA-Permissive-2.0). See the DATASET_LICENSE for details.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos is subject to those third parties' policies.

Get in touch

If you have any questions not covered here, please create an issue or contact the BioEmu team by writing to the corresponding author on our paper.
