A package for benchmarking the performance of arbitrary functions

These details have not been verified by PyPI

Project links

Project description

Bencher

Continuous Integration Status

Read the Docs

Install

pip install holobench

Intro

Bencher is a tool to make it easy to benchmark the interactions between the input parameters to your algorithm and its resulting performance on a set of metrics. It calculates the cartesian product of a set of variables

Parameters for bencher are defined using the param library as a config class with extra metadata that describes the bounds of the search space you want to measure. You must define a benchmarking function that accepts an instance of the config class and return a dictionary with string metric names and float values.

Parameters are benchmarked by passing in a list N parameters, and an N-Dimensional tensor is returned. You can optionally sample each point multiple times to get back a distribution and also track its value over time. By default the data will be plotted automatically based on the types of parameters you are sampling (e.g, continuous, discrete), but you can also pass in a callback to customize plotting.

The data is stored in a persistent database so that past performance is tracked.

Assumptions

The input types should also be of one of the basic datatypes (bool, int, float, str, enum, datetime) so that the data can be easily hashed, cached and stored in the database and processed with seaborn and xarray plotting functions. You can use class inheritance to define hierarchical parameter configuration class types that can be reused in a bigger configuration classes.

Bencher is designed to work with stochastic pure functions with no side effects. It assumes that when the objective function is given the same inputs, it will return the same output +- random noise. This is because the function must be called multiple times to get a good statistical distribution of it and so each call must not be influenced by anything or the results will be corrupted.

Pseudocode of bencher

Enumerate a list of all input parameter combinations
for each set of input parameters:
    pass the inputs to the objective function and store results in the N-D array

    get unique hash for the set of inputs parameters
    look up previous results for that hash
    if it exists:
        load historical data
        combine latest data with historical data
    
    store the results using the input hash as a key
deduce the type of plot based on the input and output types
return data and plot

Resource Management with `sampling_context`

If your benchmark holds external resources (DB pools, GPU handles, simulators) you may want to release them before the interactive result viewer starts. Wrapping the entire bn.run() call in a with block won't work — the context stays open while the Panel/Bokeh server blocks:

# Anti-pattern: resources held during the entire viewing session
with gpu_context():
    bn.run(my_bench, show=True)

Instead, pass the context manager as sampling_context. It wraps only the sampling phase; its __exit__ runs before the server starts:

bn.run(my_bench, show=True, sampling_context=gpu_context())

save and publish still execute inside the context (during sampling), so results are persisted before the resource is released.

Demo

if you have pixi installed you can run a demo example with:

pixi run demo

An example of the type of output bencher produces can be seen here:

https://blooop.github.io/bencher/

Examples

Most features are demonstrated in the auto-generated examples under bencher/example/generated/.

Run pixi run generate-docs to regenerate the full example gallery. Key sections include:

generated/N_float/ — Parameter sweeps with 0–3 float inputs, with/without repeats and over-time tracking
generated/plot_types/ — All supported plot types (scatter, line, heatmap, surface, etc.)
generated/result_types/ — Result types: images, videos, strings, booleans, paths, datasets
generated/composable_containers/ — Combining results with different composition strategies
generated/sampling/ — Custom values, levels, uniform, int vs float
generated/optimization/ — Single and multi-objective optimization with Optuna
generated/advanced/ — Time events, caching, aggregation over time
generated/regression/ — Performance regression detection
generated/statistics/ — Error bands, distributions, repeats comparison

A few hand-written examples remain for unique functionality:

example_simple_float.py — Minimal getting-started example
example_image.py / example_video.py — Image and video result types
example_self_benchmark.py — Bencher self-introspection
example_workflow.py — Multi-stage optimization workflow

Documentation

More documentation is needed for the examples and general workflow.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.99.0

May 14, 2026

1.98.0

Apr 27, 2026

1.97.0

Apr 27, 2026

1.96.0

Apr 26, 2026

1.95.0

Apr 26, 2026

1.94.0

Apr 25, 2026

1.93.0

Apr 25, 2026

1.92.0

Apr 22, 2026

1.91.0

Apr 22, 2026

1.90.0

Apr 22, 2026

1.89.0

Apr 21, 2026

1.88.0

Apr 20, 2026

1.87.0

Apr 19, 2026

1.86.0

Apr 19, 2026

1.85.1

Apr 19, 2026

1.85.0

Apr 18, 2026

1.84.0

Apr 15, 2026

1.83.0

Apr 12, 2026

1.82.1

Apr 11, 2026

1.82.0

Apr 9, 2026

1.81.0

Apr 8, 2026

1.80.2

Apr 7, 2026

1.80.1

Apr 6, 2026

1.80.0

Apr 6, 2026

1.79.0

Apr 5, 2026

1.78.0

Apr 2, 2026

1.77.0

Apr 2, 2026

1.76.0

Apr 1, 2026

1.75.2

Mar 31, 2026

1.75.1

Mar 31, 2026

1.75.0

Mar 30, 2026

1.74.0

Mar 29, 2026

1.73.1

Mar 27, 2026

1.73.0

Mar 26, 2026

1.72.5

Mar 25, 2026

1.72.4

Mar 24, 2026

1.72.3

Mar 24, 2026

1.72.2

Mar 23, 2026

1.72.1

Mar 22, 2026

1.72.0

Mar 22, 2026

1.71.0

Mar 21, 2026

1.70.5

Mar 21, 2026

1.70.4

Mar 20, 2026

1.70.3

Mar 20, 2026

1.70.2

Mar 20, 2026

1.70.1

Mar 19, 2026

1.70.0

Mar 18, 2026

1.69.0

Mar 18, 2026

1.68.0

Mar 17, 2026

1.67.0

Mar 16, 2026

1.66.3

Mar 14, 2026

1.66.2

Mar 13, 2026

1.66.1

Mar 13, 2026

1.66.0

Mar 12, 2026

1.65.0

Mar 11, 2026

1.64.0

Mar 11, 2026

1.63.0

Mar 9, 2026

1.62.0

Mar 6, 2026

1.61.0

Feb 21, 2026

1.60.0

Jan 24, 2026

1.59.0

Jan 24, 2026

1.58.1

Oct 10, 2025

1.58.0

Oct 1, 2025

1.57.0

Sep 23, 2025

1.56.1

Sep 16, 2025

1.56.0

Sep 16, 2025

1.55.0

Sep 9, 2025

1.54.2

Sep 8, 2025

1.54.1

Sep 5, 2025

1.54.0

Sep 5, 2025

1.53.1

Aug 30, 2025

1.53.0

Aug 29, 2025

1.52.0

Aug 27, 2025

1.51.3

Aug 27, 2025

1.51.2

Aug 11, 2025

1.51.1

Aug 8, 2025

1.51.0

Aug 7, 2025

1.50.0

Aug 6, 2025

1.49.0

Aug 5, 2025

1.48.0

Aug 1, 2025

1.47.0

Jul 19, 2025

1.46.0

Jul 8, 2025

1.45.0

Jul 8, 2025

1.44.0

Jun 23, 2025

1.43.0

Jun 2, 2025

1.42.0

May 19, 2025

1.41.0

Feb 23, 2025

1.40.1

Feb 11, 2025

1.40.0

Feb 4, 2025

1.39.0

Feb 3, 2025

1.38.0

Feb 3, 2025

1.36.2

Jan 27, 2025

1.36.1

Jan 26, 2025

1.36.0

Jan 21, 2025

1.35.0

Jan 18, 2025

1.34.0

Jan 17, 2025

1.33.2

Jan 2, 2025

1.33.1

Jan 2, 2025

1.33.0

Dec 14, 2024

1.32.0

Dec 5, 2024

1.31.1

Nov 24, 2024

1.31.0

Nov 24, 2024

1.30.3

Nov 24, 2024

1.30.2

Nov 16, 2024

1.30.1

Nov 2, 2024

1.30.0

Oct 2, 2024

1.29.0

Jul 1, 2024

1.28.1

Jun 25, 2024

1.28.0

Jun 24, 2024

1.27.0

Jun 16, 2024

1.26.3

Jun 15, 2024

1.25.2 yanked

Jun 14, 2024