Significance Analysis for HPO-algorithms performing on multiple benchmarks
Project description
Significance Analysis
This package is used to analyse datasets of different HPO-algorithms performing on multiple benchmarks.
Note
As indicated with the v0.x.x
version number, Significance Analysis is early stage code and APIs might change in the future.
Documentation
Please have a look at our example. The dataset should have the following format:
system_id (algorithm name) |
input_id (benchmark name) |
metric (mean/estimate) |
optional: bin_id (budget/traininground) |
---|---|---|---|
Algorithm1 | Benchmark1 | x.xxx | 1 |
Algorithm1 | Benchmark1 | x.xxx | 2 |
Algorithm1 | Benchmark2 | x.xxx | 1 |
... | ... | ... | ... |
Algorithm2 | Benchmark2 | x..xxx | 2 |
In this dataset, there are two different algorithms, trained on two benchmarks for two iterations each. The variable-names (system_id, input_id...) can be customized, but have to be consistent throughout the dataset, i.e. not "mean" for one benchmark and "estimate" for another. The Significance Analysis
function is then called with the dataset and the variable-names as parameters.
Optionally the dataset can be binned according to a fourth variable (bin_id) and the analysis is conducted on each of the bins seperately, as shown in the code example above. To do this, provide the name of the bin_id-variable, the bin intervals and the labels for thems.
Installation
Using pip
pip install significance-analysis
Using R, >=4.0.0 install packages: Matrix, emmeans, lmerTest
Usage
- Generate data from HPO-algorithms on benchmarks, saving data according to our format.
- Call function
checkSignificance
on dataset, while specifying variable-names
In code, the usage pattern can look like this:
from signficance_analysis import checkSignificance
# 1. Generate/import dataset
data = pd.read_pickle("./exampleDataset.pkl")
# 2. Analyse dataset
checkSignificance(data, "mean", "surrogate_aquisition", "benchark")
For more details and features please have a look at our example.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for significance_analysis-0.1.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8cce0dc293119c4fa1a016b2aa195b6edcd9e0ae8a1b7f3784df43b2efb793ff |
|
MD5 | 4ad4213e1b133a2a4b9aa74a5fbb0b5e |
|
BLAKE2b-256 | 086dd32bf772964f0ce1bd2da270fb12dbee2ef5303f9d3e080a4648136ec619 |
Hashes for significance_analysis-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 35aba5665c3f816a6ac170f3d5a058de43d375ce9620e0b6d1baebf1d52f89bc |
|
MD5 | c0a0beb72e795a51f0f169bab8b99d3a |
|
BLAKE2b-256 | cba441b30e8fbd46ecf4d50301aef7d2aa1fcb4b47e050663d310ba142886a6f |