Bayesian Tuning and Bandits

An open source project from the Data to AI Lab at MIT.

A simple, extensible backend for developing auto-tuning systems.

License: MIT
Development Status: Pre-Alpha
Documentation: https://mlbazaar.github.io/BTB
Homepage: https://github.com/MLBazaar/BTB

Overview

BTB ("Bayesian Tuning and Bandits") is a simple, extensible backend for developing auto-tuning systems such as AutoML systems. It provides an easy-to-use interface for tuning models and selecting between models.

It is currently being used in several AutoML systems:

ATM, a distributed, multi-tenant AutoML system for classifier tuning
MIT's system for the DARPA Data-Driven Discovery of Models (D3M) program
AutoBazaar, a flexible, general-purpose AutoML system

Install

Requirements

BTB has been developed and tested on Python 3.6, 3.7 and 3.8.

Although it is not strictly required, using a virtualenv is highly recommended to avoid interfering with other software installed on the system where BTB is run.

Install with pip

The easiest and recommended way to install BTB is using pip:

pip install baytune

This will pull and install the latest stable release from PyPI.

If you want to install from source or contribute to the project, please read the Contributing Guide.

Quickstart

In this short tutorial we will guide you through the steps needed to start using BTB to select between models and tune a model to solve a Machine Learning problem.

In particular, in this example we will use BTBSession to solve the Wine classification problem by selecting between the DecisionTreeClassifier and SGDClassifier models from scikit-learn while also searching for their best hyperparameter configuration.

Prepare a scoring function

The first step in using the BTBSession class is to develop a scoring function.

This is a Python function that, given a model name and a hyperparameter configuration, evaluates the performance of the model on your data and returns a score.

from sklearn.datasets import load_wine
from sklearn.linear_model import SGDClassifier
from sklearn.metrics import f1_score, make_scorer
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier


dataset = load_wine()
models = {
    'DTC': DecisionTreeClassifier,
    'SGDC': SGDClassifier,
}

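def scoring_function(model_name, hyperparameter_values):
    # Look up the model class by name and build it with the proposed values
    model_class = models[model_name]
    model_instance = model_class(**hyperparameter_values)
    # Cross-validate on the Wine data, scoring each fold with macro-averaged F1
    scores = cross_val_score(
        estimator=model_instance,
        X=dataset.data,
        y=dataset.target,
        scoring=make_scorer(f1_score, average='macro')
    )
    # Return a single number: the mean score across folds (higher is better)
    return scores.mean()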
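
The scoring function above relies on the macro-averaged F1 score. As a rough illustration of what that metric computes, here is a pure-Python sketch. The helper name macro_f1 and the sample labels are made up for this example, and this is not scikit-learn's implementation; in practice you would keep using f1_score as shown above.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    # Consider every class that appears in either the truth or the predictions
    labels = sorted(set(y_true) | set(y_pred))
    per_class = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        denominator = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); fall back to 0 if the denominator is 0
        per_class.append(2 * tp / denominator if denominator else 0.0)
    return sum(per_class) / len(per_class)


# Made-up labels for three classes, for illustration only
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 1, 2, 2, 2]
score = macro_f1(y_true, y_pred)
print(round(score, 3))
```

Here the per-class F1 values are 1, 2/3 and 0.8, so the macro average is roughly 0.82. Because every class counts equally, a model that does poorly on a small class is penalized as much as one that does poorly on a large class.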

Define the tunable hyperparameters

The second step is to define the hyperparameters that we want to tune for each model as Tunables.

from btb.tuning import Tunable
from btb.tuning import hyperparams as hp

tunables = {
    'DTC': Tunable({
        'max_depth': hp.IntHyperParam(min=3, max=200),
        'min_samples_split': hp.FloatHyperParam(min=0.01, max=1)
    }),
    'SGDC': Tunable({
        'max_iter': hp.IntHyperParam(min=1, max=5000, default=1000),
        'tol': hp.FloatHyperParam(min=1e-3, max=1, default=1e-3),
    })
}
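
To give an intuition for what such a search space describes, the sketch below draws one random configuration from ranges like the ones defined for 'DTC'. It is a simplified stand-in that uses only the standard library: the SPACE dictionary and the sample_config helper are hypothetical, they are not part of BTB's API, and the sketch assumes both bounds are inclusive.

```python
import random

# Hypothetical, simplified description of a search space:
# each hyperparameter maps to (kind, min, max). This is not BTB's API.
SPACE = {
    'max_depth': ('int', 3, 200),
    'min_samples_split': ('float', 0.01, 1.0),
}


def sample_config(space, rng):
    """Draw one configuration uniformly at random from the given ranges."""
    config = {}
    for name, (kind, low, high) in space.items():
        if kind == 'int':
            # randint includes both endpoints
            config[name] = rng.randint(low, high)
        else:
            config[name] = rng.uniform(low, high)
    return config


rng = random.Random(0)  # fixed seed so the draw is reproducible
config = sample_config(SPACE, rng)
print(config)
```

A tuner goes further than this uniform draw: it uses the scores of previously tried configurations to decide which region of the space to try next.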

Start the searching process

Once you have defined a scoring function and the tunable hyperparameter specification of your models, you can start searching for the best model and hyperparameter configuration by using btb.BTBSession.

All you need to do is create an instance, passing the tunable hyperparameter specification and the scoring function.

from btb import BTBSession

session = BTBSession(
    tunables=tunables,
    scorer=scoring_function
)

Then call the run method, indicating how many tuning iterations you want the BTBSession to perform:

best_proposal = session.run(20)

The result is a dictionary containing the name of the best model found, the hyperparameter configuration that was used, and the score it obtained:

{
    'id': '826aedc2eff31635444e8104f0f3da43',
    'name': 'DTC',
    'config': {
        'max_depth': 21,
        'min_samples_split': 0.044010284821858835
    },
    'score': 0.907229308339589
}
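
To make the idea of "select a model, propose a configuration, score it, keep the best" more concrete, here is a toy version of such a loop written with the standard library only. It is a sketch under stated assumptions, not BTB's implementation: it picks models with a UCB1 bandit rule and proposes hyperparameters uniformly at random, and the names toy_scorer, SPACES, ucb1_select and run are all hypothetical.

```python
import math
import random


def toy_scorer(name, config):
    """Hypothetical stand-in for a real scoring function (higher is better)."""
    if name == 'A':
        return 1.0 - abs(config['x'] - 0.3)
    return 0.5 - abs(config['x'] - 0.8)


# One float hyperparameter 'x' per model, given as (min, max)
SPACES = {'A': (0.0, 1.0), 'B': (0.0, 1.0)}


def ucb1_select(history, total):
    """Pick a model: try each one once, then maximize mean + exploration bonus."""
    for name, scores in history.items():
        if not scores:
            return name

    def upper_bound(name):
        scores = history[name]
        mean = sum(scores) / len(scores)
        return mean + math.sqrt(2 * math.log(total) / len(scores))

    return max(history, key=upper_bound)


def run(iterations, seed=0):
    rng = random.Random(seed)
    history = {name: [] for name in SPACES}
    best = None
    for i in range(iterations):
        name = ucb1_select(history, i)
        low, high = SPACES[name]
        config = {'x': rng.uniform(low, high)}  # random proposal
        score = toy_scorer(name, config)
        history[name].append(score)
        # Keep track of the best proposal seen so far
        if best is None or score > best['score']:
            best = {'name': name, 'config': config, 'score': score}
    return best


best = run(50)
print(best)
```

The returned dictionary mirrors the 'name', 'config' and 'score' keys of the result shown above. The bandit step spends more iterations on the model that has scored well so far while still occasionally revisiting the other one.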

How does BTB perform?

We have a comprehensive benchmarking framework that we use to evaluate the performance of our Tuners. For every release, we benchmark against hundreds of challenges, comparing tuners against each other in terms of number of wins. The leaderboard from the latest release is shown below:

Number of Wins on latest Version

tuner	with ties	without ties
`Ax.optimize`	220	32
`BTB.GCPEiTuner`	139	2
`BTB.GCPTuner`	252	90
`BTB.GPEiTuner`	208	16
`BTB.GPTuner`	213	24
`BTB.UniformTuner`	177	1
`HyperOpt.tpe`	186	6
`SMAC.HB4AC`	180	4
`SMAC.SMAC4HPO_EI`	220	31
`SMAC.SMAC4HPO_LCB`	205	16
`SMAC.SMAC4HPO_PI`	221	35

Detailed results from which this summary emerged are available here.
If you want to compare your own tuner, follow the steps in our benchmarking framework here.
If you have a proposal for a tuner that we should include in our benchmarking, get in touch with us at dailabmit@gmail.com.
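
One plausible reading of the two leaderboard columns can be sketched as follows. This assumes that "with ties" credits every tuner that reaches the best score on a challenge, while "without ties" credits a tuner only when it is the sole best; the benchmarking framework's exact definition may differ. The count_wins helper, the tuner names and the scores are made up for illustration.

```python
def count_wins(results):
    """Count wins per tuner from {challenge: {tuner: score}} results."""
    tuners = sorted({t for scores in results.values() for t in scores})
    with_ties = dict.fromkeys(tuners, 0)
    without_ties = dict.fromkeys(tuners, 0)
    for scores in results.values():
        best = max(scores.values())
        winners = [t for t, s in scores.items() if s == best]
        # Every tuner that reaches the best score gets a win "with ties"
        for tuner in winners:
            with_ties[tuner] += 1
        # Only a sole winner gets a win "without ties"
        if len(winners) == 1:
            without_ties[winners[0]] += 1
    return with_ties, without_ties


# Made-up scores for three hypothetical tuners on three challenges
results = {
    'challenge_1': {'tuner_a': 0.90, 'tuner_b': 0.90, 'tuner_c': 0.85},
    'challenge_2': {'tuner_a': 0.70, 'tuner_b': 0.75, 'tuner_c': 0.60},
    'challenge_3': {'tuner_a': 0.80, 'tuner_b': 0.80, 'tuner_c': 0.80},
}
with_ties, without_ties = count_wins(results)
print(with_ties)
print(without_ties)
```

Under this reading, a tuner can have a high "with ties" count simply because many challenges end in a draw, which is why the "without ties" column is the more discriminating of the two.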

Citing BTB

If you use BTB, please consider citing the following paper:

@article{smith2019mlbazaar,
  author = {Smith, Micah J. and Sala, Carles and Kanter, James Max and Veeramachaneni, Kalyan},
  title = {The Machine Learning Bazaar: Harnessing the ML Ecosystem for Effective System Development},
  journal = {arXiv e-prints},
  year = {2019},
  eid = {arXiv:1905.08942},
  pages = {arXiv:1905.08942},
  archivePrefix = {arXiv},
  eprint = {1905.08942},
}
