mabalgs

Multi-armed bandit algorithms

Project description

# Multi-Armed Bandit Algorithms
Multi-Armed Bandit (MAB) is a problem in which a fixed limited set of resources must be allocated between competing (alternative) choices in a way that maximizes their expected gain, when each choice's properties are only partially known at the time of allocation, and may become better understood as time passes or by allocating resources to the choice.

In the problem, each machine provides a random reward from a probability distribution specific to that machine. The objective of the gambler is to maximize the sum of rewards earned through a sequence of lever pulls. The crucial tradeoff the gambler faces at each trial is between "exploitation" of the machine that has the highest expected payoff and "exploration" to get more information about the expected payoffs of the other machines. The trade-off between exploration and exploitation is also faced in machine learning.

The main problems that the MAB help to solve is the split of the population in online experiments.

## Installing
```
pip install mabalgs
```

## Algorithms (Bandit strategies)

### UCB1 (Upper Confidence Bound)
Is an algorithm for the multi-armed bandit that achieves regret that grows only logarithmically with the number of actions taken, with no prior knowledge of the reward distribution required.

#### Get a selected arm
```python
from mab import algs

ucb_with_two_arms = algs.UCB1(2)
ucb_with_two_arms.select()
```

#### Reward an arm
```python
from mab import algs

ucb_with_two_arms = algs.UCB1(2)
my_arm = ucb_with_two_arms.select()
ucb_with_two_arms.reward(my_arm)
```
----------------

## References
- [Wikipedia MAB](https://en.wikipedia.org/wiki/Multi-armed_bandit)
- [A Survey of Online Experiment Design
with the Stochastic Multi-Armed Bandit](https://arxiv.org/pdf/1510.00757.pdf)
- [Finite-time Analysis of the Multiarmed Bandit Problem](https://link.springer.com/article/10.1023%2FA%3A1013689704352?LI=true)

Project details

Release history Release notifications | RSS feed

0.6.8

Jun 30, 2021

0.6.7

Jun 30, 2021

0.6.6

May 10, 2021

0.6.5

May 4, 2021

0.6.4

Jul 23, 2019

0.6.3

Jul 23, 2019

0.6.2

Jul 23, 2019

0.6.1

Jul 23, 2019

0.6.0

Jul 23, 2019

0.5.9

Feb 25, 2019

0.5.8

Feb 25, 2019

0.5.7

Feb 1, 2019

0.5.6

Jan 31, 2019

0.5.5

Jan 31, 2019

0.5.4

Jan 31, 2019

0.5.3

Jan 31, 2019

0.5.2

Jan 28, 2019

0.5.1

Jan 28, 2019

0.5.0

Jan 28, 2019

0.4.9

Jan 28, 2019

0.4.8

Jan 28, 2019

0.4.7

Jan 26, 2019

0.4.6

Jan 26, 2019

0.4.5

Jan 25, 2019

0.4.4

Jan 25, 2019

0.4.3

Jan 25, 2019

This version

0.4.2

Jan 25, 2019

0.4.1

Jan 25, 2019

0.4

Jan 25, 2019

0.3

Jan 24, 2019

0.2

Jan 24, 2019

0.1.1

Jan 24, 2019

0.1

Jan 24, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mabalgs-0.4.2.tar.gz (3.0 kB view details)

Uploaded Jan 25, 2019 Source

File details

Details for the file mabalgs-0.4.2.tar.gz.

File metadata

Download URL: mabalgs-0.4.2.tar.gz
Upload date: Jan 25, 2019
Size: 3.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: Python-urllib/3.6

File hashes

Hashes for mabalgs-0.4.2.tar.gz
Algorithm	Hash digest
SHA256	`a392d6226810e1c92891ac9aceac693cae977a96f8dee48a615d1b3874e6d38e`
MD5	`aedbd20a4ce97fd5612e9e9d60666480`
BLAKE2b-256	`8936f2dc73d0a4f8688b7541629907fe867ff905d36e7d4b4709c02deb20ed21`