Thompson Sampling

Project description

thompson-sampling

Thompson Sampling Multi-Armed Bandit for Python

This project is an implementation of a Thompson Sampling approach to a Multi-Armed Bandit. The goal of this project is to easily create and maintain Thompson Sampling experiments.

Currently this project supports experiments where the response follows a Bernoulli or Poisson distribution. Further work will be done to allow for experiments that follow other distributions, with recommendations/collaboration welcome.

Usage

Setting up the experiment:

The following method will instantiate the experiment with default priors.

from thompson_sampling.bernoulli import BernoulliExperiment

experiment = BernoulliExperiment(arms=2)

If you want set your own priors using the Priors module:

from thompson_sampling.bernoulli import BernoulliExperiment
from thompson_sampling.priors import BetaPrior

pr = BetaPrior()
pr.add_one(mean=0.5, variance=0.2, effective_size=10, label="option1")
pr.add_one(mean=0.6, variance=0.3, effective_size=30, label="option2")
experiment = BernoulliExperiment(priors=pr)

Getting an action:

Randomly chooses which arm to "pull" in the multi-armed bandit:

experiment.choose_arm()

Updating reward:

Updating the information about the different arms by adding reward information:

rewards = [{"label":"option1", "reward":1}, {"label":"option2", "reward":0}]
experiment.add_rewards(rewards)

Installation

Pip

pip install thompson-sampling

Project details

Release history Release notifications | RSS feed

0.0.4

Feb 4, 2020

This version

0.0.3

May 15, 2019

0.0.1

Feb 7, 2019

0.0.0

Feb 1, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thompson-sampling-0.0.3.tar.gz (4.3 kB view hashes)

Uploaded May 15, 2019 Source

Built Distribution

thompson_sampling-0.0.3-py3-none-any.whl (8.0 kB view hashes)

Uploaded May 15, 2019 Python 3

Hashes for thompson-sampling-0.0.3.tar.gz

Hashes for thompson-sampling-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`dc47772a3864492297416160a230d309a17f256c163cd9b8a3dec9a1b33a886a`
MD5	`f5f95e121db2c70d210b256bacf40875`
BLAKE2b-256	`8886b525f9fcf479a040c6408f97b1bb79107981d5cdf0365b1531d5e99e6ab0`

Hashes for thompson_sampling-0.0.3-py3-none-any.whl

Hashes for thompson_sampling-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9178b05045473a5d853adf239ae9312bf758469990639ff4ac17f9fc76f2d221`
MD5	`49e12114d11fb7b06b462b1c44f55f4e`
BLAKE2b-256	`fb03ade6cf35bdc63047943ea8d3e63d62366c83988714cd11c1deeacfb4c003`