Library to compute Gittins and Whittle index for Markovian Bandits
Project description
Markovian Bandits
This repository contains a python library to compute whittle indices or test indexability of finite-state Markovian bandit problems.
Installation
To install, just run:
pip install markovianbandit-pkg
Example
To test with a few randomly generated examples:
import markovianbandit as bandit
model = bandit.random_restless(dim=4, seed=42)
print(model.whittle_indices()) # should print array([ 0.87536099, -0.08765819, -0.15279431, -0.51905682])
print(model.get_P0P1R0R1())
model = bandit.random_restless(4, seed=2791)
print(model.is_indexable()) # should print False
model = bandit.random_rested(dim=4)
print(model.gittins_indices(discount=.8)) # computes gittins index
To compute the Whittle or Gittins index from P0 and P1:
P0 = [[.5, .5], [.25, .75]]
P1 = [[1, 0], [0.5, 0.5]]
R0 = [0.5, 0.5]
R1 = [2, 1]
model = bandit.restless_bandit_from_P0P1_R0R1(P0, P1, R0, R1)
print(model.whittle_indices())
model = bandit.rested_bandit_from_P1_R1(P1, R1)
print(model.gittins_indices(discount=0.5))
Note that the algorithm complexity to compute Whittle index is $O(n^3)$. It can compute indices of bandits in less than one seconds to bandits of up to $1000$ states (i.e. $1000\times 1000$ matrices). The loop below should take less than one second.
import time
for dim in [10, 100, 1000]:
ts = time.time()
model = bandit.random_restless(dim=dim, seed=42)
model.whittle_indices()
print("Model of size", dim, "computed in", time.time() - ts, "seconds")
Reference
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for markovianbandit_pkg-0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 82a6dea3560271f1585a9774b54f9c9897d4291fc22e03a0da9224b605d09e14 |
|
MD5 | c2a81b2b950eb83a473c883f23aea87d |
|
BLAKE2b-256 | b54435b0006be09533c27c22a2d1a435dc827be4e59d395b5e2ecbd858fb95a7 |