Library to compute Gittins and Whittle index for Markovian Bandits
Project description
Markovian Bandits
This repository contains a python library to compute whittle or indices for finite-state Markovian bandit problems.
Installation
To install, just run:
pip install markovianbandit-pkg
Example
To test with a few randomly generated examples:
import markovianbandit as bandit
model = bandit.random_restless(dim=4, seed=42)
print(model.whittle_indices()) # should print array([ 0.87536099, -0.08765819, -0.15279431, -0.51905682])
print(model.get_P0P1R0R1())
model = bandit.random_restless(4, seed=2791)
print(model.is_indexable()) # should print False
model = bandit.random_rested(dim=4)
print(model.gittins_indices(discount=.8)) # computes gittins index
To compute the Whittle or Gittins index from P0 and P1:
P0 = [[.5, .5], [.25, .75]]
P1 = [[1, 0], [0.5, 0.5]]
R0 = [0.5, 0.5]
R1 = [2, 1]
model = bandit.restless_bandit_from_P0P1_R0R1(P0, P1, R0, R1)
print(model.whittle_indices())
model = bandit.rested_bandit_from_P1_R1(P1, R1)
print(model.gittins_indices(discount=0.5))
Note that the algorithm complexity to compute Whittle index is $O(n^3)$. It can compute indices of bandits in less than one seconds to bandits of up to $1000$ states (i.e. $1000\times 1000$ matrices). The loop below should take less than one second.
import time
for dim in [10, 100, 1000]:
ts = time.time()
model = bandit.random_restless(dim=dim, seed=42)
model.whittle_indices()
print("Model of size", dim, "computed in", time.time() - ts, "seconds")
Reference
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for markovianbandit-pkg-0.1.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0ea7473285092f5615ef0dcaaa99129ce67dca974d273ccffc4caa4455920d5e |
|
MD5 | 8aec4fa0e89c07db9049177572f5e22a |
|
BLAKE2b-256 | a5f519478d4ebfe2290b6c3058809f01cb04c5d8fe661358b9d4bcb04195c061 |
Close
Hashes for markovianbandit_pkg-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 19a0853c664ca046ace135af06ad4d10eb2db285b02e0a09fbb8c4c6212ecabd |
|
MD5 | 6cf7440aac7d8a16f8bddf89eb2955f2 |
|
BLAKE2b-256 | db635f57e17fc1f6249b2700fe6b1129eaf9f017686589266d3357176546423b |