Implementation of Exponential weighting for Exploration and Exploitation with Experts.
Project description
EXP4
A python implementation of Exponential weighting for Exploration and Exploitation with Experts. Based on this blog post.
This algorithm is useful for non-stochastic Contextual Multi Armed Bandits.
Table of Contents
Installation
If you just need to use exp4
, you can just run:
$ pip install exp4
For developers, note that this project uses the poetry python package/dependency management tool. Please familarize yourself with it and then run:
$ poetry install
Usage
exp4
is centered around the exp4.exp4
function which creates a
co-routine for selecting arms given expert advice.
The protocol is as follows:
- The expert constructs an expert advice matrix.
- Each row contains the corresponding experts advice vector.
- The advice vector provides probabilities for each arm.
- The expert sends a tuple of loss and advice.
- The loss corresponds to the previous round.
- The first round's loss is ignored.
- The advice correspond to the current round.
An example is given below.
from exp4 import exp4
player = exp4()
loss = None # Will be ignored.
advice = [
[1/3, 1/3, 1/3], # Expert 1
[2/3, 1/3, 0], # Expert 2
]
arm = player.send((loss, advice))
assert arm in range(3)
loss = 1 / (1 + arm) # Arbitrary loss assigned to arm.
advice = [
[0, 0, 1], # Expert 1
[0, 0, 1], # Expert 2
]
arm = player.send((loss, advice))
assert arm == 2
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file exp4-0.1.4.tar.gz
.
File metadata
- Download URL: exp4-0.1.4.tar.gz
- Upload date:
- Size: 3.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.7 CPython/3.9.6 Linux/5.13.5-1-MANJARO
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2c8e4f72d30cc6ef0324eb1c1d49963b20cb0523a1b17b60f49f1b23281ae480 |
|
MD5 | 6bd5c18b88cb7ae0dcd01c216d5b8e35 |
|
BLAKE2b-256 | 0a1fac89e395303e9248cf8b8f2650ef76a2e36ba939957df3069898f4793743 |
File details
Details for the file exp4-0.1.4-py3-none-any.whl
.
File metadata
- Download URL: exp4-0.1.4-py3-none-any.whl
- Upload date:
- Size: 4.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.7 CPython/3.9.6 Linux/5.13.5-1-MANJARO
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ef11848f628e6c2b7b612680bc3cfee86556a726a52d75bbc751047cf55783c1 |
|
MD5 | e48dd5756e571515f73bd1f707b0d106 |
|
BLAKE2b-256 | 5f0b2906d603beb90d3739db0a09d872f502e514cc53324fc2547a823188ea7b |