Skip to main content

Implementation of Exponential weighting for Exploration and Exploitation with Experts.

Project description

EXP4

A python implementation of Exponential weighting for Exploration and Exploitation with Experts. Based on this blog post.

This algorithm is useful for non-stochastic Contextual Multi Armed Bandits.

Build Status PyPI version License: MIT

Table of Contents

Installation

If you just need to use exp4, you can just run:

$ pip install exp4

For developers, note that this project uses the poetry python package/dependency management tool. Please familarize yourself with it and then run:

$ poetry install

Usage

exp4 is centered around the exp4.exp4 function which creates a co-routine for selecting arms given expert advice.

The protocol is as follows:

  1. The expert constructs an expert advice matrix.
    • Each row contains the corresponding experts advice vector.
    • The advice vector provides probabilities for each arm.
  2. The expert sends a tuple of loss and advice.
    • The loss corresponds to the previous round.
    • The first round's loss is ignored.
    • The advice correspond to the current round.

An example is given below.

player = exp4()

loss = None           # Will be ignored.
advice = [
    [1/3, 1/3, 1/3],  # Expert 1 
    [2/3, 1/3, 0],    # Expert 2
]
arm = player.send((loss, advice))
assert arm in range(3)

loss = 1 / (1 + arm)  # Arbitrary loss assigned to arm.
advice = [
    [0, 0, 1],        # Expert 1
    [0, 0, 1],        # Expert 2
]
arm = player.send((loss, advice))
assert arm == 2

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

exp4-0.1.3.tar.gz (3.9 kB view details)

Uploaded Source

Built Distribution

exp4-0.1.3-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file exp4-0.1.3.tar.gz.

File metadata

  • Download URL: exp4-0.1.3.tar.gz
  • Upload date:
  • Size: 3.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.7 CPython/3.9.6 Linux/5.13.5-1-MANJARO

File hashes

Hashes for exp4-0.1.3.tar.gz
Algorithm Hash digest
SHA256 9f551f6e3abe8adc704428ae068118b93992a1de959e5cae1c05e84df6a1f118
MD5 d4432ad8751e209ea79cf27504a23899
BLAKE2b-256 d9595aabb9df2b4183981753465c7462689dbc5100c82b1f9f2716e98342f3eb

See more details on using hashes here.

File details

Details for the file exp4-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: exp4-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.7 CPython/3.9.6 Linux/5.13.5-1-MANJARO

File hashes

Hashes for exp4-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 f816b425199b34d220a5fd2ca5face2350bb391fa88268d1073cd1372b3e5957
MD5 60f06c38c6fea3d54e423d038ba0ace5
BLAKE2b-256 0bcc82d044b5adbd1f31b5ffa823936419d993ca51f191b96a0166d01eb7fe94

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page