A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies
Project description
epsilon-policies
A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies.
Features
- Exploration – Select random actions to discover new possibilities.
- Exploitation – Choose the best-known action based on current estimates.
- Fixed Exploration–Then–Exploitation – Explore for a fixed period, then fully exploit.
- ε-Greedy – Balance exploration and exploitation with a probability parameter.
- ε-Greedy with UCB – Enhance ε-greedy with Upper Confidence Bound for better action selection.
Installation
pip install decisionbandit
import decisionbandit as dcb
# Example: ε-greedy
reward_prob=[0.2,0.3,0.4]
action = dcb.epsilon_greedy(reward_prob,epsilon=0.1, n_arms=3,t_steps=100)
print("Selected action:", action)
MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
decisionbandit-0.1.1.tar.gz
(2.6 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file decisionbandit-0.1.1.tar.gz.
File metadata
- Download URL: decisionbandit-0.1.1.tar.gz
- Upload date:
- Size: 2.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ee735ee76a5918619a2fa25fdbe0dc9631045791e10d7b9386bdd61f9c223cd5
|
|
| MD5 |
98edaf507479c629a3f6cd0f5b1f1201
|
|
| BLAKE2b-256 |
de7d704eb82f2c3302b5a5bea88c23473fe6d121a27e24171eab755cbd9a5243
|
File details
Details for the file decisionbandit-0.1.1-py3-none-any.whl.
File metadata
- Download URL: decisionbandit-0.1.1-py3-none-any.whl
- Upload date:
- Size: 3.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b573f344af730817aa0bd727e28f8b7420a1b0038b920cf253889e05c364d4cb
|
|
| MD5 |
997ad3f3a6883e3d5c049e88bbf61a7a
|
|
| BLAKE2b-256 |
ebb3929363fd5bf6484626658f378e0be3fc3c3e37d76c7635adfa8308a079cb
|