A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies
Project description
epsilon-policies
A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies.
Features
- Exploration – Select random actions to discover new possibilities.
- Exploitation – Choose the best-known action based on current estimates.
- Fixed Exploration–Then–Exploitation – Explore for a fixed period, then fully exploit.
- ε-Greedy – Balance exploration and exploitation with a probability parameter.
- ε-Greedy with UCB – Enhance ε-greedy with Upper Confidence Bound for better action selection.
Installation
pip install decisionbandit
import decisionbandit as dcb
# Example: ε-greedy
action = dcb.epsilon_greedy(q_values=[1.0, 0.5, 0.2], epsilon=0.1)
print("Selected action:", action)
MIT License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
decisionbandit-0.1.0.tar.gz
(2.6 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file decisionbandit-0.1.0.tar.gz.
File metadata
- Download URL: decisionbandit-0.1.0.tar.gz
- Upload date:
- Size: 2.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b52f2e267bca1fe3568db3228787003bad9650adbf67d0bcce349263e66ecfb0
|
|
| MD5 |
0592fbceee2cc8846dc3ee23d557a0ad
|
|
| BLAKE2b-256 |
78fcef458755560a8cf306bd1b4e3b1f5230793bc80094ebe137f8123c460dab
|
File details
Details for the file decisionbandit-0.1.0-py3-none-any.whl.
File metadata
- Download URL: decisionbandit-0.1.0-py3-none-any.whl
- Upload date:
- Size: 3.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
966f00d73b30dacb56fba51097a01d4d5ee186a9d53526e6a338f61f5187688f
|
|
| MD5 |
39bc67d47f88849d4f6ac72ad3b90b85
|
|
| BLAKE2b-256 |
3f8d4ab6dd1014da698cbd960a351954aaacdcd1eddad3ddb539417b8dd0a15d
|