Skip to main content

A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies

Project description

epsilon-policies

A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies.

Features

  • Exploration – Select random actions to discover new possibilities.
  • Exploitation – Choose the best-known action based on current estimates.
  • Fixed Exploration–Then–Exploitation – Explore for a fixed period, then fully exploit.
  • ε-Greedy – Balance exploration and exploitation with a probability parameter.
  • ε-Greedy with UCB – Enhance ε-greedy with Upper Confidence Bound for better action selection.

Installation

pip install decisionbandit


import decisionbandit as dcb

# Example: ε-greedy
action = dcb.epsilon_greedy(q_values=[1.0, 0.5, 0.2], epsilon=0.1)
print("Selected action:", action)


MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

decisionbandit-0.1.0.tar.gz (2.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

decisionbandit-0.1.0-py3-none-any.whl (3.1 kB view details)

Uploaded Python 3

File details

Details for the file decisionbandit-0.1.0.tar.gz.

File metadata

  • Download URL: decisionbandit-0.1.0.tar.gz
  • Upload date:
  • Size: 2.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for decisionbandit-0.1.0.tar.gz
Algorithm Hash digest
SHA256 b52f2e267bca1fe3568db3228787003bad9650adbf67d0bcce349263e66ecfb0
MD5 0592fbceee2cc8846dc3ee23d557a0ad
BLAKE2b-256 78fcef458755560a8cf306bd1b4e3b1f5230793bc80094ebe137f8123c460dab

See more details on using hashes here.

File details

Details for the file decisionbandit-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: decisionbandit-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 3.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for decisionbandit-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 966f00d73b30dacb56fba51097a01d4d5ee186a9d53526e6a338f61f5187688f
MD5 39bc67d47f88849d4f6ac72ad3b90b85
BLAKE2b-256 3f8d4ab6dd1014da698cbd960a351954aaacdcd1eddad3ddb539417b8dd0a15d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page