Skip to main content

A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies

Project description

epsilon-policies

A collection of exploration–exploitation strategies for reinforcement learning, including ε-greedy and related policies.

Features

  • Exploration – Select random actions to discover new possibilities.
  • Exploitation – Choose the best-known action based on current estimates.
  • Fixed Exploration–Then–Exploitation – Explore for a fixed period, then fully exploit.
  • ε-Greedy – Balance exploration and exploitation with a probability parameter.
  • ε-Greedy with UCB – Enhance ε-greedy with Upper Confidence Bound for better action selection.

Installation

pip install decisionbandit


import decisionbandit as dcb

# Example: ε-greedy

reward_prob=[0.2,0.3,0.4]
action = dcb.epsilon_greedy(reward_prob,epsilon=0.1, n_arms=3,t_steps=100)
print("Selected action:", action)


MIT License

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

decisionbandit-0.1.1.tar.gz (2.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

decisionbandit-0.1.1-py3-none-any.whl (3.1 kB view details)

Uploaded Python 3

File details

Details for the file decisionbandit-0.1.1.tar.gz.

File metadata

  • Download URL: decisionbandit-0.1.1.tar.gz
  • Upload date:
  • Size: 2.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for decisionbandit-0.1.1.tar.gz
Algorithm Hash digest
SHA256 ee735ee76a5918619a2fa25fdbe0dc9631045791e10d7b9386bdd61f9c223cd5
MD5 98edaf507479c629a3f6cd0f5b1f1201
BLAKE2b-256 de7d704eb82f2c3302b5a5bea88c23473fe6d121a27e24171eab755cbd9a5243

See more details on using hashes here.

File details

Details for the file decisionbandit-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: decisionbandit-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 3.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for decisionbandit-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b573f344af730817aa0bd727e28f8b7420a1b0038b920cf253889e05c364d4cb
MD5 997ad3f3a6883e3d5c049e88bbf61a7a
BLAKE2b-256 ebb3929363fd5bf6484626658f378e0be3fc3c3e37d76c7635adfa8308a079cb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page