Dueling Bandit Algorithms in Python
Project description
duelpy
This is a python package for solving with Preference Based Multi Armed Bandit problems, also known as dueling bandits. Refer to this paper for an overview of the field.
You can compare the implemented algorithms in an experiment by running
python3 -m duelpy.experiments.cli
The experiments are still rather limited. The command-line interface can
currently only run regret-based comparisons in a limited set of configurations.
Pass the --help
flag for more information.
See the documentation for more information about the implemented algorithms.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
duelpy-0.1.0.tar.gz
(85.1 kB
view hashes)
Built Distribution
duelpy-0.1.0-py3-none-any.whl
(118.3 kB
view hashes)