DM-Gym: A set of environments for developing reinforcement learning agents for Data Mining problems.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
Programming Language

Project description

DM-Gym

Data Mining Gym Environment for Reinforcement Learning

Installation

You can download the git repository directly and keep the dm_gym folder inside your project folder.

You could also use the following steps to install DM-Gym in your system to be accessible anywhere:

git clone https://github.com/ashwin-M-D/DM-Gym.git
cd DM-Gym
pip install -e

The package is also in the pypi repository so it can be installed using pip.

pip install dm-gym

Testing

To test the environment using the test codes provided, you need to have ray installed. Please use the conda environment file provided to setup your environment. Then, install DM-Gym as mentioned above and proceed with running the python notebooks provided. All of this can be done as follows.

## Installing DM-Gym
git clone https://github.com/ashwin-M-D/DM-Gym.git
cd DM-Gym
pip install -e

## Creating the conda environment
cd testing
cd conda_envs
conda env create -f dmgym_environment.yml

## Activate conda environment and cd to the folder containing the experiment files.
conda activate myenv_dmgym_testing
cd ..
cd experiments

Available Environments

Clustering:

All these environments involve records which arrive in a random order and they are classified into one of k clusters. The value of k is predefined similar to k-means clustering.

Basically the input / state space is a single record from the dataset and the output is a discreet variable which is an integer between 0 and k-1, each specifying a specific cluster.
- clustering-v0: Reward function is negative of log(db-index)
  
  This is a poor performing environment.
- clustering-v1: Reward function is based on both the distance and also the db-index.
  
  This performs better than clustering-v0. However, it is suggested to use one of the other 2 clustering environments.
- clustering-v2: Uses a different reward system which is either p-1 or p at each step. Based on the paper "A Reinforcement Learning Approach to Online Clustering" [1]. Please use a low gamma value with this environment for optimal results.
- clustering-v3: This has the best performance among all the clustering environments. It converts the problem into a classification problem internally. However, to showcase true capabilities of RL, this should not be used. Use a low gamma value with this environment.
Classification:

Classification is done by reading a single record at a time and checking the output of your RL agent against the class it belongs to.
- classification-v0: This has very good performance and the reward function is defined as 1 if the output of the agent and the class it actually belongs to match and -1 if they don't match. It is again recommended to use a low gamma value for this environment.

Environments planned for the future

Linear Regression environments.
More Classification environments.

Notes:

See Testing folder to see examples of each of the environments and their outputs
Documentation for all available functions is available in the documentation folder. This folder will be updated regularly to make sure there are no ambiguity in the usage of the environments

References

Likas, A., 1999. A reinforcement learning approach to online clustering. Neural computation, 11(8), pp.1915-1932. PDF
Hubbs, C.D., Perez, H.D., Sarwar, O., Sahinidis, N.V., Grossmann, I.E. and Wassick, J.M., 2020. OR-Gym: A Reinforcement Learning Library for Operations Research Problems. arXiv preprint arXiv:2008.06319. PDF GitHub

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 4 - Beta
Intended Audience
- Developers
Programming Language

Release history Release notifications | RSS feed

This version

0.1.6b0 pre-release

Oct 19, 2021

0.1.5b2 pre-release

Oct 18, 2021

0.1.5b0 pre-release

Oct 17, 2021

0.1.4b0 pre-release

Oct 15, 2021

0.1.3b0 pre-release

Oct 15, 2021

0.1.2.2b0 pre-release

Oct 15, 2021

0.1.2.1b0 pre-release

Oct 15, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dm-gym-0.1.6b0.tar.gz (12.8 kB view hashes)

Uploaded Oct 19, 2021 Source

Hashes for dm-gym-0.1.6b0.tar.gz

Hashes for dm-gym-0.1.6b0.tar.gz
Algorithm	Hash digest
SHA256	`708d3f4cc05e52d2d769eaf9da3af26967b673eaeb2076d125dce2455b907e7b`
MD5	`d4b1f43a0b801dc477d2c5e476599217`
BLAKE2b-256	`e06fbbcc8729951ced416c449d4955a97a8d54f23dcbc567c8f1fae56e49de1a`