Skip to main content

Value-Function-Based Reinforcement-Learning Library for Education and Research: Python3 Fork

Project description

RLPy3

Azure Pipelines PyPI

Why Fork?

This is my fork of RLPy.

RLPy is a great reinforcement learning library with some good features like custom representations and value function viewers. However, it is no longer maintained and the PyPI version of RLPy is not compatible with the latest numpy C-API, which causes runtime errors.

So I decided to fork the repository and provide a maintained version of RLPy only for Python>=3.6, named RLPy3. You can download wheels of RLPy3 from PyPI.

In addition, I pushed lots of refactorings, bug fixes, new environments such as FixedRewardGridWorld, and new algorithms such as PSRL. I also removed some modules (like parallel-job utilities) to keep this library maintainable by sufficiently low efforts.

To see what was changed, see the new CHANGELOG.

Install

The latest version is 2.0.0b1, which you can install by:

pip3 install rlpy3 -U --pre

Screenshots

pictures/GridWorld4x5Domain.png

An agent that explores GridWorld.

pictures/GridWorld4x5Value.png

Value function of the agent.

Original README: RLPy - Reinforment Learning Framework

RLPy is a framework to conduct sequential decision making experiments. The current focus of this project lies on value-function-based reinforcement learning. The project is distributed under the 3-Clause BSD License.

Install

Installation instructions can be found at http://rlpy.readthedocs.org/en/latest/install.html

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Built Distributions

rlpy3-2.0.0-cp38-cp38-win_amd64.whl (702.7 kB view hashes)

Uploaded cp38

rlpy3-2.0.0-cp37-cp37m-win_amd64.whl (700.5 kB view hashes)

Uploaded cp37

rlpy3-2.0.0-cp36-cp36m-win_amd64.whl (700.2 kB view hashes)

Uploaded cp36

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page