Skip to main content

A framework for learning about and experimenting with reinforcement learning algorithms

Project description

RLFlow

https://img.shields.io/pypi/v/rlflow.svg https://img.shields.io/travis/tpbarron/rlflow.svg Documentation Status Updates

A framework for learning about and experimenting with reinforcement learning algorithms. It is built on top of TensorFlow and TFLearn and is interfaces with the OpenAI gym (universe should work, too). It aims to be as modular as possible so that new algorithms and ideas can easily be tested. I started it to gain a better understanding of core RL algorithms and maybe it can be useful for others as well.

Features

Algorithms (future algorithms italicized):

  • MDP algorithms

    • Value iteration
    • Policy iteration
  • Temporal Difference Learning

    • SARSA
    • Deep Q-Learning
    • Policy gradient Q-learning
  • Gradient algorithms

    • Vanilla policy gradient
    • Deterministic policy gradient
    • Natural policy gradient
  • Gradient-Free algorithms

    • Cross entropy method

Function approximators (defined by TFLearn model):

  • Linear
  • Neural network
  • RBF

Works with any OpenAI gym environment.

Future Enhancements

  • Improved TensorBoard logging
  • Improved model snapshotting to include exploration states, memories, etc.
  • Any suggestions?

Fixes

  • Errors / warnings on TensorFlow session save

License

History

0.1.0 (2016-12-15)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for rlflow, version 0.1.2
Filename, size File type Python version Upload date Hashes
Filename, size rlflow-0.1.2.tar.gz (12.5 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page