fastrl is a reinforcement learning library that extends Fastai. This project is not affiliated with fastai or Jeremy Howard.

fastrl

This is a temporary location for fastrl version 2, which is currently undergoing a major refactor. The previous source code can be found here.

Warning: Even before fastrl==2.0.0, all models should converge reasonably fast; however, the HRL models DADS and DIAYN will need re-balancing and some extra features that the respective authors used.

Overview

fastai has been amazing for computer vision and tabular learning, and one would wish for the same in RL. The purpose of this repo is to provide a framework that is as easy as possible to get started with, but is also designed for testing new agents.

Documentation is served at https://josiahls.github.io/fastrl/ and is generated directly via nbdev from this repo.

Current Issues of Interest

Data Issues

  • data and async_data are still buggy. We need to verify that the order in which data is returned is the best it can be for our models. We need to make sure that "dones" are returned and that there are no duplicates (unless intended).
  • Better data debugging. Do environments skip steps correctly? Does n_steps work correctly?
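
The checks listed above could be prototyped along the following lines. This is only a minimal sketch and not part of fastrl: the function name check_transitions and the transition format (a dict with 'episode', 'step', and 'done' keys) are assumptions made for illustration, and the real data and async_data outputs may be shaped differently.

```python
from collections import Counter


def check_transitions(transitions):
    """Sanity-check a list of transition dicts (hypothetical format).

    Each transition is assumed to carry the keys 'episode', 'step',
    and 'done'. Returns a dict describing any problems found.
    """
    problems = {}

    # 1. Every transition should carry a boolean 'done' flag.
    problems["missing_done"] = [
        i for i, t in enumerate(transitions)
        if not isinstance(t.get("done"), bool)
    ]

    # 2. The same (episode, step) pair should not appear twice
    #    unless duplication is intended.
    counts = Counter((t["episode"], t["step"]) for t in transitions)
    problems["duplicates"] = sorted(k for k, n in counts.items() if n > 1)

    # 3. Flag episodes with no terminal step in this batch. This can be
    #    legitimate for a partial rollout, so treat it as a hint only.
    episodes = {t["episode"] for t in transitions}
    finished = {t["episode"] for t in transitions if t.get("done") is True}
    problems["episodes_without_done"] = sorted(episodes - finished)

    return problems


# Usage with a small hand-written batch (made-up data).
batch = [
    {"episode": 0, "step": 0, "done": False},
    {"episode": 0, "step": 1, "done": True},
    {"episode": 1, "step": 0, "done": False},
    {"episode": 1, "step": 0, "done": False},  # duplicate step
    {"episode": 2, "step": 0},                 # 'done' flag missing
]
report = check_transitions(batch)
print(report)
```

A check like this is cheap to run on a few collected batches before training, which makes ordering and duplication bugs visible without having to wait for a model to fail to converge.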

What's new?

As we have learned how to support as many RL agents as possible, we found that fastrl==1.* was vastly limited in the models that it can support. fastrl==2.* will leverage the nbdev library for better documentation and more relevant testing. We will also build on the work of the ptan library [1] as a close reference for PyTorch-based reinforcement learning APIs.

[1] "Shmuma/Ptan". GitHub, 2020, https://github.com/Shmuma/ptan. Accessed 13 June 2020.

Install

PyPI (Not implemented yet)

Placeholder: there is no PyPI package yet. Forking the repository is recommended instead.

For future use (there is currently no PyPI version):

pip install fastrl==2.0.0 --pre

Conda (Not implemented yet)

conda install -c josiahls fastrl

source activate fastrl && python setup.py develop

Docker (highly recommended)

Install: Nvidia-Docker

Install: docker-compose

docker-compose pull && docker-compose up

Contributing

After you clone this repository, please run nbdev_install_git_hooks in your terminal. This sets up git hooks that strip extraneous metadata from the notebooks (e.g. which cells you ran), which otherwise causes unnecessary merge conflicts.

Before submitting a PR, check that the local library and notebooks match. The script nbdev_diff_nbs can let you know if there is a difference between the local library and the notebooks.

  • If you made a change to the notebooks in one of the exported cells, you can export it to the library with nbdev_build_lib or make fastai2.
  • If you made a change to the library, you can export it back to the notebooks with nbdev_update_lib.
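
The pre-PR check described above can also be scripted. The following is a minimal sketch, assuming that nbdev_diff_nbs prints nothing when the library and the notebooks match; the helper name notebooks_in_sync is hypothetical and is not provided by fastrl or nbdev.

```python
import shutil
import subprocess


def notebooks_in_sync(command="nbdev_diff_nbs"):
    """Run the diff command and report whether it printed anything.

    Returns True if the command printed no diff, False if it printed
    one, and None if the command is not installed on this machine.
    """
    if shutil.which(command) is None:
        return None
    result = subprocess.run([command], capture_output=True, text=True)
    # Assumption: empty output means the library and notebooks match.
    return result.stdout.strip() == ""


if __name__ == "__main__":
    status = notebooks_in_sync()
    if status is None:
        print("nbdev_diff_nbs not found; is nbdev installed?")
    elif status:
        print("Library and notebooks match.")
    else:
        print("Library and notebooks differ; run nbdev_build_lib or nbdev_update_lib.")
```

Run it from the repository root so that nbdev can locate the project settings.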
