fastrl is a reinforcement learning library that extends Fastai. This project is not affiliated with fastai or Jeremy Howard.
Project description
fastrl
# hide
from nbdev.showdoc import *
from nbdev.imports import *
if not os.environ.get("IN_TEST", None):
assert IN_NOTEBOOK
assert not IN_COLAB
assert IN_IPYTHON
Warning: Even before fastrl==2.0.0, all Models should converge reasonably fast, however HRL models
DADS
andDIAYN
will need re-balancing and some extra features that the respective authors used.
Overview
Here is change
Fastai for computer vision and tabular learning has been amazing. One would wish that this would be the same for RL. The purpose of this repo is to have a framework that is as easy as possible to start, but also designed for testing new agents.
Documentation is being served at https://josiahls.github.io/fastrl/ from
documentation directly generated via nbdev
in this repo.
Current Issues of Interest
Data Issues
- data and async_data are still buggy. We need to verify that the order that the data being returned is the best it can be for our models. We need to make sure that “terminateds” are returned and that there are new duplicate (unless intended)
- Better data debugging. Do environments skips steps correctly? Do n_steps work correct?
Whats new?
As we have learned how to support as many RL agents as possible, we
found that fastrl==1.*
was vastly limited in the models that it can
support. fastrl==2.*
will leverage the nbdev
library for better
documentation and more relevant testing. We also will be building on the
work of the ptan
1 library as a close reference for pytorch
based reinforcement learning APIs.
1 “Shmuma/Ptan”. Github, 2020, https://github.com/Shmuma/ptan. Accessed 13 June 2020.
Install
PyPI (Not implemented yet)
Placeholder here, there is no pypi package yet. It is recommended to do traditional forking.
(For future, currently there is no pypi
persion)pip install fastrl==2.0.0 --pre
Conda (Not implimented yet)
conda install -c fastchan -c josiahls fastrl
source activate fastrl && python setup.py develop
Docker (highly recommend)
Install: Nvidia-Docker
Install: docker-compose
docker-compose pull && docker-compose up
Contributing
After you clone this repository, please run nbdev_install_git_hooks
in
your terminal. This sets up git hooks, which clean up the notebooks to
remove the extraneous stuff stored in the notebooks (e.g. which cells
you ran) which causes unnecessary merge conflicts.
Before submitting a PR, check that the local library and notebooks
match. The script nbdev_diff_nbs
can let you know if there is a
difference between the local library and the notebooks. * If you made a
change to the notebooks in one of the exported cells, you can export it
to the library with nbdev_build_lib
or make fastai2
. * If you made
a change to the library, you can export it back to the notebooks with
nbdev_update_lib
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file fastrl-0.0.45.tar.gz
.
File metadata
- Download URL: fastrl-0.0.45.tar.gz
- Upload date:
- Size: 49.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 285046111100827bc7d199789bc495a89478cd89e28a087a76395fbf8d5beb61 |
|
MD5 | 3b5e01918ce86eede77ffb48de68359d |
|
BLAKE2b-256 | 8c7cd3d35d2c07afa9dc719f18adda1bb01a224ddf0906ff50b2d13bd996dcaa |
File details
Details for the file fastrl-0.0.45-py3-none-any.whl
.
File metadata
- Download URL: fastrl-0.0.45-py3-none-any.whl
- Upload date:
- Size: 62.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8f9c40f17c8b81358b3fa0cd8502ac732025b2f73d2c2f94aa93987b9c54f14f |
|
MD5 | 1b25518dc33a7bee64cebcfaf6d2af5d |
|
BLAKE2b-256 | 65461bdf0738018954d7268ff09f3b7d2fe0af0d8dd863c8de71b6cf2c3b64c6 |