fastrl is a reinforcement learning library that extends Fastai. This project is not affiliated with fastai or Jeremy Howard.

These details have not been verified by PyPI

Project links

Homepage

Project description

fastrl

Warning: This is in alpha, and so uses latest torch and torchdata, very importantly torchdata. The base API, while at the point of semi-stability, might be changed in future versions, and so there will be no promises of backward compatiblity. For the time being, it is best to hard-pin versions of the library.

Warning: Even before fastrl==2.0.0, all Models should converge reasonably fast, however HRL models DADS and DIAYN will need re-balancing and some extra features that the respective authors used.

Overview

Fastai for computer vision and tabular learning has been amazing. One would wish that this would be the same for RL. The purpose of this repo is to have a framework that is as easy as possible to start, but also designed for testing new agents.

This version fo fastrl is basically a wrapper around torchdata.

It is built around 4 pipeline concepts (half is from fastai):

DataLoading/DataBlock pipelines
Agent pipelines
Learner pipelines
Logger plugins

Documentation is being served at https://josiahls.github.io/fastrl/ from documentation directly generated via nbdev in this repo.

Basic DQN example:

from fastrl.loggers.core import *
from fastrl.loggers.vscode_visualizers import  *
from fastrl.agents.dqn.basic import *
from fastrl.agents.dqn.target import *
from fastrl.data.block import *
from fastrl.envs.gym import *
import torch

# Setup Loggers
logger_base = ProgressBarLogger(epoch_on_pipe=EpocherCollector,
                 batch_on_pipe=BatchCollector)

# Setup up the core NN
torch.manual_seed(0)
model = DQN(4,2)
# Setup the Agent
agent = DQNAgent(model,[logger_base],max_steps=10000)
# Setup the DataBlock
block = DataBlock(
    GymTransformBlock(agent=agent,nsteps=2,nskips=2,firstlast=True), # We basically merge 2 steps into 1 and skip. 
    (GymTransformBlock(agent=agent,nsteps=2,nskips=2,firstlast=True,n=100,include_images=True),VSCodeTransformBlock())
)
dls = L(block.dataloaders(['CartPole-v1']*1))
# Setup the Learner
learner = DQNLearner(model,dls,logger_bases=[logger_base],bs=128,max_sz=20_000,nsteps=2,lr=0.001,
                     batches=1000,
                    dp_augmentation_fns=[
                        # Plugin TargetDQN code
                        TargetModelUpdater.insert_dp(),
                        TargetModelQCalc.replace_dp()
                    ])
learner.fit(10)
#learner.validate()

Whats new?

As we have learned how to support as many RL agents as possible, we found that fastrl==1.* was vastly limited in the models that it can support. fastrl==2.* will leverage the nbdev library for better documentation and more relevant testing, and torchdata is the base lib. We also will be building on the work of the ptan¹ library as a close reference for pytorch based reinforcement learning APIs.

¹ “Shmuma/Ptan”. Github, 2020, https://github.com/Shmuma/ptan. Accessed 13 June 2020.

Install

PyPI

Below will install the alpha build of fastrl.

Cuda Install

pip install fastrl==0.0.* --pre --extra-index-url https://download.pytorch.org/whl/nightly/cu113

Cpu Install

pip install fastrl==0.0.* --pre --extra-index-url https://download.pytorch.org/whl/nightly/cpu

Docker (highly recommend)

Install: Nvidia-Docker

Install: docker-compose

docker-compose pull && docker-compose up

Contributing

After you clone this repository, please run nbdev_install_hooks in your terminal. This sets up git hooks, which clean up the notebooks to remove the extraneous stuff stored in the notebooks (e.g. which cells you ran) which causes unnecessary merge conflicts.

Before submitting a PR, check that the local library and notebooks match. The script nbdev_clean can let you know if there is a difference between the local library and the notebooks. * If you made a change to the notebooks in one of the exported cells, you can export it to the library with nbdev_build_lib or make fastai2. * If you made a change to the library, you can export it back to the notebooks with nbdev_update_lib.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.0.47

Oct 3, 2022

0.0.46

Sep 24, 2022

0.0.45

Aug 27, 2022

0.0.44

Aug 14, 2022

0.0.43

Feb 20, 2022

0.0.42

Feb 20, 2022

0.0.41

Feb 20, 2022

0.0.40

Sep 26, 2021

0.0.39

Sep 26, 2021

0.0.37

Sep 26, 2021

0.0.36

Jul 31, 2021

0.0.35

Jul 31, 2021

0.0.34

Jul 31, 2021

0.0.33

Jul 31, 2021

0.0.32

Jul 31, 2021

0.0.31

Jul 30, 2021

0.0.30

Jul 19, 2021

0.0.29

Jul 19, 2021

0.0.28

Jul 6, 2021

0.0.27

Jul 5, 2021

0.0.26

Jul 5, 2021

0.0.25

Jun 4, 2021

0.0.24

Jun 4, 2021

0.0.23

Jun 1, 2021

0.0.22

May 3, 2021

0.0.21

May 3, 2021

0.0.20

Apr 18, 2021

0.0.19

Apr 18, 2021

0.0.18

Apr 18, 2021

0.0.17

Apr 12, 2021

0.0.16

Apr 3, 2021

0.0.15

Mar 14, 2021

0.0.14

Mar 7, 2021

0.0.13

Mar 7, 2021

0.0.11

Feb 23, 2021

0.0.10

Feb 21, 2021

0.0.9

Feb 16, 2021

0.0.8

Feb 14, 2021

0.0.7

Feb 14, 2021

0.0.5

Feb 14, 2021

0.0.4

Feb 14, 2021

0.0.3

Feb 14, 2021

0.0.2

Feb 14, 2021

0.0.1

Feb 13, 2021

0.0.0

Dec 25, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastrl-0.0.47.tar.gz (80.1 kB view details)

Uploaded Oct 3, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fastrl-0.0.47-py3-none-any.whl (97.9 kB view details)

Uploaded Oct 3, 2022 Python 3

File details

Details for the file fastrl-0.0.47.tar.gz.

File metadata

Download URL: fastrl-0.0.47.tar.gz
Upload date: Oct 3, 2022
Size: 80.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.10.7

File hashes

Hashes for fastrl-0.0.47.tar.gz
Algorithm	Hash digest
SHA256	`df63b18ce5840eff0395d729c0f34cf70be3a79e51d3c4e34145e545345f7d45`
MD5	`eab196c0cecbb15260026cbcd61a7664`
BLAKE2b-256	`102e0984dba8ba770cf443b16ee9ffec1b07daf26e578d43bb3a9c0b73b7d943`

See more details on using hashes here.

File details

Details for the file fastrl-0.0.47-py3-none-any.whl.

File metadata

Download URL: fastrl-0.0.47-py3-none-any.whl
Upload date: Oct 3, 2022
Size: 97.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.10.7

File hashes

Hashes for fastrl-0.0.47-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9f99bddff435b328e0a5a736a8d381f89f4b2eb5402fec0578fcb520d6c76884`
MD5	`ce0567e3fda30f4e2fe62edfc17dcee3`
BLAKE2b-256	`ba0cf495faf5691692a045deb94db82715238a5fd65bf4663115ded292dc3411`

See more details on using hashes here.

fastrl 0.0.47

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

fastrl

Overview

Whats new?

Install

PyPI

Docker (highly recommend)

Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes