Algorithm and utilities for deep reinforcement learning
Project description
Rainy
Reinforcement learning utilities and algrithm implementations using PyTorch.
API documentation
COMING SOON
Supported python version
Python >= 3.6.1
Implementation Status
| Algorithm | Multi Worker(Sync) | Recurrent | Discrete Action | Continuous Action | MPI support |
|---|---|---|---|---|---|
| DQN/Double DQN | :x: | :x: | :heavy_check_mark: | :x: | :x: |
| BootDQN/RPF | :x: | :x: | :heavy_check_mark: | :x: | :x: |
| DDPG | :x: | :x: | :x: | :heavy_check_mark: | :x: |
| TD3 | :x: | :x: | :x: | :heavy_check_mark: | :x: |
| SAC | :x: | :x: | :x: | :heavy_check_mark: | :x: |
| PPO | :heavy_check_mark: | :heavy_check_mark:(1) | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: |
| A2C | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :x: |
| ACKTR | :heavy_check_mark: | :x:(2) | :heavy_check_mark: | :heavy_check_mark: | :x: |
| AOC | :heavy_check_mark: | :x: | :heavy_check_mark: | :heavy_check_mark: | :x: |
(1): It's very unstable
(2): Needs https://openreview.net/forum?id=HyMTkQZAb implemented
Sub packages
- intrinsic-rewards
- Contains an implementation of RND(Random Network Distillation)
References
DQN (Deep Q Network)
DDQN (Double DQN)
Bootstrapped DQN
RPF(Randomized Prior Functions)
DDPQ(Deep Deterministic Policy Gradient)
TD3(Twin Delayed Deep Deterministic Policy Gradient)
SAC(Soft Actor Critic)
A2C (Advantage Actor Critic)
- http://proceedings.mlr.press/v48/mniha16.pdf , https://arxiv.org/abs/1602.01783 (A3C, original version)
- https://blog.openai.com/baselines-acktr-a2c/ (A2C, synchronized version)
ACKTR (Actor Critic using Kronecker-Factored Trust Region)
PPO (Proximal Policy Optimization)
AOC (Advantage Option Critic)
- https://arxiv.org/abs/1609.05140 (DQN-like option critic)
- https://arxiv.org/abs/1709.04571 (A3C-like option critic called A2OC)
Implementaions I referenced
Thank you!
https://github.com/openai/baselines
https://github.com/ikostrikov/pytorch-a2c-ppo-acktr
https://github.com/ShangtongZhang/DeepRL
https://github.com/chainer/chainerrl
https://github.com/Thrandis/EKFAC-pytorch (for ACKTR)
https://github.com/jeanharb/a2oc_delib (for AOC)
https://github.com/sfujim/TD3 (for DDPG and TD3)
https://github.com/vitchyr/rlkit (for SAC)
License
This project is licensed under Apache License, Version 2.0 (LICENSE-APACHE or http://www.apache.org/licenses/LICENSE-2.0).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file rainy-0.5.3.tar.gz.
File metadata
- Download URL: rainy-0.5.3.tar.gz
- Upload date:
- Size: 53.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.8.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f8d28673f719e50bd17ed1df87932bed8336668ef06f3ab221ae19e058dbeee4
|
|
| MD5 |
b11d6f12675f941fe60f08548c733389
|
|
| BLAKE2b-256 |
b7a83885a778d7578b10a7d2e8bc59bef2c9e2798d085b81faec3338e7b73f24
|
File details
Details for the file rainy-0.5.3-py3-none-any.whl.
File metadata
- Download URL: rainy-0.5.3-py3-none-any.whl
- Upload date:
- Size: 76.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.28.1 CPython/3.8.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ebffc50b2df12786b69e444ca449d73110b4c47dc01362134612637900110015
|
|
| MD5 |
808810957dccb9b59b2dfd8ba35d385f
|
|
| BLAKE2b-256 |
42d68ab20d9be8acb3d7d7cc22b3e1cb243fea55fb51e27caa89cc76adb902a3
|