reagent

Facebook RL

Project description

Applied Reinforcement Learning @ Facebook

Overview

ReAgent is an open-source, end-to-end platform for applied reinforcement learning (RL), developed and used at Facebook. It is built in Python and uses PyTorch for modeling and training and TorchScript for model serving. The platform provides workflows for training popular deep RL algorithms, and it includes data preprocessing, feature transformation, distributed training, counterfactual policy evaluation, and optimized serving. For more detail about ReAgent, see the white paper listed under Citing.

The platform was formerly named "Horizon"; we recently adopted the name "ReAgent" to emphasize its broader scope in decision making and reasoning.

Algorithms Supported

Discrete-Action DQN
Parametric-Action DQN
Double DQN, Dueling DQN, Dueling Double DQN
Distributional RL: C51 and QR-DQN
Twin Delayed DDPG (TD3)
Soft Actor-Critic (SAC)

Installation

ReAgent can be installed via Docker or manually. Detailed installation instructions can be found here.

Usage

Detailed instructions on how to use ReAgent Models can be found here.

The ReAgent Serving Platform (RASP) tutorial is available here.

License

ReAgent is released under a BSD 3-Clause license. Find out more about it here.

Citing

@article{gauci2018horizon,
  title={Horizon: Facebook's Open Source Applied Reinforcement Learning Platform},
  author={Gauci, Jason and Conti, Edoardo and Liang, Yitao and Virochsiri, Kittipat and Chen, Zhengxing and He, Yuchen and Kaden, Zachary and Narayanan, Vivek and Ye, Xiaohui},
  journal={arXiv preprint arXiv:1811.00260},
  year={2018}
}

Project details

Release history

0.1 (this version): May 27, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

reagent-0.1.tar.gz (3.1 MB)

Uploaded May 27, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

reagent-0.1-py3-none-any.whl (345.2 kB)

Uploaded May 27, 2020 Python 3
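
The wheel file name itself encodes compatibility information: in reagent-0.1-py3-none-any.whl, the fields after the version are the Python tag (py3), the ABI tag (none), and the platform tag (any), which together indicate a pure-Python wheel for any Python 3 on any platform. As a minimal sketch, the name can be split into those fields as follows. The `WheelName` type and `parse_wheel_name` function are hypothetical helpers written for this illustration, not part of ReAgent or of any packaging tool, and the sketch assumes the file name carries no optional build tag.

```python
from typing import NamedTuple


class WheelName(NamedTuple):
    """The five dash-separated fields of a wheel file name (no build tag)."""
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_name(filename):
    """Split a wheel file name into its components (hypothetical helper)."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) != 5:
        # A sixth field would be the optional build tag, which this sketch skips.
        raise ValueError(f"expected 5 dash-separated fields, got {len(parts)}")
    return WheelName(*parts)


info = parse_wheel_name("reagent-0.1-py3-none-any.whl")
print(info.distribution, info.version, info.python_tag, info.abi_tag, info.platform_tag)
```

For real tooling, a maintained parser from the Python packaging ecosystem is likely a better choice than hand-splitting the string; the sketch only shows what the fields mean.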

File details

Details for the file reagent-0.1.tar.gz.

File metadata

Download URL: reagent-0.1.tar.gz
Upload date: May 27, 2020
Size: 3.1 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.7

File hashes

Hashes for reagent-0.1.tar.gz
Algorithm	Hash digest
SHA256	`e7a77b3bf53f1f2be73b43a256db30b413bda1235927f7ee73ce6c462e80fed9`
MD5	`a49cb98bbbc5033e494888b7c81c1787`
BLAKE2b-256	`12ce8fc28d7d2993df85964f7f010d5513e1b981ca3ae0a04d67c8f36280d134`

See more details on using hashes here.
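
As a rough illustration of how the published digest can be used, the sketch below recomputes the SHA256 of a downloaded archive with Python's standard `hashlib` module and compares it with the value from the table above. The function names `sha256_of_file` and `matches_published_hash` are hypothetical, and the local path `reagent-0.1.tar.gz` is an assumption about where the download was saved.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the "File hashes" table for reagent-0.1.tar.gz.
EXPECTED_SHA256 = "e7a77b3bf53f1f2be73b43a256db30b413bda1235927f7ee73ce6c462e80fed9"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare the file's digest with a published one, ignoring letter case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    archive = Path("reagent-0.1.tar.gz")  # assumed local download location
    if archive.exists():
        ok = matches_published_hash(archive, EXPECTED_SHA256)
        print("hash OK" if ok else "hash MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use flat regardless of archive size. A mismatch means the file differs from the one that was uploaded and should not be installed.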

File details

Details for the file reagent-0.1-py3-none-any.whl.

File metadata

Download URL: reagent-0.1-py3-none-any.whl
Upload date: May 27, 2020
Size: 345.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.4.0 requests-toolbelt/0.9.1 tqdm/4.45.0 CPython/3.7.7

File hashes

Hashes for reagent-0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7859aeb5da62fde0ba3c7df74ac1a85a549f6e39e41edc9c154a388512fd5efe`
MD5	`128e72b5875f53516695ee9c7207b6d5`
BLAKE2b-256	`5c277c185c6c2613c4054b139ce1987eb456165f2560481b5750609434fb7ef5`

See more details on using hashes here.
