
A simple framework for distributed reinforcement learning in PyTorch.


rltorch (WIP)

rltorch provides a simple framework for reinforcement learning in PyTorch, with which you can implement distributed RL algorithms.

Installation

Install rltorch from source.

git clone https://github.com/ku2482/rltorch.git
cd rltorch
pip install -e .

You can also install it from PyPI.

pip install rltorch
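After either install method, a quick importability check can confirm that the package is visible to your interpreter. The following is a minimal sketch using only the standard library; the helper name `is_installed` is hypothetical, and it assumes the import name matches the PyPI name `rltorch`.

```python
import importlib.util


def is_installed(package_name: str) -> bool:
    """Return True if a top-level package with this name can be located."""
    # find_spec returns None when no top-level module or package is found.
    return importlib.util.find_spec(package_name) is not None


# Assumption: the import name is the same as the PyPI name, "rltorch".
print("rltorch importable:", is_installed("rltorch"))
```

If this prints `False` after an editable install, check that `pip` and `python` refer to the same environment.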

Examples

Ape-X

You can implement an Ape-X [1] agent as in this example.

python examples/atari/apex.py \
[--env_id str(default MsPacmanNoFrameskip-v4)] \
[--num_actors int(default 4)] [--cuda (optional)] \
[--seed int(default 0)]
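The bracketed options above are all optional and fall back to the listed defaults. As a rough illustration of that interface, the sketch below mirrors the documented flags with `argparse`. It is an assumption about how such a script could parse its options, not the actual contents of `examples/atari/apex.py`, and `build_parser` is a hypothetical name.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the flags documented for the Ape-X example."""
    parser = argparse.ArgumentParser(description="Ape-X example options (sketch)")
    parser.add_argument("--env_id", type=str, default="MsPacmanNoFrameskip-v4")
    parser.add_argument("--num_actors", type=int, default=4)
    # --cuda takes no value; passing it switches the option on.
    parser.add_argument("--cuda", action="store_true")
    parser.add_argument("--seed", type=int, default=0)
    return parser


# Parse an explicit argument list instead of sys.argv for illustration.
args = build_parser().parse_args(["--num_actors", "8", "--cuda"])
print(vars(args))
```

Options that are omitted, here `--env_id` and `--seed`, keep their defaults.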

Soft Actor-Critic

You can implement a Soft Actor-Critic [2, 3] agent as in this example. Note that you need a MuJoCo license and mujoco_py installed.

python examples/mujoco/sac.py \
[--env_id str(default HalfCheetah-v2)] \
[--num_actors int(default 1)] \
[--cuda (optional)] [--seed int(default 0)]

SAC-Discrete

You can implement a SAC-Discrete [4] agent as in this example.

python examples/mujoco/sac.py \
[--env_id str(default MsPacmanNoFrameskip-v4)] \
[--num_actors int(default 4)] \
[--cuda (optional)] [--seed int(default 0)]
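For readers new to SAC-Discrete, the main change from continuous SAC is that the soft state value can be computed exactly as an expectation over the discrete action distribution, V(s) = Σ_a π(a|s) [Q(s,a) − α log π(a|s)], as described in [4]. The sketch below illustrates that formula in plain Python for a single state. It is not rltorch's implementation, and the function name `soft_state_value` is hypothetical.

```python
import math
from typing import Sequence


def soft_state_value(
    q_values: Sequence[float], probs: Sequence[float], alpha: float
) -> float:
    """Exact soft value of one state under a discrete policy.

    q_values[a] is Q(s, a), probs[a] is pi(a|s), and alpha is the
    entropy temperature.
    """
    # Actions with zero probability contribute nothing (p * log p -> 0).
    return sum(
        p * (q - alpha * math.log(p))
        for q, p in zip(q_values, probs)
        if p > 0.0
    )


# Two actions, uniform policy: the entropy bonus adds alpha * ln(2).
print(soft_state_value([1.0, 3.0], [0.5, 0.5], alpha=1.0))
```

With `alpha=0` the expression reduces to the ordinary expected Q-value under the policy.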

References

[1] Horgan, Dan, et al. "Distributed prioritized experience replay." arXiv preprint arXiv:1803.00933 (2018).

[2] Haarnoja, Tuomas, et al. "Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor." arXiv preprint arXiv:1801.01290 (2018).

[3] Haarnoja, Tuomas, et al. "Soft actor-critic algorithms and applications." arXiv preprint arXiv:1812.05905 (2018).

[4] Christodoulou, Petros. "Soft Actor-Critic for Discrete Action Settings." arXiv preprint arXiv:1910.07207 (2019).


