Easy Deep Reinforcement Learning for PyTorch

Torch-Agents

This project is in early development. Inspired by TF-Agents, I hope to make a similar library for PyTorch: simple enough for me to understand, and powerful enough to express all my crazy ideas.

Currently supported: (Double) Deep Q Networks and OpenAI Gym environments.
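
For readers new to the "Double" part of Double DQN, here is a minimal sketch of how the training target is usually computed: the online network picks the best next action, and the target network scores it. This is not the Torch-Agents API. The function name `double_dqn_targets` and all of its parameters are hypothetical, and it uses plain Python lists instead of PyTorch tensors so the arithmetic stays visible.

```python
def double_dqn_targets(rewards, dones, next_q_online, next_q_target, gamma=0.99):
    """Compute Double DQN targets for a batch of transitions.

    All names here are illustrative assumptions, not part of Torch-Agents.
    rewards:       list of floats, one per transition
    dones:         list of bools, True if the episode ended on that transition
    next_q_online: per-transition list of Q-values for the next state (online network)
    next_q_target: per-transition list of Q-values for the next state (target network)
    """
    targets = []
    for reward, done, q_online, q_target in zip(
        rewards, dones, next_q_online, next_q_target
    ):
        if done:
            # Terminal transition: nothing to bootstrap from.
            targets.append(reward)
            continue
        # Select the action with the online network...
        best_action = max(range(len(q_online)), key=q_online.__getitem__)
        # ...but evaluate that action with the target network.
        targets.append(reward + gamma * q_target[best_action])
    return targets


# Two transitions: the first continues, the second ends the episode.
example = double_dqn_targets(
    rewards=[1.0, 0.0],
    dones=[False, True],
    next_q_online=[[0.2, 0.9], [0.5, 0.1]],
    next_q_target=[[3.0, 2.0], [4.0, 5.0]],
    gamma=0.5,
)
print(example)  # [2.0, 0.0]
```

In the first transition a plain DQN would bootstrap from the target network's own maximum (3.0), while Double DQN uses the target network's value for the action the online network prefers (2.0). Separating selection from evaluation is what reduces overestimation of Q-values.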

Results

DDQN Learning Pong after 13 hours (2500 games):

https://user-images.githubusercontent.com/10812888/172884861-843fb7b5-8823-4017-9042-25c5aff88438.mp4

DDQN Learning Breakout after 16 hours (10k games):

https://user-images.githubusercontent.com/10812888/173042669-0029ad6b-4c67-4661-81e0-3c6fa1f86211.mp4

DDQN Learning Space Invaders after 10k games:

https://user-images.githubusercontent.com/10812888/173040244-69920778-4644-4dec-9e49-bf617cd038cc.mp4

Installation
