Skip to main content

A clean reinforcement learning library

Project description

ToyRL

Python PyPI PyPI Downloads GitHub License Ask DeepWiki

Ruff Checked with mypy pre-commit Build Docs Test Codecov

Documentation

https://ai-glimpse.github.io/toyrl

Installation

pip install toyrl

Algorithms

  • REINFORCE
  • SARSA
  • DQN & Double DQN
  • A2C
  • PPO

References

  • kengz/SLM-Lab: Our implementations are inspired by the book "Foundations of Deep Reinforcement Learning" and the implementation of SLM-Lab.
  • vwxyzjn/cleanrl: The main reference for the implementation of the PPO implementation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

toyrl-0.3.1.tar.gz (20.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

toyrl-0.3.1-py3-none-any.whl (20.8 kB view details)

Uploaded Python 3

File details

Details for the file toyrl-0.3.1.tar.gz.

File metadata

  • Download URL: toyrl-0.3.1.tar.gz
  • Upload date:
  • Size: 20.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.7.3

File hashes

Hashes for toyrl-0.3.1.tar.gz
Algorithm Hash digest
SHA256 e179ff649e513f00fbf75fa38bc23fd05e4940b302b30e6c474085d3d561d0a1
MD5 3021cced0ebefa97e974c8bfbe5f4dea
BLAKE2b-256 2ca62db5eafc0e8e38f9646ef375991d12884c3f06780288dd5a3c6cf79bec03

See more details on using hashes here.

File details

Details for the file toyrl-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: toyrl-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 20.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.7.3

File hashes

Hashes for toyrl-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 62f61e6323c8cc02d3fd5c3e3632fd55fc7df3a9a45988ecb84cb320e28c6687
MD5 13015839c19f76dd7eb213c0a797afb7
BLAKE2b-256 5923c7bee07cefba6672fab38422d4e6f37ea003c197cac49edc66816f324795

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page