A simple RL library.

Project description

RLkit: A simple Reinforcement Learning library

This project is still a work in progress. More algorithms and detailed documentation are coming soon :)

Currently supported agents:

  1. Random agent
  2. REINFORCE (Policy Gradients)
  3. DQN
  4. DQN with baseline
  5. Actor-Critic
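
As background for the REINFORCE entry in the list above: policy-gradient methods of this kind weight each action by the discounted return that followed it. The sketch below shows only that return computation in plain Python. The function name `discounted_returns` and the default `gamma` are illustrative assumptions and are not taken from RLkit's code.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    Hypothetical helper for illustration only, not RLkit's API.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each return reuses the one computed after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

For example, `discounted_returns([1, 1, 1], gamma=0.5)` gives `[1.75, 1.5, 1.0]`: the last step keeps its own reward, and each earlier step adds half of the return that follows it.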

See examples for details on how to use the library.

Requirements:

tensorflow==1.11.0
gym==0.10.8
numpy==1.15.4
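
Because the requirements above are exact pins, it can help to check an environment against them before running anything. The following is a minimal sketch in plain Python. The helper names `parse_pins` and `find_mismatches` are hypothetical, and the `installed` mapping is assumed to be supplied by the caller.

```python
# The three pins listed in the Requirements section.
PINNED = """\
tensorflow==1.11.0
gym==0.10.8
numpy==1.15.4
"""


def parse_pins(text):
    """Turn 'name==version' lines into a {name: version} dict."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        name, _, version = line.partition("==")
        pins[name.strip()] = version.strip()
    return pins


def find_mismatches(pins, installed):
    """Return {name: (wanted, found)} for every pin that is not met.

    `found` is None when the package is missing from `installed`.
    """
    mismatches = {}
    for name, wanted in pins.items():
        found = installed.get(name)
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches
```

One way to build `installed` is from each module's `__version__` attribute, for example `{"numpy": numpy.__version__}`; an empty result from `find_mismatches` would then suggest the environment matches the pins.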

New in v0.2

  • Added DQN and DQN with baseline agents
  • Added ActorCritic agent
  • Added support for various activation functions

Upcoming

  • Dueling DQN
  • Support for logging and plotting
  • Support for adding seeds
  • Support for custom environments

Source Distribution

RLkit-0.2.0.tar.gz (6.2 kB view details)

Uploaded Source

File details

Details for the file RLkit-0.2.0.tar.gz.

File metadata

  • Download URL: RLkit-0.2.0.tar.gz
  • Upload date:
  • Size: 6.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.0 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/2.7.12

File hashes

Hashes for RLkit-0.2.0.tar.gz

Algorithm    Hash digest
SHA256       0a86ebba31f629fd8ff782369bd54985f761d4355dc0480babac1030cecde445
MD5          009e4b785051c77eb83ef6ad0a63726f
BLAKE2b-256  184f28fc7c52b4736c4dcec69659621fcfb3370add6aaba812354a637a9490e5
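
A downloaded archive can be checked against the SHA256 digest listed above. The sketch below uses only Python's standard `hashlib` module. The function names and the chunk size are illustrative choices, and the path to the archive is assumed to be supplied by the caller.

```python
import hashlib

# SHA256 digest published for RLkit-0.2.0.tar.gz in the table above.
EXPECTED_SHA256 = "0a86ebba31f629fd8ff782369bd54985f761d4355dc0480babac1030cecde445"


def sha256_of_file(path, chunk_size=65536):
    """Hash a file in chunks so large archives need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel keeps reading until read() returns b"".
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True when the file's SHA256 equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected("RLkit-0.2.0.tar.gz")` on the downloaded file should return `True` if the archive is intact; a `False` result suggests a corrupted or altered download.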