A reinforcement learning module

Reinforcement

The Reinforcement module aims to provide simple implementations of various reinforcement learning algorithms. It tries to stay agnostic about use cases while offering different solutions for policy selection, value- and Q-function approximation, and agents that implement the learning algorithms.

The project is at an early stage and currently provides only an n-step temporal difference learning agent. Its main purpose is to further my own understanding of reinforcement learning, with no particular application in mind.

Module structure

The module is organised into three main parts: policies, reward functions and agents. Each part provides components needed to construct a reinforcement learning agent. Components should depend on each other as little as possible and share a simple common interface to facilitate modular construction of agents.
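To illustrate the idea of loosely coupled components behind a small shared interface, here is a minimal sketch. All names in it (`Policy`, `ValueFunction`, `GreedyPolicy`, `DictValues`, `Agent`, `select`, `value`, `next_action`) are hypothetical and chosen for illustration only; they are not taken from this package's actual API.

```python
from typing import Hashable, Protocol, Sequence


class ValueFunction(Protocol):
    """Hypothetical interface: estimates the value of a state-action pair."""

    def value(self, state: Hashable, action: Hashable) -> float: ...


class Policy(Protocol):
    """Hypothetical interface: picks an index given per-action estimates."""

    def select(self, action_values: Sequence[float]) -> int: ...


class GreedyPolicy:
    """Always picks the first action with the highest estimate."""

    def select(self, action_values):
        return max(range(len(action_values)), key=lambda i: action_values[i])


class DictValues:
    """Toy value function backed by a fixed dictionary."""

    def __init__(self, table, default=0.0):
        self._table = dict(table)
        self._default = default

    def value(self, state, action):
        return self._table.get((state, action), self._default)


class Agent:
    """Composes a policy and a value function without knowing their types."""

    def __init__(self, policy: Policy, values: ValueFunction, actions):
        self._policy = policy
        self._values = values
        self._actions = list(actions)

    def next_action(self, state):
        estimates = [self._values.value(state, a) for a in self._actions]
        return self._actions[self._policy.select(estimates)]


agent = Agent(GreedyPolicy(), DictValues({("s", "right"): 1.0}), ["left", "right"])
chosen = agent.next_action("s")
```

Because the agent in this sketch only relies on `select` and `value`, either component could be swapped for another implementation without touching the agent.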

Agents

This module contains the actual agents, which implement a reinforcement learning algorithm using a policy component and a reward function component. Currently only an n-step temporal difference agent is implemented.
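For readers unfamiliar with n-step temporal difference learning, the following is a minimal sketch of the textbook n-step TD update for state values, applied to one recorded episode. It is not this package's implementation; the function name `n_step_td_update`, its parameters, and the convention that `rewards[t]` is the reward received after leaving `states[t]` are assumptions made for the example.

```python
from collections import defaultdict


def n_step_td_update(values, states, rewards, n, alpha=0.1, gamma=1.0):
    """Apply n-step TD updates for one finished episode (illustrative only).

    states  : visited non-terminal states S_0 .. S_{T-1}
    rewards : rewards[t] is the reward received after leaving states[t]
    values  : mapping from state to its current value estimate
    """
    horizon = len(states)
    for tau in range(horizon):
        end = min(tau + n, horizon)
        # Discounted sum of up to n rewards following states[tau].
        target = sum(gamma ** (i - tau) * rewards[i] for i in range(tau, end))
        # Bootstrap from the estimate n steps ahead unless the episode
        # ended first (the terminal state has value 0).
        if tau + n < horizon:
            target += gamma ** n * values[states[tau + n]]
        values[states[tau]] += alpha * (target - values[states[tau]])
    return values


values = defaultdict(float)
episode_states = ["A", "B", "C"]
episode_rewards = [0.0, 0.0, 1.0]

# Replay the same episode twice with n = 2.
n_step_td_update(values, episode_states, episode_rewards, n=2, alpha=0.5)
first_pass = dict(values)
n_step_td_update(values, episode_states, episode_rewards, n=2, alpha=0.5)
```

With `n=1` this reduces to one-step TD, and with `n` at least as large as the episode length it becomes a Monte Carlo update, since no bootstrapping takes place.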

Policies

This module contains action selection policies used by reinforcement learning agents. Available policies: epsilon greedy; normalized epsilon greedy.
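As an illustration of the first policy, here is a minimal sketch of standard epsilon-greedy selection: with probability epsilon a random action is explored, otherwise the action with the highest estimate is exploited. The function name `epsilon_greedy` and its signature are assumptions for this example, not the package's API. The normalized variant is not sketched because its definition is not given here.

```python
import random


def epsilon_greedy(action_values, epsilon, rng=random):
    """Return an action index using epsilon-greedy selection (illustrative).

    action_values : sequence of value estimates, one per action
    epsilon       : exploration probability in [0, 1]
    rng           : object providing random() and randrange(), e.g. random.Random
    """
    if not action_values:
        raise ValueError("action_values must not be empty")
    if rng.random() < epsilon:
        # Explore: pick any action uniformly at random.
        return rng.randrange(len(action_values))
    # Exploit: pick the first action with the highest estimate.
    return max(range(len(action_values)), key=lambda i: action_values[i])


estimates = [0.1, 0.9, 0.3]
greedy_choice = epsilon_greedy(estimates, epsilon=0.0)
seeded = random.Random(0)
explored = [epsilon_greedy(estimates, epsilon=1.0, rng=seeded) for _ in range(20)]
```

Passing a seeded `random.Random` instance as `rng` keeps runs reproducible, which is convenient in tests.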

Reward Functions

This module contains the reward function implementations used by reinforcement learning agents. Available reward functions: value table, Q-table, Q neural network.
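To make the tabular case concrete, below is a minimal sketch of a Q-table that stores one estimate per state-action pair and moves it toward a target by a learning rate. The class name `QTable` and its methods `value` and `update` are hypothetical and only illustrate the general technique; the package's own classes may look different.

```python
from collections import defaultdict


class QTable:
    """Toy tabular Q-function (illustrative, not the package's class)."""

    def __init__(self, alpha=0.1, default=0.0):
        self.alpha = alpha
        # Unseen state-action pairs start at the default estimate.
        self._q = defaultdict(lambda: default)

    def value(self, state, action):
        return self._q[(state, action)]

    def update(self, state, action, target):
        # Move the stored estimate a fraction alpha toward the target.
        current = self._q[(state, action)]
        self._q[(state, action)] = current + self.alpha * (target - current)


q = QTable(alpha=0.5)
initial = q.value("s", "a")
q.update("s", "a", target=1.0)
after_one = q.value("s", "a")
q.update("s", "a", target=1.0)
after_two = q.value("s", "a")
```

A value table follows the same pattern with the state alone as the key.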

Models

Reinforcement also contains neural network implementations that can be used as non-linear reward function approximations. Currently two regression models are implemented, one using Keras and one using pure TensorFlow.

Architecture

This software is developed using Test Driven Development and tries to adhere to the SOLID principles as far as the author's abilities allow.
