easyagents

reinforcement learning for practitioners.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Reinforcement Learning for Practitioners (v1.2, 19Q4)

Travis_Status License

Status: under active development, breaking changes may occur. Release notes.

EasyAgents logo

EasyAgents is a high level reinforcement learning api focusing on ease of use and simplicity. Written in Python and running on top of established reinforcement learning libraries like tf-Agents, tensorforce or keras-rl. Environments are implemented in OpenAI gym.

In collaboration with Oliver Zeigermann.

Features

provides the same, simple api across all libraries. Thus you can easily switch between different implementations and you don't have to learn for each of them a new api.
to create and run any algorithm you only need 2 lines of code, all the parameters are named consistently across all algorithms.
supports a broad set of different algorithms
runs inside jupyter notebooks as well as stand-alone, easy to install requiring only a single 'pip install'.
easy to understand, ready-made plots and logs to investigate the algorithms and environments behaviour

Try it on colab

1. Introduction (CartPole on colab): training, plotting, switching algorithms & backends. based on the classic reinforcement learning example balancing a stick on a cart.
2. Next steps & backend switching (Orso on colab): custom training, creating a movie & switching backends. gym environment based on a routing problem.
3. Creating your own environment (LineWorld on colab): implement a gym environment from scratch, workshop example.
4. Logging, seeding & plot clearing (on colab): Investigate how an agents api and how it interacts with the gym environment; how to set seeds; controlling jupyter output cell clearing

Scenario: simple

from easyagents.agents import PpoAgent
from easyagents.callbacks import plot

ppoAgent = PpoAgent('CartPole-v0')
ppoAgent.train([plot.State(), plot.Loss(), plot.Rewards()])

Scenario_Simple

Scenario: more detailed

from easyagents.agents import PpoAgent
from easyagents.callbacks import plot

ppoAgent = PpoAgent( 'Orso-v1',fc_layers=(500,500,500))
ppoAgent.train([plot.State(), plot.Loss(), plot.Rewards(), plot.Actions(), 
                plot.StepRewards(), plot.Steps(), plot.ToMovie()], 
                learning_rate = 0.0001, num_iterations = 500, max_steps_per_episode=50 )

Scenario_Detailed

Available Algorithms and Backends

algorithm	tf-Agents	tensorforce	keras-rl	easyagents class name
CEM	`not available`	`not available`	`yes`	CemAgent
Dqn	`yes`	`yes`	`yes`	DqnAgent
Double Dqn	`open`	`not available`	`yes`	DoubleDqnAgent
Dueling Dqn	`not available`	`not available`	`yes`	DuelingDqnAgent
Ppo	`yes`	`yes`	`not available`	PpoAgent
Random	`yes`	`yes`	`not available`	RandomAgent
REINFORCE	`yes`	`yes`	`not available`	ReinforceAgent
SAC	`preview`	`not available`	`not available`	SacAgent

[191001]

if you are interested in other algorithms, backends or hyperparameters let us know by creating an issue. We'll try our best to support you.
keras-rl is not compatible with tensorflow eager execution mode. Thus keras-rl based agents should run in a different python / jupyter notebook instance than tf-agents or tensorforce based agents.

Guiding Principles

easily train, evaluate & debug policies for (you own) gym environment over "designing new algorithms"
simple & consistent over "flexible & powerful"
inspired by keras:
- same api across all algorithms
- support different implementations of the same algorithm
- extensible (pluggable backends, plots & training schemes)

Installation

Install from pypi using pip:

pip install easyagents

Documentation

release notes, class diagram

EasyAgents may not be ideal if

you would like to leverage implementation specific advantages of an algorithm
you want to do distributed or in parallel reinforcement learning

Note

This repository is under active development and in an early stage. Thus any- and everything may (and probably should) change.
If you have any difficulties in installing or using easyagents please let us know by creating an issue. We'll try our best to help you.
Any ideas, help, suggestions, comments etc in python / open source development / reinforcement learning / whatever are more than welcome. Thanks a lot in advance.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

1.4.1

Feb 2, 2020

1.4.0

Nov 23, 2019

1.3.2

Nov 18, 2019

1.3.0

Nov 2, 2019

This version

1.2.2

Oct 9, 2019

1.1.13

Sep 27, 2019

1.0.1

Sep 11, 2019

0.0.41

Aug 19, 2019

0.0.38

Jul 31, 2019

0.0.20

Jul 5, 2019

0.0.1

Jun 19, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

easyagents-1.2.2.zip (65.9 kB view hashes)

Uploaded Oct 9, 2019 Source

Hashes for easyagents-1.2.2.zip

Hashes for easyagents-1.2.2.zip
Algorithm	Hash digest
SHA256	`1595ffd9c5a5659e8838137e1bf9a069ea88905102742d81667fd7cb7fb5c24e`
MD5	`9cc8735a3831ebaa8913eb145b8e58bc`
BLAKE2b-256	`dcf13b43df258cc813dfe3fd286ccfd227166d7befbb30d665eb06a9536f5246`