A plotter for reinforcement learning (RL)
Project description
rl-plotter
This is a simple tool which can plot learning curves easily for reinforcement learning (RL).
Installation
from PIP
pip install rl_plotter
from source
python setup.py install
Usage
Add our logger in your code of evaluation
from rl_plotter.logger import Logger
logger = Logger(exp_name="your_exp_name", log_dir, env_name)
路路路路
logger.update(score=evaluation_score_list, total_steps=current_training_steps)
or just use OpenAI-baseline bench.Monitor (recommended):
from baselines import bench
env = bench.Monitor(env, log_dir)
After the training or when you are training your agent, you can plot the learning curves in this way:
-
switch to log directory (default: ./)
-
run command to plot:
rl_plotter --save --show
more general commands in practice:
rl_plotter --save --show --avg_group --shaded_std
rl_plotter --save --show --avg_group --shaded_std --time
rl_plotter --save --show --avg_group --shaded_std --shaded_err
for help:
rl_plotter --help
and you can find parameters to custom the style of your curves.
optional arguments:
-h, --help show this help message and exit
--fig_length matplotlib figure length (default: 6)
--fig_width matplotlib figure width (default: 6)
--style matplotlib figure style (default: seaborn)
--title matplotlib figure title (default: None)
--xlabel matplotlib figure xlabel
--xkey x-axis key in csv file (default: l)
--ykey y-axis key in csv file (default: r)
--ylabel matplotlib figure ylabel
--smooth smooth radius of y axis (default: 10)
--resample if not zero, size of the uniform grid in x direction
to resample onto. Resampling is performed via
symmetric EMA smoothing (see the docstring for
symmetric_ema). Default is zero (no resampling). Note
that if average_group is True, resampling is
necessary; in that case, default value is 512.
(default: 512)
--smooth_step when resampling (i.e. when resample > 0 or
average_group is True), use this EMA decay parameter
(in units of the new grid step). See docstrings for
decay_steps in symmetric_ema or one_sided_ema functions.
(default: 1.0)
--avg_group average the curves in the same group and plot the mean
--shaded_std shaded region corresponding to standard deviation of the group
--shaded_err shaded region corresponding to error in mean estimate of the group
--legend_loc location of legend
--legend_outside place the legend outside of the figure
--no_legend_group_num don't show num of group in legend
--time enable this will set x_key to t, and activate parameters about time
--time_unit parameters about time, x axis time unit (default: h)
--time_interval parameters about time, x axis time interval (default: 1)
--xformat x-axis format
--xlim x-axis limitation (default: None)
--log_dir log dir (default: ./)
--filename csv filename
--show show figure
--save save figure
--dpi figure dpi (default: 400)
finally, the learning curves looks like this:
Features
- custom logger, style, key, label, interval, and so on ...
- multi-experiment plotter
- x-axis formatter features
- compatible with OpenAI-baseline monitor data style
- corresponding color for specific experiment
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
rl_plotter-2.2.0.tar.gz
(9.4 kB
view hashes)
Built Distribution
rl_plotter-2.2.0-py3-none-any.whl
(29.1 kB
view hashes)
Close
Hashes for rl_plotter-2.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9be751d9c3a716c7e0e73afcc1cd49f86f3d1e5041e829d8bbdba464d2d9cf89 |
|
MD5 | b3e77c92e60c9a248b205a5b6857913d |
|
BLAKE2b-256 | 32dfa55c4d664d88a3153b328ab0432b3d04946fde1c95ba782a01b5847aa36a |