A plotter for reinforcement learning
rl-plotter
rl-plotter is a simple tool for plotting learning curves in reinforcement learning.
Installation
From pip:
pip install rl_plotter
From source:
python3 setup.py install
Examples
First, add a logger to your training code (a DQN example):
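from rl_plotter.logger import Logger

# DQN, env, EPSILON and REPLAY_MEMORY_SIZE come from your own training code.
def train(name):
    dqn = DQN()
    logger = Logger(name, env_name='PongNoFrameskip-v4', use_tensorboard=False)
    while True:  # one iteration per episode; add your own stopping condition
        s = env.reset()
        episode_reward = 0  # reset the return at the start of each episode
        while True:
            total_step = logger.add_step()  # count one environment step
            a = dqn.select_action(s, EPSILON)
            s_, r, done, info = env.step(a)
            dqn.store_transition(s, a, r, s_)
            episode_reward += r
            # learn only after the replay memory has been filled
            if dqn.replay_memory.memory_counter > REPLAY_MEMORY_SIZE:
                loss = dqn.learn()
                logger.add_loss(loss.cpu().item())
            if done:
                break
            s = s_
        logger.add_episode()
        logger.add_reward(episode_reward, freq=10)
    logger.finish()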
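The example above depends on a DQN agent and a Gym environment. To try the call order on its own, the following sketch uses a toy loop and a hypothetical `StubLogger` that only mirrors the method names seen above (`add_step`, `add_episode`, `add_reward`, `add_loss`, `finish`). It is not rl_plotter's `Logger` and writes nothing to disk; `run`, its parameters, and the random rewards are assumptions made purely for illustration.

```python
import random


class StubLogger:
    """Hypothetical stand-in that only mirrors the method names used above.

    It is NOT rl_plotter's Logger; it just keeps counters and lists in memory.
    """

    def __init__(self, name):
        self.name = name
        self.total_steps = 0
        self.episodes = 0
        self.rewards = []
        self.losses = []
        self.finished = False

    def add_step(self):
        # Return the running step count, as the DQN example assigns it.
        self.total_steps += 1
        return self.total_steps

    def add_episode(self):
        self.episodes += 1
        return self.episodes

    def add_reward(self, reward, freq=10):
        # freq is accepted only to match the call above; this stub ignores it.
        self.rewards.append(reward)

    def add_loss(self, loss):
        self.losses.append(loss)

    def finish(self):
        self.finished = True


def run(num_episodes=3, steps_per_episode=5, seed=0):
    """Toy loop: random rewards, fixed-length episodes, no real agent."""
    rng = random.Random(seed)
    logger = StubLogger("toy-run")
    for _ in range(num_episodes):
        episode_reward = 0.0              # reset at the start of every episode
        for _ in range(steps_per_episode):
            logger.add_step()             # once per environment step
            episode_reward += rng.random()
        logger.add_episode()              # once per finished episode
        logger.add_reward(episode_reward, freq=10)
    logger.finish()                       # once, after the last episode
    return logger


logger = run()
print(logger.total_steps, logger.episodes, len(logger.rewards))
```

The bounded `for` loop is a deliberate choice here: it guarantees that `finish()` is reached, which an unconditional `while True` loop would not.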
During or after training, you can plot the learning curves like this:
python -m rl_plotter.plotter
For help, use:
python -m rl_plotter.plotter --help
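If you prefer to launch the plotter from a Python script rather than a shell, one possible approach is to run the same module in a child process. The sketch below uses only the standard library; the helper names `plotter_command` and `run_plotter` are hypothetical, and the only flag it passes is `--help`, as shown above.

```python
import subprocess
import sys


def plotter_command(*extra_args):
    """Build the argument list for `python -m rl_plotter.plotter`."""
    # sys.executable keeps the child process on the current interpreter.
    return [sys.executable, "-m", "rl_plotter.plotter", *extra_args]


def run_plotter(*extra_args):
    """Run the plotter in a child process and return its exit code.

    Requires rl_plotter to be installed for the current interpreter.
    """
    return subprocess.run(plotter_command(*extra_args), check=False).returncode


# Show the command that would be executed; call run_plotter("--help") to run it.
print(plotter_command("--help"))
```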
The learning curves look like this:
[Figure: example learning curves]
You can customize the style of your curves by using the parameters of `rl_plotter.plotter` or by modifying `rl_plotter.plotter` itself.
To Do
- reinforcement learning plot tools
- timestamp features
- history experiment data plot tools
- ~~basic data plot tools (including ML-Loss plot)~~
- dynamic plot tools
Download files
Source Distribution
rl_plotter-1.0.4.tar.gz (11.5 kB)
Built Distribution
rl_plotter-1.0.4-py3-none-any.whl (12.5 kB)
Hashes for rl_plotter-1.0.4-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | a9215f964375c7026744e38db4c8e69bee66ec0faed54878969270653b3c0c86
MD5 | 3e18bb7a4581116fde1cdfb5b272b065
BLAKE2b-256 | 78601bf37e77a896ca1417db4697a8b47f3a5edbaff319034e13c83691f274ee
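The SHA256 digest listed above can be used to check a downloaded wheel. The following is a minimal sketch using Python's standard `hashlib`; the helper names and the local file path in the comment are hypothetical, and the expected digest is copied from the SHA256 entry above.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare the file's digest with a published one, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# SHA256 published above for rl_plotter-1.0.4-py3-none-any.whl
EXPECTED_SHA256 = "a9215f964375c7026744e38db4c8e69bee66ec0faed54878969270653b3c0c86"

# Example call (the local path is hypothetical):
# matches_published_hash("rl_plotter-1.0.4-py3-none-any.whl", EXPECTED_SHA256)
```

Reading in chunks keeps memory use constant regardless of file size, which matters little for a 12.5 kB wheel but makes the helper reusable for larger downloads.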