A plotter for reinforcement learning
Project description
rl-plotter
This is a simple tool which can plot learning curves easily for reinforcement learning.
Installation
from PIP
pip install rl_plotter
from source
python3 setup.py install
Examples
First, add a logger in your code (for example: DQN):
from rl_plotter.logger import Logger
def train(name):
dqn = DQN()
logger = Logger(name, env_name='PongNoFrameskip-v4', use_tensorboard=False)
while True:
s = env.reset()
while True:
total_step = logger.add_step()
a = dqn.select_action(s, EPSILON)
s_, r, done, info = env.step(a)
dqn.store_transition(s, a, r, s_)
episode_reward += r
if dqn.replay_memory.memory_counter > REPLAY_MEMORY_SIZE:
loss = dqn.learn()
logger.add_loss(loss.cpu().item())
if done:
break
s = s_
logger.add_episode()
logger.add_reward(episode_reward, freq=10)
logger.finish()
After the training or when you are training your agent, you can plot the learning curves in this way:
python -m rl_plotter.plotter
The learning curves looks like this:
To Do
- reinforcement learning plot tools
- timestamp features
- history experiment data plot tools
- ~~basic data plot tools锛坕ncluding ML-Loss plot锛墌~
-
dynamic plot tools
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
rl_plotter-1.0.2.tar.gz
(10.9 kB
view hashes)
Built Distribution
rl_plotter-1.0.2-py3-none-any.whl
(12.2 kB
view hashes)
Close
Hashes for rl_plotter-1.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2514f7561e7b15a273ce6ca9253c9f4379adfeb910a67326ddd7ab144d021771 |
|
MD5 | 3ba6333c5dd739249f57e53689ef5db2 |
|
BLAKE2b-256 | a60953132a2e5fce7ab6d555ccd468221ede330b92d232fe3f833d5625e7b27b |