A plotter for reinforcement learning
rl-plotter
rl-plotter is a simple tool for plotting reinforcement learning curves.
Installation
From pip:
pip install rl_plotter
From source (run in the repository root):
python3 setup.py install
Examples
First, add a logger to your training code (DQN is used as the example here):
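from rl_plotter.logger import Logger

# DQN, env, EPSILON and REPLAY_MEMORY_SIZE come from your own training code.
def train(name):
    dqn = DQN()
    logger = Logger(name, env_name='PongNoFrameskip-v4', use_tensorboard=False)
    while True:  # one iteration per episode; add your own stopping condition
        s = env.reset()
        episode_reward = 0  # reset the return at the start of every episode
        while True:
            total_step = logger.add_step()
            a = dqn.select_action(s, EPSILON)
            s_, r, done, info = env.step(a)
            dqn.store_transition(s, a, r, s_)
            episode_reward += r
            # start learning once the replay memory has filled up
            if dqn.replay_memory.memory_counter > REPLAY_MEMORY_SIZE:
                loss = dqn.learn()
                logger.add_loss(loss.cpu().item())
            if done:
                break
            s = s_
        logger.add_episode()
        logger.add_reward(episode_reward, freq=10)
    logger.finish()  # reached once the episode loop exits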
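To see the shape of the data such a training loop produces without installing anything, the logging pattern can be sketched with a small stand-in. This is only an illustration: `SketchLogger`, `run_toy_training`, and the CSV column names are hypothetical and are not part of rl_plotter, whose `Logger` may store its data differently.

```python
import csv
import io


class SketchLogger:
    """Hypothetical stand-in (NOT rl_plotter's Logger): counts steps and episodes."""

    def __init__(self):
        self.total_steps = 0
        self.episodes = 0
        self.rows = []  # one (episode, total_steps, reward) tuple per episode

    def add_step(self):
        # Called once per environment step; returns the running step count.
        self.total_steps += 1
        return self.total_steps

    def add_episode(self):
        # Called once at the end of every episode.
        self.episodes += 1
        return self.episodes

    def add_reward(self, reward):
        # Record the episode return together with the current counters.
        self.rows.append((self.episodes, self.total_steps, reward))

    def to_csv(self):
        # Serialise the recorded rows as CSV text (assumed layout, for illustration).
        buf = io.StringIO()
        writer = csv.writer(buf, lineterminator="\n")
        writer.writerow(["episode", "total_steps", "reward"])
        writer.writerows(self.rows)
        return buf.getvalue()


def run_toy_training(num_episodes=3, steps_per_episode=4):
    """Toy loop with the same call order as the DQN example: step, episode, reward."""
    logger = SketchLogger()
    for _ in range(num_episodes):
        episode_reward = 0.0
        for _ in range(steps_per_episode):
            logger.add_step()
            episode_reward += 1.0  # toy reward of 1 per step
        logger.add_episode()
        logger.add_reward(episode_reward)
    return logger
```

Calling `run_toy_training().to_csv()` yields one row per episode, which is the minimum a learning-curve plot needs: an x value (episode or total steps) and a y value (episode reward).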
After training, or while your agent is still training, you can plot the learning curves with:
python plotter.py
The learning curves look like this:
[Figure: example learning curves]
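Raw episode rewards are usually noisy, so learning curves are commonly drawn from a smoothed series. The sketch below shows one common choice, a trailing moving average, in plain Python. It is an assumption about how such curves are typically prepared, not a description of what `plotter.py` does internally; `moving_average` and its `window` parameter are hypothetical names.

```python
def moving_average(values, window):
    """Trailing moving average.

    Each output point is the mean of the last `window` inputs; the first few
    points average over however many inputs are available so far.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    smoothed = []
    running_sum = 0.0
    for i, value in enumerate(values):
        running_sum += value
        if i >= window:
            # Drop the value that has just left the window.
            running_sum -= values[i - window]
        count = min(i + 1, window)
        smoothed.append(running_sum / count)
    return smoothed
```

For example, `moving_average([1, 2, 3, 4, 5], 2)` returns `[1.0, 1.5, 2.5, 3.5, 4.5]`. A larger window gives a smoother curve but reacts more slowly to real changes in performance.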
To Do
- reinforcement learning plot tools
- timestamp features
- history experiment data plot tools
- ~~basic data plot tools (including ML-Loss plot)~~
- dynamic plot tools