A small package for Maximum Entropy Inverse Reinforcement Learning on simple MDPs
Project description
Maximum Entropy Inverse Reinforcement Learning
This is a python implementation of the Maximum Entropy Inverse Reinforcement Learning (MaxEnt IRL) algorithm based on the similarly named paper by Ziebart et al. and the Maximum Causal Entropy Inverse Reinforcement Learning (MaxCausalEnt IRL) algorithm based on his PhD thesis.
You may also want to have a look at the accompanying presentation.
For an example demonstrating how the Maximum (non-causal) Entropy IRL algorithm works, see the corresponding Jupyter notebook (notebooks/maxent.ipynb
).
Note that this python package contains a slightly more optimized implementation of the algorithms.
For an example on how to use this framework, have a look at the example.py
file.
The framework contains:
- Two GridWorld implementations for demonstration (
irl_maxent.gridworld
) - The algorithm implementations (
irl_maxent.maxent
) - A gradient based optimizer framework (
irl_maxent.optimizer
) - Plotting helper functions (
irl_maxent.plot
) - A MDP solver framework, i.e. value iteration and corresponding utilities (
irl_maxent.solver
) - A trajectory/trajectory generation framework (
irl_maxent.trajectory
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for irl_maxent-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6a200de277fe585406b88e6533b29a2b2cbcb229732021e956a2af1fcb0ccce4 |
|
MD5 | 7e53cd0e13937654ed3c14eae6af1cc0 |
|
BLAKE2b-256 | 324544f654e56aa8c7eaa2a458e19b88afadda118cc4719178e8cf4c4ffe1fe2 |