Skip to main content

A small package for Maximum Entropy Inverse Reinforcement Learning on simple MDPs

Project description

Maximum Entropy Inverse Reinforcement Learning

This is a python implementation of the Maximum Entropy Inverse Reinforcement Learning (MaxEnt IRL) algorithm based on the similarly named paper by Ziebart et al. and the Maximum Causal Entropy Inverse Reinforcement Learning (MaxCausalEnt IRL) algorithm based on his PhD thesis.

You may also want to have a look at the accompanying presentation.

For an example demonstrating how the Maximum (non-causal) Entropy IRL algorithm works, see the corresponding Jupyter notebook (notebooks/maxent.ipynb). Note that this python package contains a slightly more optimized implementation of the algorithms.

For an example on how to use this framework, have a look at the example.py file. The framework contains:

  • Two GridWorld implementations for demonstration (irl_maxent.gridworld)
  • The algorithm implementations (irl_maxent.maxent)
  • A gradient based optimizer framework (irl_maxent.optimizer)
  • Plotting helper functions (irl_maxent.plot)
  • A MDP solver framework, i.e. value iteration and corresponding utilities (irl_maxent.solver)
  • A trajectory/trajectory generation framework (irl_maxent.trajectory)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

irl-maxent-0.1.0.tar.gz (15.7 kB view hashes)

Uploaded Source

Built Distribution

irl_maxent-0.1.0-py3-none-any.whl (17.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page