A small package for Maximum Entropy Inverse Reinforcement Learning on simple MDPs

Project description

Maximum Entropy Inverse Reinforcement Learning

This is a Python implementation of the Maximum Entropy Inverse Reinforcement Learning (MaxEnt IRL) algorithm, based on the paper of the same name by Ziebart et al., and of the Maximum Causal Entropy Inverse Reinforcement Learning (MaxCausalEnt IRL) algorithm, based on Ziebart's PhD thesis.

You may also want to have a look at the accompanying presentation.

For an example demonstrating how the Maximum (non-causal) Entropy IRL algorithm works, see the corresponding Jupyter notebook (notebooks/maxent.ipynb). Note that this Python package contains a slightly more optimized implementation of the algorithms than the notebook.
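To give a rough feel for the non-causal MaxEnt IRL idea, here is a minimal, self-contained sketch. It does not use the irl_maxent API and is not the package's implementation: the 4-state chain world, the function names (rollout, feature_counts, trajectory_probs, maxent_irl), the learning rate, and the iteration count are all assumptions made for illustration. It enumerates every trajectory of a tiny deterministic MDP, models them as P(tau) proportional to exp(theta . f(tau)), and performs gradient ascent on the demonstration log-likelihood, whose gradient is the empirical feature counts minus the expected feature counts under the model.

```python
import math
from itertools import product

# Hypothetical toy setup (not part of irl_maxent): a deterministic 4-state chain.
N_STATES = 4
HORIZON = 3          # number of actions per trajectory
ACTIONS = (-1, +1)   # move left / move right


def rollout(actions, start=0):
    """Return the state sequence visited when applying `actions` from `start`."""
    states = [start]
    for a in actions:
        # Moves are clamped at both ends of the chain.
        states.append(min(max(states[-1] + a, 0), N_STATES - 1))
    return tuple(states)


def feature_counts(states):
    """One-hot state features summed along a trajectory (state visit counts)."""
    counts = [0.0] * N_STATES
    for s in states:
        counts[s] += 1.0
    return counts


def trajectory_probs(theta, counts):
    """MaxEnt distribution: P(tau) proportional to exp(theta . f(tau))."""
    scores = [sum(t * c for t, c in zip(theta, f)) for f in counts]
    top = max(scores)  # subtract the maximum for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


def maxent_irl(demos, trajectories, lr=0.1, iterations=200):
    """Gradient ascent on the log-likelihood of the demonstrations."""
    counts = [feature_counts(tr) for tr in trajectories]
    demo_counts = [feature_counts(d) for d in demos]
    f_demo = [sum(c[i] for c in demo_counts) / len(demos) for i in range(N_STATES)]
    theta = [0.0] * N_STATES
    for _ in range(iterations):
        probs = trajectory_probs(theta, counts)
        f_model = [sum(p * c[i] for p, c in zip(probs, counts)) for i in range(N_STATES)]
        # Gradient = empirical feature counts - expected feature counts.
        theta = [t + lr * (fd - fm) for t, fd, fm in zip(theta, f_demo, f_model)]
    return theta


trajectories = [rollout(a) for a in product(ACTIONS, repeat=HORIZON)]
demo = rollout((+1, +1, +1))  # the expert always moves right
theta = maxent_irl([demo], trajectories)
probs = trajectory_probs(theta, [feature_counts(tr) for tr in trajectories])
best = trajectories[probs.index(max(probs))]
print("learned reward weights:", [round(t, 2) for t in theta])
print("most likely trajectory:", best)
```

Enumerating trajectories only works for very small problems; it is used here purely to keep the sketch short and exact.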

For an example of how to use this framework, see the example.py file. The framework contains:

  • Two GridWorld implementations for demonstration (irl_maxent.gridworld)
  • The algorithm implementations (irl_maxent.maxent)
  • A gradient-based optimizer framework (irl_maxent.optimizer)
  • Plotting helper functions (irl_maxent.plot)
  • An MDP solver framework, i.e. value iteration and corresponding utilities (irl_maxent.solver)
  • A trajectory and trajectory-generation framework (irl_maxent.trajectory)
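The solver component listed above is described as value iteration. As a generic illustration of that technique (not the irl_maxent.solver API, whose function names and signatures are not shown here), the following sketch runs value iteration on a hypothetical 4-state chain. The transition-table layout, the function names value_iteration and greedy_policy, the discount factor, and the convergence threshold are all assumptions.

```python
def value_iteration(transitions, reward, discount=0.9, eps=1e-8):
    """Apply the Bellman optimality backup until the values stop changing.

    transitions[s][a] is a list of (probability, next_state) pairs.
    """
    value = [0.0] * len(reward)
    while True:
        updated = [
            reward[s] + discount * max(
                sum(p * value[s_next] for p, s_next in outcomes)
                for outcomes in transitions[s]
            )
            for s in range(len(reward))
        ]
        delta = max(abs(u - v) for u, v in zip(updated, value))
        value = updated
        if delta < eps:
            return value


def greedy_policy(transitions, value):
    """Pick, in every state, the action index with the highest expected next value."""
    policy = []
    for actions in transitions:
        expected = [sum(p * value[s_next] for p, s_next in outcomes) for outcomes in actions]
        policy.append(expected.index(max(expected)))
    return policy


# Hypothetical 4-state chain: action 0 moves left, action 1 moves right (clamped).
n_states = 4
transitions = [
    [[(1.0, max(s - 1, 0))], [(1.0, min(s + 1, n_states - 1))]]
    for s in range(n_states)
]
reward = [0.0, 0.0, 0.0, 1.0]  # reward only in the right-most state

value = value_iteration(transitions, reward)
policy = greedy_policy(transitions, value)
print([round(v, 2) for v in value])  # approximately [7.29, 8.1, 9.0, 10.0]
print(policy)                        # [1, 1, 1, 1], i.e. always move right
```

The table format also accepts stochastic outcomes (several pairs per action), which is how a slippery grid world could be expressed in this sketch.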

Download files

Source Distribution

irl-maxent-0.1.0.tar.gz (15.7 kB)

Uploaded Source

Built Distribution

irl_maxent-0.1.0-py3-none-any.whl (17.4 kB)

Uploaded Python 3

File details

Details for the file irl-maxent-0.1.0.tar.gz.

File metadata

  • Download URL: irl-maxent-0.1.0.tar.gz
  • Size: 15.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.63.0 CPython/3.10.2

File hashes

Hashes for irl-maxent-0.1.0.tar.gz
Algorithm    Hash digest
SHA256       4554e87e88d920b732ef701ae7b312e5845a6e75d78bc01c9bbd2f71259b9e19
MD5          e208417a03404397dd4ffe3e98c0a475
BLAKE2b-256  c68820a3c60d3dca2979f98a1184104a7ecd1ab9aa2f2a0283d1fa8b5b62ffa9
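One way to check a downloaded archive against the SHA256 digest listed above is sketched below, using only Python's standard hashlib module. The helper name sha256_of, the chunk size, and the local path of the archive are assumptions; adjust the path to wherever the file was actually saved.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Digest copied from the table above; the local path is an assumption.
EXPECTED_SHA256 = "4554e87e88d920b732ef701ae7b312e5845a6e75d78bc01c9bbd2f71259b9e19"
archive = Path("irl-maxent-0.1.0.tar.gz")

if archive.exists():
    matches = sha256_of(archive) == EXPECTED_SHA256
    print("hash OK" if matches else "hash MISMATCH")
else:
    print(f"{archive} not found; download it first")
```

The same function can be reused for the wheel by swapping in its file name and SHA256 digest.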

File details

Details for the file irl_maxent-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: irl_maxent-0.1.0-py3-none-any.whl
  • Size: 17.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.27.1 requests-toolbelt/0.9.1 tqdm/4.63.0 CPython/3.10.2

File hashes

Hashes for irl_maxent-0.1.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       6a200de277fe585406b88e6533b29a2b2cbcb229732021e956a2af1fcb0ccce4
MD5          7e53cd0e13937654ed3c14eae6af1cc0
BLAKE2b-256  324544f654e56aa8c7eaa2a458e19b88afadda118cc4719178e8cf4c4ffe1fe2
