Skip to main content

LSPI algorithm in Python

Project description

This is a Python implementation of the Least Squares Policy Iteration (LSPI) reinforcement learning algorithm. For more information on the algorithm please refer to the paper

“Least-Squares Policy Iteration.”
Lagoudakis, Michail G., and Ronald Parr.
Journal of Machine Learning Research 4, 2003.

You can also visit their website where more information and a Matlab version is provided.

http://www.cs.duke.edu/research/AI/LSPI/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lspi-python-1.0.1.tar.gz (11.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lspi_python-1.0.1-py2-none-any.whl (15.8 kB view details)

Uploaded Python 2

File details

Details for the file lspi-python-1.0.1.tar.gz.

File metadata

  • Download URL: lspi-python-1.0.1.tar.gz
  • Upload date:
  • Size: 11.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for lspi-python-1.0.1.tar.gz
Algorithm Hash digest
SHA256 5c7b77838d72b4e414df2b97d8b061459a4688b000ed820fe62408ae417935b1
MD5 bba3aa39b9470b51a0523480cdb91468
BLAKE2b-256 8c4c07d325c6c4de28e5e92a54b62c591ad05076493c6e862bcaca1c2c5981fc

See more details on using hashes here.

File details

Details for the file lspi_python-1.0.1-py2-none-any.whl.

File metadata

File hashes

Hashes for lspi_python-1.0.1-py2-none-any.whl
Algorithm Hash digest
SHA256 64310dc22cc36dcf4831594ed9e58766400a0e758a7373af5d55ef61f0e2d58f
MD5 87bb07e612afb1a23726de072a848ea6
BLAKE2b-256 209f04954494101667dbd0d0cf1b0254869925a2718b05fa116cfce81b81d254

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page