LSPI algorithm in Python
Project description
This is a Python implementation of the Least-Squares Policy Iteration (LSPI) reinforcement learning algorithm. For more information on the algorithm, please refer to the paper:
“Least-Squares Policy Iteration.”
Lagoudakis, Michail G., and Ronald Parr.
Journal of Machine Learning Research 4, 2003.
You can also visit the authors' website, where more information and a Matlab version are provided.
Download files
Source Distribution
lspi-python-1.0.1.tar.gz (11.8 kB)
Built Distribution
Hashes for lspi_python-1.0.1-py2-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | 64310dc22cc36dcf4831594ed9e58766400a0e758a7373af5d55ef61f0e2d58f |
MD5 | 87bb07e612afb1a23726de072a848ea6 |
BLAKE2b-256 | 209f04954494101667dbd0d0cf1b0254869925a2718b05fa116cfce81b81d254 |
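
A downloaded wheel can be checked against the SHA256 digest listed above before installing it. The following is a minimal sketch using only the standard library; the helper names `sha256_of_file` and `matches_published_hash` are hypothetical and not part of this package, and the local file path is an assumption about where the download was saved.

```python
import hashlib

# SHA256 digest published above for lspi_python-1.0.1-py2-none-any.whl.
EXPECTED_SHA256 = "64310dc22cc36dcf4831594ed9e58766400a0e758a7373af5d55ef61f0e2d58f"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected hex digest."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_published_hash("lspi_python-1.0.1-py2-none-any.whl")` should return `True` for an intact download saved under that name in the current directory, and `False` if the file was corrupted or altered.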