Reinforcement Learning (RL) with linear function approximation.

# linear-rl

An implementation of the True Online SARSA(λ) algorithm with linear function approximation over a Fourier basis.
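As a rough illustration of the two ingredients named above, the sketch below builds Fourier basis features and applies one True Online SARSA(λ) update with a dutch trace, following the pseudocode in Sutton and Barto. It is not the linear-rl API: the names `fourier_features`, `dot`, and `true_online_sarsa_step` are hypothetical. It assumes states are already scaled to [0, 1]^d and that `x` and `x_next` are the feature vectors of the current and next state-action pairs.

```python
import itertools
import math


def fourier_features(state, order):
    """Fourier basis features for a state scaled to [0, 1]^d.

    Each feature is cos(pi * c . s) for an integer vector c in {0..order}^d,
    so the result has (order + 1) ** d entries.
    """
    coeffs = itertools.product(range(order + 1), repeat=len(state))
    return [math.cos(math.pi * sum(c * s for c, s in zip(cs, state)))
            for cs in coeffs]


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def true_online_sarsa_step(w, z, q_old, x, x_next, reward, alpha, gamma, lam):
    """One True Online SARSA(lambda) update (hypothetical helper).

    w, z: weight and eligibility-trace vectors.
    x, x_next: features of the current and next state-action pairs
    (pass all zeros for x_next at a terminal state).
    Returns the updated (w, z, q_old).
    """
    q = dot(w, x)
    q_next = dot(w, x_next)
    delta = reward + gamma * q_next - q
    # Dutch trace: z <- gamma*lam*z + (1 - alpha*gamma*lam * z.x) * x
    scale = 1.0 - alpha * gamma * lam * dot(z, x)
    z = [gamma * lam * zi + scale * xi for zi, xi in zip(z, x)]
    # Weight update, including the true-online correction term.
    w = [wi + alpha * (delta + q - q_old) * zi - alpha * (q - q_old) * xi
         for wi, zi, xi in zip(w, z, x)]
    return w, z, q_next
```

In an episode loop, `z` and `q_old` would be reset to zeros at the start of each episode, and the returned triple fed back into the next call. A common way to get state-action features, assumed here rather than taken from the package, is to keep one copy of the state features per discrete action.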

# References

Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction, 2nd edition. MIT Press, 2018.

Release history

This version

0.1

Download files

Source Distribution

linear-rl-0.1.tar.gz (4.9 kB)

Uploaded Source

Built Distribution

linear_rl-0.1-py3-none-any.whl (6.2 kB)

Uploaded Python 3

File details

Details for the file linear-rl-0.1.tar.gz.

File metadata

  • Download URL: linear-rl-0.1.tar.gz
  • Upload date:
  • Size: 4.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.6

File hashes

Hashes for linear-rl-0.1.tar.gz

  • SHA256: 58f896c1900a3cd6564f9bebf4c97e53313a5f9505cb48e0d34799e22cc71985
  • MD5: 41890a3f5f993b9c0108516cf2fced15
  • BLAKE2b-256: c1df251a9c6d3cfafd0e69e251488f89b231644a2f546d2d9763c57dd22152d4

File details

Details for the file linear_rl-0.1-py3-none-any.whl.

File metadata

  • Download URL: linear_rl-0.1-py3-none-any.whl
  • Upload date:
  • Size: 6.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.6

File hashes

Hashes for linear_rl-0.1-py3-none-any.whl

  • SHA256: bf77cde2834a1fd6381aaa60c9be0a08afb5910cf9915da5a1fc38661940e769
  • MD5: 304591d5c090d3fa82bb8409d83d7096
  • BLAKE2b-256: f42d38cfe1465153b7e8fef21d2d4971a3e61c1cdbcc16fb845f4ee9d77b5d8b
