Skip to main content

This is a template repository for Python projects that use uv for their dependency management.

Project description

Samsara RL

This library provides a vectorized implementation and explainations of foundational reinforcement learning algorithms. Reinforcement learning deals with the science of decision making; a learning paradigm of repeated trial and error used in many areas such as: I. Robotic manipulation II. Fine tuning LLMs III. Financial portfolio management IV. Adaptive electric grids There is a close intersection but often lesser studied intersection between information theory and reinforcement learning that I try and highlight in the algorithm explainations. Rate distortion theory for example an area of information theory studies the exact problem of optimal control but through the lense of noisy compression. Compression and learning are two sides of the same problem studied from different lenses.

The algorithms and notes were based off my readings of the Sutton and Barto book, the david silver deep mind RL lectures, as well as numerous other papers and resources that I cite in appropriate sections where they show up.


Table of Contents

  1. Concept and Intuition
  2. Bellman Equations
  3. Planning Algorithms
  4. Optimal Substructure & Unknown Dynamics
  5. Noisy Estimators I: Monte Carlo
  6. Noisy Estimators II: TD Learning
  7. Balancing error with eligibility Traces
  8. Optimal Control with SARSA
  9. Optimal Control with Q Learning

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

samsara_rl-0.0.1.tar.gz (102.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

samsara_rl-0.0.1-py3-none-any.whl (7.9 kB view details)

Uploaded Python 3

File details

Details for the file samsara_rl-0.0.1.tar.gz.

File metadata

  • Download URL: samsara_rl-0.0.1.tar.gz
  • Upload date:
  • Size: 102.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for samsara_rl-0.0.1.tar.gz
Algorithm Hash digest
SHA256 6d410c7da41b775d0c7878fdde0eec64ff96cc2347c3e247cb4c5284f1bcc3d5
MD5 424b2beadae4d2f74db4458d7c710241
BLAKE2b-256 9d8ea257d820d7e654e127c4ca03698b9532af97b99d073ec1fc7376d36923c2

See more details on using hashes here.

File details

Details for the file samsara_rl-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: samsara_rl-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 7.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for samsara_rl-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 82df92728945a34fe254047dc037516f7812e499aa6021042dfae1f5b087ac20
MD5 fcd44d15dab0166a6786dda5b133fd4f
BLAKE2b-256 45a8dc0b29de24c93d7763a2022c99e1870180cb75e0afe7bf19ccc2168b75c8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page