Skip to main content

Reinforcement learning demos and utilities

Project description

Abijith RL

Small reinforcement learning demos built on gymnasium.

Install

pip install abijith-rl

Usage

import abijith_rl as rl

mc_df = rl.montecarlo(num_episodes=200)
print(mc_df.head())

td_df = rl.td_prediction(num_episodes=50)
print(td_df.head())

sarsa_df = rl.sarsa(num_episodes=200)
print(sarsa_df.head())

q_df = rl.q_learning(num_episodes=200)
print(q_df.head())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abijith_rl-1.0.1.tar.gz (6.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

abijith_rl-1.0.1-py3-none-any.whl (6.2 kB view details)

Uploaded Python 3

File details

Details for the file abijith_rl-1.0.1.tar.gz.

File metadata

  • Download URL: abijith_rl-1.0.1.tar.gz
  • Upload date:
  • Size: 6.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for abijith_rl-1.0.1.tar.gz
Algorithm Hash digest
SHA256 9dd1a60db7ece09f30ce7a8e82770a08976e60ee11e7b89045fbe054d8e3f98a
MD5 e6c66a7218bc615b85a4771aa5da1e29
BLAKE2b-256 fa500e2281e2951a4bb6be3d92612200626befdb7c5e0a8e535aab4fca95517e

See more details on using hashes here.

File details

Details for the file abijith_rl-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: abijith_rl-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 6.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for abijith_rl-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5a7ef5c0b0c8f89b26883481dc7335ff48a8d94a7fe46a6995620be99ed2c4c2
MD5 af0a4969356d7ac785bb81cf04e796d7
BLAKE2b-256 a50e0c421de8dfaca5a16af046ff851e0b1e430ff1971825f54cd7c64557d81f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page