Skip to main content

Reinforcement learning demos and utilities

Project description

Abijith RL

Small reinforcement learning demos built on gymnasium.

Install

pip install abijith-rl

Usage

import abijith_rl as rl

mc_df = rl.montecarlo_code(num_episodes=200)
print(mc_df.head())

# Prints the full demo code string
rl.montecarlo()

td_df = rl.td_prediction(num_episodes=50)
print(td_df.head())

sarsa_df = rl.sarsa(num_episodes=200)
print(sarsa_df.head())

q_df = rl.q_learning(num_episodes=200)
print(q_df.head())

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abijith_rl-1.0.4.tar.gz (11.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

abijith_rl-1.0.4-py3-none-any.whl (11.1 kB view details)

Uploaded Python 3

File details

Details for the file abijith_rl-1.0.4.tar.gz.

File metadata

  • Download URL: abijith_rl-1.0.4.tar.gz
  • Upload date:
  • Size: 11.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for abijith_rl-1.0.4.tar.gz
Algorithm Hash digest
SHA256 49f7fdcdeb919b7af26d25fbfb8551bf4963c276f21a24be0e077ab850e55f46
MD5 d749d19434353bfa722711652330d799
BLAKE2b-256 1eff6165f24f8d8454b92641bca83b666148c5063482bca6878d299910ad2bc3

See more details on using hashes here.

File details

Details for the file abijith_rl-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: abijith_rl-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 11.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for abijith_rl-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 e4c008a02fb45099793915a0222485cd332c32f6a107abbdeec9962d608ddc37
MD5 5459a33b17ceb2f51c1156d03583a5ab
BLAKE2b-256 9c30659d078ffdf042471cd7e066aed4ed3abe2359e5cb5a07507d279c577364

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page