Skip to main content

Reiforcement Learning package description.

Project description

Gradient-MC-1000-State-Random-Walk

Structure

  • Agent

The Agent represents the logic for reinforcement learning. It contains functions for policy evaluation & calculating a State's value.

  • Policy

The Policy object determines the actions taken by the agent. The Policy object must inherit from the Base Policy class.

  • Environment

The Environment represents the State that the Agent is in. It describes which states are accessible following actions & what their probabilities are.

The Environment depends on the State Space.

  • State Space

The State Space describes the states that the agent can be in. Each State is of type State.

The State Space depends on the State.

  • State

The State represents a singular State that the agent can be in. The State contains the information that describes each State.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

umbrellarl_ctorrington-0.0.0.tar.gz (9.3 kB view details)

Uploaded Source

Built Distribution

umbrellarl_ctorrington-0.0.0-py3-none-any.whl (15.3 kB view details)

Uploaded Python 3

File details

Details for the file umbrellarl_ctorrington-0.0.0.tar.gz.

File metadata

File hashes

Hashes for umbrellarl_ctorrington-0.0.0.tar.gz
Algorithm Hash digest
SHA256 9c776baed0b577e146b2505435ebcea82de8eb380dc716a78d2af9c98ccf159d
MD5 84c8e87c66d555ff7d54c6dc970e5d5b
BLAKE2b-256 659fa8318642e27b137bf269949896da3a07e84dac89601ba363b79659e15776

See more details on using hashes here.

File details

Details for the file umbrellarl_ctorrington-0.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for umbrellarl_ctorrington-0.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7491bc933c3aaed6a6048478879da5b364f607ba641db7c567bdb119fb555017
MD5 fee92bfb3f19c106578d751e1c0b3e8d
BLAKE2b-256 9e87e6d56fb27344173b078e7332511a4a1bec60a34cb9fef51cd0fc3921a531

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page