Gridworlds environments for OpenAI gym.
Project description
# OpenAI gym Gridworlds
Implementation of three gridworlds environments from book [Reinforcement Learning: An Introduction][book] compatible with [OpenAI gym][gym].
[gym]: https://github.com/openai/gym [book]: http://incompleteideas.net/book/the-book-2nd.html
## Usage
`bash $ import gym $ import gym_gridworlds $ env = gym.make('Gridworld-v0') # substitute environment's name `
## Gridworld-v0
Gridworld is simple 4 times 4 gridworld from example 4.1 in the [book]. There are fout action in each state (up, down, right, left) which deterministically cause the corresponding state transitions but actions that would take an agent of the grid leave a state unchanged. The reward is -1 for all tranistion until the terminal state is reached. The terminal state is in top left and bottom right coners.
## WindyGridworld-v0
Windy gridworld is from example 6.5 in the [book]. Windy gridworld is a standard gridworld as described above but there is a crosswind upward through the middle of the grid. Action are standard but in the middle region the resultant states are shifted upward by a wind which strength varies between columns.
## Cliff-v0
Cliff walking is a gridworld example 6.6 from the [book]. Again reward is -1 on all transition except those into region that is cliff. Stepping into this region incurs a reward of -100 and sends the agent instantly back to the start.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for gym_gridworlds-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | eedbdc696d3988fc621adf9aa7d83821d6cd465e3f24c041ceb1b132bf04c01c |
|
MD5 | 0c1ed8f04043207587525de547b654c6 |
|
BLAKE2b-256 | 57e84ae5cebd2d2e3cdcda6fcb9127c768ada6cec4de47fedf1cc039d80f98de |