A simple stochastic OpenAI environment for training RL agents
Project description
This repository contains a PIP package which is an OpenAI environment for simulating an enironment in which bananas get sold.
Installation
Install the OpenAI gym.
Then install this package via
pip install -e .
Usage
import gym
import gym_banana
env = gym.make('Banana-v0')
See https://github.com/matthiasplappert/keras-rl/tree/master/examples for some examples.
The Environment
Imagine you are selling bananas. One at a time. And the bananas get bad pretty quickly. Let's say in 3 days. The probability that I will sell the banana is given by
$$p(x) = (1+e)/(1. + e^(x+1))$$
where x-1 is my profit. This x-1 is my reward. If I don't sell the banana, the agent gets a reward of -1 (the price of the banana).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gym_banana-0.0.3.tar.gz.
File metadata
- Download URL: gym_banana-0.0.3.tar.gz
- Upload date:
- Size: 4.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a78405f287b9ae5872ab6514b8d013696318b4017d3a9b1c8f889ffdf17ccab0
|
|
| MD5 |
0ad8e34f62cd353b2bc109b8a0714d4f
|
|
| BLAKE2b-256 |
39d61722a6bfa526408570503c311f64de36eda776659f5eaa9b5b6824ea07e9
|
File details
Details for the file gym_banana-0.0.3-py3-none-any.whl.
File metadata
- Download URL: gym_banana-0.0.3-py3-none-any.whl
- Upload date:
- Size: 4.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
19aec1c3fa2b5d9b584a2bbdc68e4b2e2b2e2071a70497b222b667611212a688
|
|
| MD5 |
2ecaa6d3d1d61c7e3d0f7ba5c0ba1a87
|
|
| BLAKE2b-256 |
ce3f234dc47cfc3f15cc09008ea9ef4331e00703553ac81dbec3ab141f71c2f8
|