A simple stochastic OpenAI environment for training RL agents
Project description
This repository contains a PIP package which is an OpenAI environment for simulating an enironment in which bananas get sold.
Installation
Install the OpenAI gym.
Then install this package via
pip install -e .
Usage
import gym
import gym_banana
env = gym.make('Banana-v0')
See https://github.com/matthiasplappert/keras-rl/tree/master/examples for some examples.
The Environment
Imagine you are selling bananas. One at a time. And the bananas get bad pretty quickly. Let's say in 3 days. The probability that I will sell the banana is given by
$$p(x) = (1+e)/(1. + e^(x+1))$$
where x-1 is my profit. This x-1 is my reward. If I don't sell the banana, the agent gets a reward of -1 (the price of the banana).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for gym_banana-0.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 19aec1c3fa2b5d9b584a2bbdc68e4b2e2b2e2071a70497b222b667611212a688 |
|
MD5 | 2ecaa6d3d1d61c7e3d0f7ba5c0ba1a87 |
|
BLAKE2b-256 | ce3f234dc47cfc3f15cc09008ea9ef4331e00703553ac81dbec3ab141f71c2f8 |