A OpenAI Gym Env for continuous actions
Project description
Gym-style API
The domain features a continuos state and a dicrete action space.
The environment initializes:
- cross-sectional dataset with variables X_a, X_s, Y and N observations;
- logit model fitted on the dataset, retrieving parameters \theta_0, \theta_1, \theta_2;
The agent:
- sees a patient (sample observation);
- predict his risk of admission \rho, using initialized parameters
- if \rho < 1/2:
- do not intervene on X_a, which stays the same
- else:
- sample an action a in [0,1]
- compute g(a, X_a) = newX_a
- intervene on X_a by updating it to newX_a
- give reward equal to average risk of admission, using predicted Y, initial parameters and sampled values
(shouldn't I fit a new logit-link? parameters are now diff?)
To install
-
cd gym-contin
-
!pip install gym-contin
-
import gym
-
import gym_contin
-
env =gym.make('contin-v0')
To change version
- change version to, e.g., 1.0.7 from setup.py file
- git clone https://github.com/claudia-viaro/gym-contin.git
- cd gym-contin
- python setup.py sdist bdist_wheel
- twine check dist/*
- twine upload --repository-url https://upload.pypi.org/legacy/ dist/*
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
gym_contin-1.3.1.tar.gz
(4.8 kB
view hashes)
Built Distribution
Close
Hashes for gym_contin-1.3.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 09e0a12ea1b23abd2a00c43be76f7e7b0bed030194aa7ae208a88f3f02107a0a |
|
MD5 | 54d08596a58a5da4b3a85ee53c9c7e0b |
|
BLAKE2b-256 | f1c960170c6199e7ed94af72c4c02682e5f786ad705fcdb5b9c65a0dec7cee5d |