Industrial Benchmark for OpenAI Gym
Project description
Industrial Benchmark for Gym
gym-industrial
is a standalone Python re-implementation of the Industrial Benchmark (IB) for OpenAI Gym.
Installation
pip install gym-industrial
Environments
The IB's subdynamics have also been implemented as Gym environments.
System | environment id |
---|---|
Industrial Benchmark | IndustrialBenchmark-v0 |
Operational Cost | IBOperationalCost-v0 |
Mis-calibration | IBMisCalibration-v0 |
Fatigue | IBFatigue-v0 |
Subdynamics Stochastic Computation Graphs
The following are views of the Industrial Benchmark subdynamics, plus the reward function, as stochastic computation graphs (SCG).
The graph notation used and the SCG definition are taken from Gradient Estimation Using Stochastic Computation Graphs. Squares denote deterministic nodes and circles, stochastic nodes.
Definition 1 (Stochastic Computation Graph). A directed, acyclic graph, with three types of nodes:
- Input nodes, which are set externally, including the parameters we differentiate with respect to.
- Deterministic nodes, which are functions of their parents.
- Stochastic nodes, which are distributed conditionally on their parents. Each parent v ofa non-input node w is connected to it by a directed edge (v, w).
Node labels use the notation from the Industrial Benchmark paper.
Mis-calibration dynamics
Fatigue dynamics
Operational cost
Reward function
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.