Skip to main content

Gym: A universal API for reinforcement learning environments.

Project description

Gym

Gym is an open source Python library for developing and comparing reinforcement learning algorithms by providing a standard API to communicate between learning algorithms and environments, as well as a standard set of environments compliant with that API. Since its release, Gym's API has become the field standard for doing this.

Gym currently has two pieces of documentation: the documentation website and the FAQ. A new and more comprehensive documentation website is in the works.

Installation

To install the base Gym library, use pip install gym.

This does not include dependencies for all families of environments (there's a massive number, and some can be problematic to install on certain systems). You can install these dependencies for one family like pip install gym[atari] or use pip install gym[all] to install all dependencies.

We support Python 3.6, 3.7, 3.8 and 3.9 on Linux and macOS. We will accept PRs related to Windows, but do not officially support it.

API

The Gym API's API models environments as simple Python env classes. Creating environment instances and interacting with them is very simple- here's an example using the "CartPole-v1" environment:

import gym 
env = gym.make('CartPole-v1')

# env is created, now we can use it: 
for episode in range(10): 
    obs = env.reset()
    for step in range(50):
        action = env.action_space.sample()  # or given a custom model, action = policy(observation)
        nobs, reward, done, info = env.step(action)

Notable Related Libraries

  • Stable Baselines 3 is a learning library based on the Gym API. It is our recommendation for beginners who want to start learning things quickly.
  • RL Baselines3 Zoo builds upon SB3, containing optimal hyperparameters for Gym environments as well as code to easily find new ones. Such tuning is almost always required.
  • The Autonomous Learning Library and Tianshou are two reinforcement learning libraries I like that are generally geared towards more experienced users.
  • PettingZoo is like Gym, but for environments with multiple agents.

Environment Versioning

Gym keeps strict versioning for reproducibility reasons. All environments end in a suffix like "_v0". When changes are made to environments that might impact learning results, the number is increased by one to prevent potential confusion.

Citation

A whitepaper from when OpenAI Gym just came out is available https://arxiv.org/pdf/1606.01540, and can be cited with the following bibtex entry:

@misc{1606.01540,
  Author = {Greg Brockman and Vicki Cheung and Ludwig Pettersson and Jonas Schneider and John Schulman and Jie Tang and Wojciech Zaremba},
  Title = {OpenAI Gym},
  Year = {2016},
  Eprint = {arXiv:1606.01540},
}

Release Notes

There used to be release notes for all the new Gym versions here. New release notes are being moved to releases page on GitHub, like most other libraries do. Old notes can be viewed here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fragile_gym-1.21.1.tar.gz (1.6 MB view details)

Uploaded Source

Built Distribution

fragile_gym-1.21.1-py3-none-any.whl (1.6 MB view details)

Uploaded Python 3

File details

Details for the file fragile_gym-1.21.1.tar.gz.

File metadata

  • Download URL: fragile_gym-1.21.1.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for fragile_gym-1.21.1.tar.gz
Algorithm Hash digest
SHA256 ec9e8e3c199e1a667022b249f1a0382017807e78680e81b3a73033d1006b6d52
MD5 ec13af09ee289f8ca81d6c4a92876407
BLAKE2b-256 6b0dfe6a7c7cb2de43e5c2fbaa487e9961a033d698a033716d521677400a2ee3

See more details on using hashes here.

File details

Details for the file fragile_gym-1.21.1-py3-none-any.whl.

File metadata

File hashes

Hashes for fragile_gym-1.21.1-py3-none-any.whl
Algorithm Hash digest
SHA256 eea76cf59439526574b8da50e08b4bbb46b800d93ac2985519c45d371a0d90ac
MD5 9addcb255d88115c89aed4e725148c51
BLAKE2b-256 e0988f01472ad70a07a87077467a7794fe16df13ba42d868112deb24cfc73f01

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page