Super Mario Bros. for OpenAI Gym
gym-super-mario-bros
An OpenAI Gym environment for Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on the Nintendo Entertainment System (NES) using the nes-py emulator.
Installation
The preferred installation of gym-super-mario-bros is from pip:
pip install gym-super-mario-bros
Usage
Python
You must import gym_super_mario_bros before trying to make an environment. This is because gym environments are registered at runtime. By default, gym_super_mario_bros environments use the full NES action space of 256 discrete actions. To constrain this, gym_super_mario_bros.actions provides three action lists (RIGHT_ONLY, SIMPLE_MOVEMENT, and COMPLEX_MOVEMENT) for the nes_py.wrappers.BinarySpaceToDiscreteSpaceEnv wrapper. See gym_super_mario_bros/actions.py for a breakdown of the legal actions in each of these three lists.
from nes_py.wrappers import BinarySpaceToDiscreteSpaceEnv
import gym_super_mario_bros
from gym_super_mario_bros.actions import SIMPLE_MOVEMENT

# importing gym_super_mario_bros above registers the environments
env = gym_super_mario_bros.make('SuperMarioBros-v0')
# restrict the 256-action space to the SIMPLE_MOVEMENT list
env = BinarySpaceToDiscreteSpaceEnv(env, SIMPLE_MOVEMENT)

done = True
for step in range(5000):
    # reset at the start and whenever an episode ends
    if done:
        state = env.reset()
    state, reward, done, info = env.step(env.action_space.sample())
    env.render()

env.close()
NOTE: gym_super_mario_bros.make is just an alias to gym.make for convenience.
NOTE: remove calls to render in training code for a nontrivial speedup.
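The figure of 256 actions follows from the NES controller having 8 buttons, each either pressed or released, which gives 2^8 combinations. The sketch below illustrates how a list of button combinations can stand in for a much smaller discrete action space. The bit assigned to each button, the `encode` helper, and the `EXAMPLE_ACTIONS` list are assumptions made for illustration; they are not copied from nes-py or from gym_super_mario_bros/actions.py.

```python
# Hypothetical bit layout: one bit per NES controller button (assumed order).
BUTTON_BITS = {
    'right': 0b10000000,
    'left': 0b01000000,
    'down': 0b00100000,
    'up': 0b00010000,
    'start': 0b00001000,
    'select': 0b00000100,
    'B': 0b00000010,
    'A': 0b00000001,
    'NOOP': 0b00000000,
}


def encode(buttons):
    """Combine a list of button names into a single byte (0..255)."""
    byte = 0
    for name in buttons:
        byte |= BUTTON_BITS[name]
    return byte


# Illustrative action list in the spirit of RIGHT_ONLY (contents assumed).
EXAMPLE_ACTIONS = [
    ['NOOP'],
    ['right'],
    ['right', 'A'],
    ['right', 'B'],
    ['right', 'A', 'B'],
]

# Discrete action index -> controller byte sent to the emulator.
ACTION_BYTES = [encode(buttons) for buttons in EXAMPLE_ACTIONS]

# 8 buttons, each pressed or released.
FULL_SPACE_SIZE = 2 ** len([b for b in BUTTON_BITS if b != 'NOOP'])

print(FULL_SPACE_SIZE, len(ACTION_BYTES))
```

With a list like this, an agent chooses among 5 indices instead of 256 raw button bytes, which is the kind of reduction the action lists are meant to provide.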
Command Line
gym_super_mario_bros features a command line interface for playing environments using either the keyboard or uniform random movement.
gym_super_mario_bros -e <the environment ID to play> -m <`human` or `random`>
NOTE: by default, -e is set to SuperMarioBros-v0 and -m is set to human.
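To make the defaults concrete, here is a minimal argparse sketch that mirrors the two options described above. It is a hypothetical stand-in, not the package's actual command line code: the `parse_cli` function and the `env` and `mode` attribute names are assumptions, and only the `-e` and `-m` flags and their defaults come from the text.

```python
import argparse


def parse_cli(argv):
    """Parse -e and -m with the defaults described in the text (sketch only)."""
    parser = argparse.ArgumentParser(prog='gym_super_mario_bros')
    parser.add_argument('-e', dest='env', default='SuperMarioBros-v0',
                        help='the environment ID to play')
    parser.add_argument('-m', dest='mode', default='human',
                        choices=['human', 'random'],
                        help='keyboard play or uniform random movement')
    return parser.parse_args(argv)


# No arguments: both options fall back to their defaults.
defaults = parse_cli([])
# Explicit arguments: a single level on the downsampled ROM, random movement.
custom = parse_cli(['-e', 'SuperMarioBros-4-2-v1', '-m', 'random'])

print(defaults.env, defaults.mode)
print(custom.env, custom.mode)
```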
Environments
These environments allow 3 attempts (lives) to make it through the 32 levels of the game. The environments only send reward-able game-play frames to agents; no cut-scenes, loading screens, etc. are sent from the NES emulator to an agent, nor can an agent perform actions during these occurrences. If a cut-scene cannot be skipped by hacking the NES's RAM, the environment will lock the Python process until the emulator is ready for the next action.
Environment | Game | Frameskip | ROM |
---|---|---|---|
SuperMarioBros-v0 | SMB | 4 | standard |
SuperMarioBros-v1 | SMB | 4 | downsample |
SuperMarioBros-v2 | SMB | 4 | pixel |
SuperMarioBros-v3 | SMB | 4 | rectangle |
SuperMarioBrosNoFrameskip-v0 | SMB | 1 | standard |
SuperMarioBrosNoFrameskip-v1 | SMB | 1 | downsample |
SuperMarioBrosNoFrameskip-v2 | SMB | 1 | pixel |
SuperMarioBrosNoFrameskip-v3 | SMB | 1 | rectangle |
SuperMarioBros2-v0 | SMB2 | 4 | standard |
SuperMarioBros2-v1 | SMB2 | 4 | downsample |
SuperMarioBros2NoFrameskip-v0 | SMB2 | 1 | standard |
SuperMarioBros2NoFrameskip-v1 | SMB2 | 1 | downsample |
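A frameskip of 4 is commonly understood to mean that each action chosen by the agent is held for 4 emulator frames, while a frameskip of 1 means one frame per action. The sketch below shows that general idea on a toy environment. `CountingEnv` and `step_with_frameskip` are hypothetical names, and summing the per-frame rewards is an assumption based on the usual convention; the text here does not say how this package accumulates rewards.

```python
class CountingEnv:
    """Toy stand-in for an emulator: every frame yields a reward of 1."""

    def __init__(self, episode_length):
        self.episode_length = episode_length
        self.frames = 0

    def step(self, action):
        self.frames += 1
        done = self.frames >= self.episode_length
        # The "state" is just the frame counter in this toy example.
        return self.frames, 1.0, done, {}


def step_with_frameskip(env, action, frameskip):
    """Repeat `action` for up to `frameskip` frames and sum the rewards."""
    total_reward = 0.0
    for _ in range(frameskip):
        state, reward, done, info = env.step(action)
        total_reward += reward
        if done:
            # Stop early so no frames are requested after the episode ends.
            break
    return state, total_reward, done, info


env = CountingEnv(episode_length=10)
state, reward, done, info = step_with_frameskip(env, action=0, frameskip=4)
print(state, reward, done)
```

Under these assumptions the agent sees one observation per 4 emulated frames, which is why the NoFrameskip variants give finer control at the cost of more steps per episode.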
Individual Levels
These environments allow a single attempt (life) to make it through a single level of the game.
Use the template SuperMarioBros-<world>-<level>-v<version> where:
- <world> is a number in {1, 2, 3, 4, 5, 6, 7, 8} indicating the world
- <level> is a number in {1, 2, 3, 4} indicating the level within a world
- <version> is a number in {0, 1, 2, 3} specifying the ROM mode to use
  - 0: standard ROM
  - 1: downsampled ROM
  - 2: pixel ROM
  - 3: rectangle ROM
- NoFrameskip can be added before the first hyphen to disable frame skip
For example, to play 4-2 on the downsampled ROM, you would use the environment id SuperMarioBros-4-2-v1. To disable frame skip you would use SuperMarioBrosNoFrameskip-4-2-v1.
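The template above can be expressed as a small helper that validates its inputs and builds the environment id. This is a sketch only: `level_env_id` and its `frameskip` keyword are hypothetical names that are not part of gym_super_mario_bros, and the valid ranges are taken from the list above.

```python
def level_env_id(world, level, version=0, frameskip=True):
    """Build an individual-level environment id from the documented template."""
    if world not in range(1, 9):
        raise ValueError('world must be in {1, ..., 8}')
    if level not in range(1, 5):
        raise ValueError('level must be in {1, 2, 3, 4}')
    if version not in range(0, 4):
        raise ValueError('version must be in {0, 1, 2, 3}')
    # NoFrameskip goes before the first hyphen.
    prefix = 'SuperMarioBros' if frameskip else 'SuperMarioBrosNoFrameskip'
    return '{}-{}-{}-v{}'.format(prefix, world, level, version)


# Every id the template can produce: 8 worlds x 4 levels x 4 ROMs x 2 modes.
ALL_LEVEL_IDS = [
    level_env_id(world, level, version, frameskip)
    for world in range(1, 9)
    for level in range(1, 5)
    for version in range(0, 4)
    for frameskip in (True, False)
]

print(level_env_id(4, 2, version=1))
print(level_env_id(4, 2, version=1, frameskip=False))
```

The resulting string can then be passed to gym_super_mario_bros.make in place of 'SuperMarioBros-v0'.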
Citation
Please cite gym-super-mario-bros if you use it in your research.
@misc{gym-super-mario-bros,
author = {Christian Kauten},
title = {{S}uper {M}ario {B}ros for {O}pen{AI} {G}ym},
year = {2018},
publisher = {GitHub},
howpublished = {\url{https://github.com/Kautenja/gym-super-mario-bros}},
}