
Contra for OpenAI Gym

Project description

Gym for Contra


An OpenAI Gym environment for Contra on the Nintendo Entertainment System (NES) using the nes-py emulator.


The preferred installation of gym-contra is from pip:

pip install gym-contra



You must import ContraEnv before trying to make an environment, because gym environments are registered at runtime. By default, ContraEnv uses the full NES action space of 256 discrete actions. To constrain this, ContraEnv.actions provides three action lists (RIGHT_ONLY, SIMPLE_MOVEMENT, and COMPLEX_MOVEMENT) for the nes_py.wrappers.JoypadSpace wrapper. See Contra/ for a breakdown of the legal actions in each of these three lists.

from nes_py.wrappers import JoypadSpace
import gym
from Contra.actions import RIGHT_ONLY  # one of the three action lists; module path may differ by version

env = gym.make('Contra-v0')
env = JoypadSpace(env, RIGHT_ONLY)

print("actions", env.action_space)
print("observation_space ", env.observation_space.shape[0])

done = True
for step in range(5000):
    if done:
        state = env.reset()
    state, reward, done, info = env.step(env.action_space.sample())
    env.render()

env.close()


NOTE: ContraEnv.make is just an alias to gym.make for convenience.

NOTE: remove calls to render in training code for a nontrivial speedup.

Command Line

(Command-line usage documentation is still being written.)

NOTE: by default, -m is set to human.


These environments allow 3 attempts (lives) per game. The environments only send reward-able gameplay frames to agents; no cut-scenes, loading screens, etc. are sent from the NES emulator to an agent, nor can an agent perform actions during these instances. If a cut-scene cannot be skipped by hacking the NES's RAM, the environment will lock the Python process until the emulator is ready for the next action.


The following sections describe the reward and the info dictionary returned by the step method.

Reward Function

The reward function assumes the objective of the game is to move as far right as possible (increase the agent's x value), as fast as possible, without dying. To model this game, three separate variables compose the reward:

  1. v: the difference in agent x values between states
    • in this case this is the instantaneous velocity for the given step
    • v = x1 - x0
      • x0 is the x position before the step
      • x1 is the x position after the step
    • moving right ⇔ v > 0
    • moving left ⇔ v < 0
    • not moving ⇔ v = 0
  2. d: a death penalty that penalizes the agent for dying in a state
    • this penalty encourages the agent to avoid death
    • alive ⇔ d = 0
    • dead ⇔ d = -15
  3. b: a bonus awarded when the agent defeats the boss
    • this reward encourages the agent to defeat the boss
    • boss not defeated ⇔ b = 0
    • boss defeated ⇔ b = 30

So the reward function is:

r = v + d + b

Note: The reward is clipped into the range (-15, 15).
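The reward function above can be sketched as a small helper. This is an illustrative reconstruction, not the package's actual implementation: x0, x1, died, and boss_defeated would come from the emulator's RAM, but here they are plain parameters.

```python
def contra_reward(x0, x1, died, boss_defeated):
    """Sketch of r = v + d + b, clipped to [-15, 15]."""
    v = x1 - x0                      # instantaneous x-velocity for the step
    d = -15 if died else 0           # death penalty
    b = 30 if boss_defeated else 0   # boss-defeat bonus
    r = v + d + b
    return max(-15, min(15, r))      # clip into the range (-15, 15)
```

For example, moving 2 pixels right while alive yields a reward of 2, while defeating the boss saturates the clipped reward at 15.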

info dictionary

The info dictionary returned by the step method contains the following keys:

| Key      | Type | Description                                              |
|----------|------|----------------------------------------------------------|
| life     | int  | The number of lives left, i.e., {3, 2, 1}                |
| dead     | bool | Whether the player is dead                               |
| done     | bool | Whether the game is over                                 |
| score    | int  | The player's score                                       |
| status   | int  | The player's status (00 = dead, 01 = alive, 02 = dying)  |
| x_pos    | int  | Player's x position in the stage (from the left)         |
| y_pos    | int  | Player's y position in the stage (from the bottom)       |
| defeated | bool | Whether the boss has been defeated                       |
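An agent can read these keys after each call to step. The dictionary below is illustrative (the values are made up, not from a real emulator run), but the keys match the table above:

```python
# Example info dictionary as described above (values are illustrative).
info = {
    'life': 3,
    'dead': False,
    'done': False,
    'score': 1200,
    'status': 1,       # 00 = dead, 01 = alive, 02 = dying
    'x_pos': 512,
    'y_pos': 80,
    'defeated': False,
}

# Typical reads an agent might perform:
is_alive = info['status'] == 1 and not info['dead']
progress = info['x_pos']   # distance from the left edge of the stage
```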

Project details

Files for gym-contra, version 0.1.1: gym_contra-0.1.1.tar.gz (100.6 kB, source distribution).
