Skip to main content

Python simulation of a driving challange. Compatible with the Gymnasium API standard.

Project description

ProcGrid Traffic Gym (pgtg)

A driving simulation on a grid with procedural generated maps and traffic. Compatible with the Gymnasium API standard.

Installation

pip install pgtg

Usage

The easiest way to use pgtg is to create the environment with gymnasium:

import pgtg
env = gymnasium.make("pgtg-v2")

The package relies on import side-effects to register the environment name so, even though the package is never explicitly used, its import is necessary to access the environment.

If you want to access the environment constructor directly this is also possible:

from pgtg import PGTGEnv
env = PGTGEnv()

Environment

ProcGrid Traffic Gym procedurally generates a map consisting of multiple preconstructed tiles or loads a map from a file. The goal is to drive from the start of the map to the end. The navigation task is not part of this environment, instead a shortest path is provided and marked on the map.

The environment is highly customizable, see the environment constructor for more info.

Action Space

ProcGrid Traffic Gym has a Discrete(9) action space.

Value Meaning
0 accelerate left and up
1 accelerate left
2 accelerate left and down
3 accelerate up
4 don't accelerate
5 accelerate down
6 accelerate right and up
7 accelerate right
8 accelerate right and down

Observation Space

ProcGrid Traffic Gym has a Dict observation space that shows the 9x9 area the agent currently is inside or, if a sliding observation window is used, a area of the specified size centered on the agent.

Key Type Explanation
"position" MultiDiscrete The x and y position of the agent within the observation window or, if a sliding observation window is used, always (0, 0).
"velocity" Box The velocity of the agent in x and y direction.
"map" Dict The current observation window. The keys are the name of the features ("walls", "goals", "ice", "broken road", "sand", and "traffic"). Each item is a MultiBinary that encodes that feature as a hot one encoding.
"next_subgoal_direction" Discrete(5) The direction of the next subgoal or -1 if there is no next subgoal (most likely because the agent took a wrong turn). This is disabled by default. It can be enabled with the use_next_subgoal_direction argument of the environment constructor.

Most reinforcement learning implementations can't deal with Dict observations directly, thus it might be necessary to flatten the observations. This is easily doable with the gymnasium.wrappers.FlattenObservation wrapper:

from gymnasium.wrappers import FlattenObservation
env = FlattenObservation(env)

Reward

Crossing a subgoal is rewarded with +100 / number of subgoals as is finishing the whole map. Moving into a wall or traffic is punished with -100 and ends the episode. Standing still or moving to a already visited position can also penalized but is not per default. The reward values for each of this can be customized.

Render modes

Name Explanation
human render() returns None but a pygame window showing the environment is opened automatically when step() is called.
rgb_array render() returns a np.array representing a image.
pil_image render() returns a PIL.Image.Image, useful for displaying inside jupiter notebooks.

Obstacles

Name Effect
Ice When driving over a square with ice, there is a chance the agent moves in a random direction instead of the expected one.
Sand When driving over sand, there is a chance that the agent is slowed, as the velocity is reset to 0.
Broken road When driving over broken road, there is a chance for the agent to get a flat tire. This slows the agent down, as the velocity is reset to 0 every step. A flat tire lasts until the end of the episode.

Version History

v0.1.0

  • initial release

v0.2.0

  • Sand now slows down with a customizable probability (default 20%) instead of always.
  • Bump environment version to v1 because the changes impact reproducibility with earlier versions.

v0.3.0

  • The x and y coordinates of observations are no longer swapped. This was the case for historical reasons but serves no use any more.
  • Adds the option to use a sliding observation window of variable size.
  • Adds the option to use the direction of the next subgoal as a additional observation.
  • Bump environment version to v2 because the changes impact reproducibility with earlier versions.

v0.3.0

  • Fix bug that made it impossible to save a map as a file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pgtg-0.3.1.tar.gz (51.6 kB view details)

Uploaded Source

Built Distribution

pgtg-0.3.1-py3-none-any.whl (56.6 kB view details)

Uploaded Python 3

File details

Details for the file pgtg-0.3.1.tar.gz.

File metadata

  • Download URL: pgtg-0.3.1.tar.gz
  • Upload date:
  • Size: 51.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.10.0 Windows/10

File hashes

Hashes for pgtg-0.3.1.tar.gz
Algorithm Hash digest
SHA256 61715e688d1eb6bfcaf6e7082f91360d6cb3aa0647b25a7e6d74900d163f495a
MD5 97ba0a081e828d9082e1fedf2cbbe84c
BLAKE2b-256 018b7c7bc12eb0ce60ff49939da4f22677a107362ec9718a3b6050c23340816e

See more details on using hashes here.

File details

Details for the file pgtg-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: pgtg-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 56.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.10.0 Windows/10

File hashes

Hashes for pgtg-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0022e4234ae4effd3a688a8d79262a03e2bbeacc170476fb553f0ce359422dbc
MD5 a7e34c284ac9d44be6aac74a04614b10
BLAKE2b-256 4ac52a9da15d72507bd3c0e668137572e8248504495cb18c9f76ccd721331fee

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page