Skip to main content

Multi-room maze environments from the DrStrategy paper. Provides NineRooms-v0, SpiralNineRooms-v0, and TwentyFiveRooms-v0 gymnasium environments.

Project description

MiniWorld DrStrategy - Multi-Room Maze Environment

A refactored implementation of Dr. Strategy's MiniWorld-based maze environments with updated dependencies and modern Python packaging. Based on the now-deprecated MiniWorld project and the original DrStrategy implementation.

Environment Observations

Environment Views

Full environment layout and render-on-position views:

Full Environment Partial Top-Down Observations Partial First-Person Observations
Full View Clean Top Middle TD Center TD Top Middle FP Center FP

Installation

pip install miniworld-maze

Usage

Registered Environments

This package registers the following gymnasium environments:

Environment ID Description Rooms Max Steps
NineRooms-v0 Standard 3×3 grid with adjacent room connections 9 1000
SpiralNineRooms-v0 3×3 grid with spiral connection pattern 9 1000
TwentyFiveRooms-v0 Large 5×5 grid with complex navigation 25 1000

All environments use TOP_DOWN_PARTIAL observation level and factory default room/door sizes by default.

Basic Usage

See examples/basic_usage.py for a complete working example:

#!/usr/bin/env python3
"""
Basic usage example for miniworld-maze environments.

This is a minimal example showing how to create and interact with the environment.
"""

import gymnasium as gym
import miniworld_maze  # noqa: F401


def main():
    # Create environment using gymnasium registry
    env = gym.make("NineRooms-v0", obs_width=64, obs_height=64)
    obs, info = env.reset()

    # obs is a dictionary containing:
    # - 'observation': (64, 64, 3) RGB image array
    # - 'desired_goal': (64, 64, 3) RGB image of the goal state
    # - 'achieved_goal': (64, 64, 3) RGB image of the current state

    # Take a few random actions
    for step in range(10):
        action = env.action_space.sample()
        obs, reward, terminated, truncated, info = env.step(action)

        print(f"Step {step + 1}: reward={reward:.3f}, terminated={terminated}")

        if terminated or truncated:
            obs, info = env.reset()

    env.close()
    print("Environment closed successfully!")


if __name__ == "__main__":
    main()

Headless Environments

When running in headless environments (servers, CI/CD, Docker containers) or when encountering X11/OpenGL context issues, you need to enable headless rendering:

# Set environment variable before running Python
export PYGLET_HEADLESS=1
python your_script.py

Or in your Python code (must be set before importing the library):

import os
os.environ['PYGLET_HEADLESS'] = '1'

import miniworld_maze
# ... rest of your code

This configures the underlying pyglet library to use EGL rendering instead of X11, allowing the environments to run without a display server.

Environment Variants

Available Environments

The package provides three main environment variants, each with different room layouts and connection patterns:

1. NineRooms (3×3 Grid)

-------------
| 0 | 1 | 2 |
-------------
| 3 | 4 | 5 |
-------------
| 6 | 7 | 8 |
-------------

A standard 3×3 grid where adjacent rooms are connected. The agent can navigate between rooms through doorways, with connections forming a fully connected grid pattern.

2. SpiralNineRooms (3×3 Spiral Pattern)

-------------
| 0 | 1 | 2 |
-------------
| 3 | 4 | 5 |
-------------
| 6 | 7 | 8 |
-------------

Same room layout as NineRooms but with a spiral connection pattern. Only specific room pairs are connected, creating a more challenging navigation task with fewer available paths.

3. TwentyFiveRooms (5×5 Grid)

---------------------
| 0 | 1 | 2 | 3 | 4 |
---------------------
| 5 | 6 | 7 | 8 | 9 |
---------------------
|10 |11 |12 |13 |14 |
---------------------
|15 |16 |17 |18 |19 |
---------------------
|20 |21 |22 |23 |24 |
---------------------

A larger 5×5 grid environment with 25 rooms, providing more complex navigation challenges and longer episode lengths.

Observation Types

Each environment supports three different observation modes:

  • TOP_DOWN_PARTIAL (default): Agent-centered partial top-down view with limited visibility range (POMDP)
  • TOP_DOWN_FULL: Complete top-down view showing the entire environment
  • FIRST_PERSON: 3D first-person perspective view from the agent's current position

Action Space

  • Discrete Actions (default): 7 discrete actions (turn left/right, move forward/backward, strafe left/right, no-op)
  • Continuous Actions: Continuous control with continuous=True parameter

Environment Configuration

All environments can be customized with the following parameters:

import gymnasium as gym
from miniworld_maze import ObservationLevel
import miniworld_maze  # noqa: F401

env = gym.make(
    "NineRooms-v0",                        # Environment variant
    obs_level=ObservationLevel.TOP_DOWN_PARTIAL,  # Observation type
    obs_width=64,                          # Observation image width
    obs_height=64,                         # Observation image height
    room_size=5,                           # Size of each room in environment units
    door_size=2,                           # Size of doors between rooms  
    agent_mode="empty",                    # Agent rendering: "empty", "circle", "triangle"
)

Observation Format

The environment returns observations in dictionary format:

obs = {
    'observation': np.ndarray,    # (64, 64, 3) RGB image of current view
    'desired_goal': np.ndarray,   # (64, 64, 3) RGB image of goal location
    'achieved_goal': np.ndarray,  # (64, 64, 3) RGB image of current state
}

Reward Structure

  • Goal reaching: Positive reward when agent reaches the goal location
  • Step penalty: Small negative reward per step to encourage efficiency
  • Episode termination: When goal is reached or maximum steps exceeded

License

MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

miniworld_maze-1.2.0.tar.gz (32.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

miniworld_maze-1.2.0-py3-none-any.whl (32.7 MB view details)

Uploaded Python 3

File details

Details for the file miniworld_maze-1.2.0.tar.gz.

File metadata

  • Download URL: miniworld_maze-1.2.0.tar.gz
  • Upload date:
  • Size: 32.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for miniworld_maze-1.2.0.tar.gz
Algorithm Hash digest
SHA256 696ba37db8bf48f4fddb49cd55563eb5f494925a5d25d2e5fba163e97ee1d8e7
MD5 c4658f6f5588c67ede2ee37dc4cc45aa
BLAKE2b-256 165a3cbc9d3bbb9459b1dae8f2bcff9be83c67d057895113e52b1d4e88ae9dd4

See more details on using hashes here.

Provenance

The following attestation bundles were made for miniworld_maze-1.2.0.tar.gz:

Publisher: publish.yml on mctigger/miniworld-maze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file miniworld_maze-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: miniworld_maze-1.2.0-py3-none-any.whl
  • Upload date:
  • Size: 32.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for miniworld_maze-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d6030c5125e0cab1a82616be6b3658f1971f9709ac03fc98420f4dc3add2861c
MD5 e5d7b70a3bb71906d8e59b42d5729349
BLAKE2b-256 cf47bc5410908974b3a1194687d3980b99170780d251afdcbed9547773997fd1

See more details on using hashes here.

Provenance

The following attestation bundles were made for miniworld_maze-1.2.0-py3-none-any.whl:

Publisher: publish.yml on mctigger/miniworld-maze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page