Skip to main content

Multi-room maze environments from the DrStrategy paper. Provides NineRooms-v0, SpiralNineRooms-v0, and TwentyFiveRooms-v0 gymnasium environments.

Project description

MiniWorld DrStrategy - Multi-Room Maze Environment

A refactored implementation of Dr. Strategy's MiniWorld-based maze environments with updated dependencies and modern Python packaging. Based on the now-deprecated MiniWorld project and the original DrStrategy implementation.

Environment Observations

Environment Views

Full environment layout and render-on-position views:

Full Environment Partial Top-Down Observations Partial First-Person Observations
Full View Clean Top Middle TD Center TD Top Middle FP Center FP

Installation

pip install miniworld-maze

Usage

Registered Environments

This package registers the following gymnasium environments:

Environment ID Description Rooms Max Steps
NineRooms-v0 Standard 3×3 grid with adjacent room connections 9 1000
SpiralNineRooms-v0 3×3 grid with spiral connection pattern 9 1000
TwentyFiveRooms-v0 Large 5×5 grid with complex navigation 25 1000

All environments use TOP_DOWN_PARTIAL observation level and factory default room/door sizes by default.

Basic Usage

See examples/basic_usage.py for a complete working example:

#!/usr/bin/env python3
"""
Basic usage example for miniworld-maze environments.

This is a minimal example showing how to create and interact with the environment.
"""

import gymnasium as gym
import miniworld_maze  # noqa: F401


def main():
    # Create environment using gymnasium registry
    env = gym.make("NineRooms-v0", obs_width=64, obs_height=64)
    obs, info = env.reset()

    # obs is a dictionary containing:
    # - 'observation': (64, 64, 3) RGB image array
    # - 'desired_goal': (64, 64, 3) RGB image of the goal state
    # - 'achieved_goal': (64, 64, 3) RGB image of the current state

    # Take a few random actions
    for step in range(10):
        action = env.action_space.sample()
        obs, reward, terminated, truncated, info = env.step(action)

        print(f"Step {step + 1}: reward={reward:.3f}, terminated={terminated}")

        if terminated or truncated:
            obs, info = env.reset()

    env.close()
    print("Environment closed successfully!")


if __name__ == "__main__":
    main()

Headless Environments

When running in headless environments (servers, CI/CD, Docker containers) or when encountering X11/OpenGL context issues, you need to enable headless rendering:

# Set environment variable before running Python
export PYGLET_HEADLESS=1
python your_script.py

Or in your Python code (must be set before importing the library):

import os
os.environ['PYGLET_HEADLESS'] = '1'

import miniworld_maze
# ... rest of your code

This configures the underlying pyglet library to use EGL rendering instead of X11, allowing the environments to run without a display server.

Environment Variants

Available Environments

The package provides three main environment variants, each with different room layouts and connection patterns:

1. NineRooms (3×3 Grid)

-------------
| 0 | 1 | 2 |
-------------
| 3 | 4 | 5 |
-------------
| 6 | 7 | 8 |
-------------

A standard 3×3 grid where adjacent rooms are connected. The agent can navigate between rooms through doorways, with connections forming a fully connected grid pattern.

2. SpiralNineRooms (3×3 Spiral Pattern)

-------------
| 0 | 1 | 2 |
-------------
| 3 | 4 | 5 |
-------------
| 6 | 7 | 8 |
-------------

Same room layout as NineRooms but with a spiral connection pattern. Only specific room pairs are connected, creating a more challenging navigation task with fewer available paths.

3. TwentyFiveRooms (5×5 Grid)

---------------------
| 0 | 1 | 2 | 3 | 4 |
---------------------
| 5 | 6 | 7 | 8 | 9 |
---------------------
|10 |11 |12 |13 |14 |
---------------------
|15 |16 |17 |18 |19 |
---------------------
|20 |21 |22 |23 |24 |
---------------------

A larger 5×5 grid environment with 25 rooms, providing more complex navigation challenges and longer episode lengths.

Observation Types

Each environment supports three different observation modes:

  • TOP_DOWN_PARTIAL (default): Agent-centered partial top-down view with limited visibility range (POMDP)
  • TOP_DOWN_FULL: Complete top-down view showing the entire environment
  • FIRST_PERSON: 3D first-person perspective view from the agent's current position

Action Space

  • Discrete Actions (default): 7 discrete actions (turn left/right, move forward/backward, strafe left/right, no-op)
  • Continuous Actions: Continuous control with continuous=True parameter

Environment Configuration

All environments can be customized with the following parameters:

import gymnasium as gym
from miniworld_maze import ObservationLevel
import miniworld_maze  # noqa: F401

env = gym.make(
    "NineRooms-v0",                        # Environment variant
    obs_level=ObservationLevel.TOP_DOWN_PARTIAL,  # Observation type
    obs_width=64,                          # Observation image width
    obs_height=64,                         # Observation image height
    room_size=5,                           # Size of each room in environment units
    door_size=2,                           # Size of doors between rooms  
    agent_mode="empty",                    # Agent rendering: "empty", "circle", "triangle"
)

Observation Format

The environment returns observations in dictionary format:

obs = {
    'observation': np.ndarray,    # (64, 64, 3) RGB image of current view
    'desired_goal': np.ndarray,   # (64, 64, 3) RGB image of goal location
    'achieved_goal': np.ndarray,  # (64, 64, 3) RGB image of current state
}

Reward Structure

  • Goal reaching: Positive reward when agent reaches the goal location
  • Step penalty: Small negative reward per step to encourage efficiency
  • Episode termination: When goal is reached or maximum steps exceeded

License

MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

miniworld_maze-1.5.2.tar.gz (32.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

miniworld_maze-1.5.2-py3-none-any.whl (32.7 MB view details)

Uploaded Python 3

File details

Details for the file miniworld_maze-1.5.2.tar.gz.

File metadata

  • Download URL: miniworld_maze-1.5.2.tar.gz
  • Upload date:
  • Size: 32.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for miniworld_maze-1.5.2.tar.gz
Algorithm Hash digest
SHA256 7f6aa383a4cba7ff896da5aadc84c821d3fe464f7d0d0892ae89768e8a92ae34
MD5 ee835fdc562c9ac1417624f30c3b0924
BLAKE2b-256 e9048017e23a9f8853b7df21e4963ed2b1c2025e7057edd7fa74eff4e368fefd

See more details on using hashes here.

Provenance

The following attestation bundles were made for miniworld_maze-1.5.2.tar.gz:

Publisher: publish.yml on mctigger/miniworld-maze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file miniworld_maze-1.5.2-py3-none-any.whl.

File metadata

  • Download URL: miniworld_maze-1.5.2-py3-none-any.whl
  • Upload date:
  • Size: 32.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for miniworld_maze-1.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 97aae152902e1826f03e292d9b4bfc2d691d74dfd883a17cf3f8ac5890d54fe8
MD5 9d07dce5d824b66c8a94baba78e51197
BLAKE2b-256 482a034716f2f895f4e81fd00b71642bebd18b148e8514d3238a4c7292f9961e

See more details on using hashes here.

Provenance

The following attestation bundles were made for miniworld_maze-1.5.2-py3-none-any.whl:

Publisher: publish.yml on mctigger/miniworld-maze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page