Skip to main content

Multi-room maze environments from the DrStrategy paper. Provides NineRooms-v0, SpiralNineRooms-v0, and TwentyFiveRooms-v0 gymnasium environments.

Project description

MiniWorld DrStrategy - Multi-Room Maze Environment

A refactored implementation of Dr. Strategy's MiniWorld-based maze environments with updated dependencies and modern Python packaging. Based on the now-deprecated MiniWorld project and the original DrStrategy implementation.

Environment Observations

Environment Views

Full environment layout and render-on-position views:

Full Environment Partial Top-Down Observations Partial First-Person Observations
Full View Clean Top Middle TD Center TD Top Middle FP Center FP

Installation

pip install miniworld-maze

Usage

Registered Environments

This package registers the following gymnasium environments:

Environment ID Description Rooms Max Steps
NineRooms-v0 Standard 3×3 grid with adjacent room connections 9 1000
SpiralNineRooms-v0 3×3 grid with spiral connection pattern 9 1000
TwentyFiveRooms-v0 Large 5×5 grid with complex navigation 25 1000

All environments use TOP_DOWN_PARTIAL observation level and factory default room/door sizes by default.

Basic Usage

See examples/basic_usage.py for a complete working example:

#!/usr/bin/env python3
"""
Basic usage example for miniworld-maze environments.

This is a minimal example showing how to create and interact with the environment.
"""

import gymnasium as gym
import miniworld_maze  # noqa: F401


def main():
    # Create environment using gymnasium registry
    env = gym.make("NineRooms-v0", obs_width=64, obs_height=64)
    obs, info = env.reset()

    # obs is a dictionary containing:
    # - 'observation': (64, 64, 3) RGB image array
    # - 'desired_goal': (64, 64, 3) RGB image of the goal state
    # - 'achieved_goal': (64, 64, 3) RGB image of the current state

    # Take a few random actions
    for step in range(10):
        action = env.action_space.sample()
        obs, reward, terminated, truncated, info = env.step(action)

        print(f"Step {step + 1}: reward={reward:.3f}, terminated={terminated}")

        if terminated or truncated:
            obs, info = env.reset()

    env.close()
    print("Environment closed successfully!")


if __name__ == "__main__":
    main()

Headless Environments

When running in headless environments (servers, CI/CD, Docker containers) or when encountering X11/OpenGL context issues, you need to enable headless rendering:

# Set environment variable before running Python
export PYGLET_HEADLESS=1
python your_script.py

Or in your Python code (must be set before importing the library):

import os
os.environ['PYGLET_HEADLESS'] = '1'

import miniworld_maze
# ... rest of your code

This configures the underlying pyglet library to use EGL rendering instead of X11, allowing the environments to run without a display server.

Environment Variants

Available Environments

The package provides three main environment variants, each with different room layouts and connection patterns:

1. NineRooms (3×3 Grid)

-------------
| 0 | 1 | 2 |
-------------
| 3 | 4 | 5 |
-------------
| 6 | 7 | 8 |
-------------

A standard 3×3 grid where adjacent rooms are connected. The agent can navigate between rooms through doorways, with connections forming a fully connected grid pattern.

2. SpiralNineRooms (3×3 Spiral Pattern)

-------------
| 0 | 1 | 2 |
-------------
| 3 | 4 | 5 |
-------------
| 6 | 7 | 8 |
-------------

Same room layout as NineRooms but with a spiral connection pattern. Only specific room pairs are connected, creating a more challenging navigation task with fewer available paths.

3. TwentyFiveRooms (5×5 Grid)

---------------------
| 0 | 1 | 2 | 3 | 4 |
---------------------
| 5 | 6 | 7 | 8 | 9 |
---------------------
|10 |11 |12 |13 |14 |
---------------------
|15 |16 |17 |18 |19 |
---------------------
|20 |21 |22 |23 |24 |
---------------------

A larger 5×5 grid environment with 25 rooms, providing more complex navigation challenges and longer episode lengths.

Observation Types

Each environment supports three different observation modes:

  • TOP_DOWN_PARTIAL (default): Agent-centered partial top-down view with limited visibility range (POMDP)
  • TOP_DOWN_FULL: Complete top-down view showing the entire environment
  • FIRST_PERSON: 3D first-person perspective view from the agent's current position

Action Space

  • Discrete Actions (default): 7 discrete actions (turn left/right, move forward/backward, strafe left/right, no-op)
  • Continuous Actions: Continuous control with continuous=True parameter

Environment Configuration

All environments can be customized with the following parameters:

import gymnasium as gym
from miniworld_maze import ObservationLevel
import miniworld_maze  # noqa: F401

env = gym.make(
    "NineRooms-v0",                        # Environment variant
    obs_level=ObservationLevel.TOP_DOWN_PARTIAL,  # Observation type
    obs_width=64,                          # Observation image width
    obs_height=64,                         # Observation image height
    room_size=5,                           # Size of each room in environment units
    door_size=2,                           # Size of doors between rooms  
    agent_mode="empty",                    # Agent rendering: "empty", "circle", "triangle"
)

Observation Format

The environment returns observations in dictionary format:

obs = {
    'observation': np.ndarray,    # (64, 64, 3) RGB image of current view
    'desired_goal': np.ndarray,   # (64, 64, 3) RGB image of goal location
    'achieved_goal': np.ndarray,  # (64, 64, 3) RGB image of current state
}

Reward Structure

  • Goal reaching: Positive reward when agent reaches the goal location
  • Step penalty: Small negative reward per step to encourage efficiency
  • Episode termination: When goal is reached or maximum steps exceeded

License

MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

miniworld_maze-1.5.0.tar.gz (32.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

miniworld_maze-1.5.0-py3-none-any.whl (32.7 MB view details)

Uploaded Python 3

File details

Details for the file miniworld_maze-1.5.0.tar.gz.

File metadata

  • Download URL: miniworld_maze-1.5.0.tar.gz
  • Upload date:
  • Size: 32.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for miniworld_maze-1.5.0.tar.gz
Algorithm Hash digest
SHA256 0a9e07aec1f59f1b780f8145c614a7bc315e8c0c0adda70886789414e30c2699
MD5 4a41d0d090a6e1fbf7375d35151766ba
BLAKE2b-256 549c8d52487075c82d7bd82ca0b632a45bfacce4c8cb687adf02bfd335b52604

See more details on using hashes here.

Provenance

The following attestation bundles were made for miniworld_maze-1.5.0.tar.gz:

Publisher: publish.yml on mctigger/miniworld-maze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file miniworld_maze-1.5.0-py3-none-any.whl.

File metadata

  • Download URL: miniworld_maze-1.5.0-py3-none-any.whl
  • Upload date:
  • Size: 32.7 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for miniworld_maze-1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d20f96e206e793d19b74ed6e6c7b1613c4fb66b39d0596baa599d1d84b9256c7
MD5 30d2a24d65234ddb98baceab4bd12383
BLAKE2b-256 6d656d6934b4e4d49afb67a1345ec3920eccd428aaf93903b5ec4e2c3e0ebdbe

See more details on using hashes here.

Provenance

The following attestation bundles were made for miniworld_maze-1.5.0-py3-none-any.whl:

Publisher: publish.yml on mctigger/miniworld-maze

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page