A Gymnasium-compatible environment for space mining reinforcement learning tasks

These details have not been verified by PyPI

Project description

SpaceMining — a novel RL environment beyond LLM priors

SpaceMining is a Gymnasium-compatible reinforcement learning (RL) environment designed to simulate asteroid mining in a 2D space. The agent, a mining robot, must collect resources from asteroids and deliver them to a central mothership while managing energy levels and avoiding moving obstacles. Featuring realistic physics, partial observability, and a comprehensive reward system, SpaceMining offers a challenging testbed for RL algorithms.

What is SpaceMining?

SpaceMining is a single-agent reinforcement learning environment simulating asteroid mining in a 2D space environment. The agent (mining robot) must collect resources from asteroids, deliver them to the mothership, manage energy consumption, and avoid moving obstacles while maximizing efficiency.

Research Purpose

This environment was specifically designed to evaluate large language models' ability to design reward functions for unfamiliar environments without prior knowledge. Recent studies have raised concerns that large language models may carry prior knowledge from pretraining data about standard RL environments (like CartPole, BipedalWalker, Ant), leading to potential prompt leakage and evaluation biases.

To address this issue, SpaceMining serves as a custom environment to assess true generalization capabilities on tasks free from such pretrained knowledge. This allows researchers to evaluate whether LLMs can effectively design reward functions for completely novel environments.

Features

2D physics-based space mining simulation with realistic movement
Dynamic resource management and energy consumption
Partial observability with limited observation radius
Moving obstacles and collision detection
Customizable reward structure and environment parameters
Real-time visualization with health bars and status indicators

Task Description

The agent is deployed in a 2D space environment (80x80 grid) with randomly distributed asteroids and a central mothership. The agent must:

Navigate efficiently to discover and mine resource-rich asteroids
Manage energy consumption and return to the mothership for recharging
Avoid collisions with moving obstacles
Deliver resources to maximize collection efficiency
Balance exploration and exploitation for optimal performance

The environment uses a comprehensive fitness scoring system with a target range of approximately 3000 points, evaluating resource collection, energy management, efficiency, and survival time.

Environment Specifications

State Space

The observation space includes:

Agent State (6 dimensions): Position (x, y), velocity (vx, vy), energy level, inventory
Asteroid Information (up to 45 dimensions): Relative positions and resource amounts for visible asteroids
Mothership Position (2 dimensions): Relative position to mothership

Action Space

The action space is continuous with 3 dimensions:

Thrust Control (2 dimensions): Force applied in x and y directions [-1.0, 1.0]
Mining Action (1 dimension): Binary mining activation [0.0, 1.0]

Difficulty Level

Medium Difficulty: The environment presents a balanced challenge with:

Limited observation radius (15 units) requiring strategic exploration
Energy management constraints requiring periodic returns to mothership
Moving obstacles requiring collision avoidance
Resource depletion mechanics requiring efficient mining strategies

Key Features

Gymnasium Compatibility: Adheres to the latest Gymnasium API for seamless integration with standard RL workflows.
Stable-Baselines3 Support: Optimized for training with PPO and other algorithms from Stable-Baselines3.
Complex Environment: Includes energy management, partial observability (limited observation radius), continuous action spaces, and dynamic obstacles.
Visualization Tools: Scripts to render episodes and generate GIFs for performance analysis.

Installation

From Source (Recommended for Development)

Clone the repository:

git clone https://github.com/reveurmichael/space_mining.git
cd space_mining

Install in a virtual environment:

python -m venv venv
# On Windows
venv\Scripts\activate
# On macOS/Linux
source venv/bin/activate
pip install .

For development mode:

pip install -e '.[dev]'

From PyPI (For Users)

Install from PyPI:

pip install space-mining

For detailed installation instructions and troubleshooting, see Installation Guide.

Getting Started

Run the quickstart example:

python examples/01_quickstart_random_agent.py

Train a PPO agent:

python -m space_mining.agents.train_ppo --total-timesteps 5000000 --output-dir runs/ppo

Or, if you are in the root directory of the repository, you can use the following command:

python space_mining/agents/train_ppo.py --total-timesteps 5000000 --output-dir runs/ppo

Or load a pre-trained model from Hugging Face and render a GIF (no training):

from space_mining import make_env
from space_mining.agents.ppo_agent import PPOAgent

env = make_env(render_mode='rgb_array', max_episode_steps=1200)
agent = PPOAgent.load_from_hf('LUNDECHEN/space-mining-ppo', filename='final_model.zip', env=env, device='cpu')
obs, _ = env.reset()
for _ in range(1200):
    action = agent.predict(obs, deterministic=True)
    obs, _, terminated, truncated, _ = env.step(action)
    if terminated or truncated:
        break
env.close()

Generate a GIF from a trained or HF checkpoint:

python -m space_mining.scripts.make_gif --checkpoint runs/ppo/final_model.zip --output output_gif/agent.gif
# or, if you are in the root directory of the repository, you can use the following command:
python space_mining/scripts/make_gif.py --checkpoint runs/ppo/final_model.zip --output output_gif/agent.gif
# or, after downloading from HF to a local file path
python -m space_mining.scripts.make_gif --checkpoint path/to/final_model.zip --output output_gif/agent.gif

Authors

Xinning Zhu (zhuxinning@shu.edu.cn)
Lunde Chen (lundechen@shu.edu.cn)

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.2

Sep 2, 2025

0.1.1

Aug 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

space_mining-0.1.2.tar.gz (34.0 kB view details)

Uploaded Sep 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

space_mining-0.1.2-py3-none-any.whl (36.7 kB view details)

Uploaded Sep 2, 2025 Python 3

File details

Details for the file space_mining-0.1.2.tar.gz.

File metadata

Download URL: space_mining-0.1.2.tar.gz
Upload date: Sep 2, 2025
Size: 34.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for space_mining-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`38f86373aa0da8c2560deb0d37ad74a620868426d5488b244b977a98d6133fc9`
MD5	`bf94004e6e5cd402fc1cfe32c6e7bbb5`
BLAKE2b-256	`00cb395c3d76e8c5665100948e5e8d96753256d8ca4e2ba8927d963cd07845b7`

See more details on using hashes here.

File details

Details for the file space_mining-0.1.2-py3-none-any.whl.

File metadata

Download URL: space_mining-0.1.2-py3-none-any.whl
Upload date: Sep 2, 2025
Size: 36.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for space_mining-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5a896dc8ad72902411784566162a4ec3c845d157dcfd2412b641c617751c7801`
MD5	`587a87aea7402d78509c3acaab2c678f`
BLAKE2b-256	`df09131a0cd00bc499dcc3484d490049931a64dfdbaaaa2fb518e32bbdbd25c9`

See more details on using hashes here.

space-mining 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

SpaceMining — a novel RL environment beyond LLM priors

What is SpaceMining?

Research Purpose

Features

Task Description

Environment Specifications

State Space

Action Space

Difficulty Level

Key Features

Installation

From Source (Recommended for Development)

From PyPI (For Users)

Getting Started

Authors

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes