A gym environment for LuckyWorld

These details have not been verified by PyPI

Project description

gym-luckyworld

A gym environment for the Lucky World simulator

Installation

Create a virtual environment with Python 3.13 and activate it, e.g. with miniconda:

conda create -y -n luckyworld python=3.13 && conda activate luckyworld

Install gym-luckyworld:

pip install gym-luckyworld

Quickstart

import imageio
import numpy as np
import gymnasium as gym
import gym_luckyworld # noqa: F401

env = gym.make("gym_luckyworld/LuckyWorld-PickandPlace-v0")

observation, info = env.reset()
frames = []

for _ in range(1000):
    action = env.action_space.sample()
    observation, reward, terminated, truncated, info = env.step(action)
    image = env.render()
    if env.render_mode == "rgb_array":
        frames.append(image)

    if terminated or truncated:
        observation, info = env.reset()

env.close()

if env.render_mode == "rgb_array":  
    imageio.mimsave("example.mp4", np.stack(frames), fps=10)

Description

LuckyWorld environment for imitation learning with robotic manipulation tasks.

Two tasks are available:

PickandPlace: The SO100 robot arm needs to pick up objects and place them at target locations.
Navigation: Robot navigation tasks for mobile platforms.

Robots

SO100: 6-DOF robotic arm with the following actuators:
- shoulder_pan: Shoulder rotation (-2.2 to 2.2 rad)
- shoulder_lift: Shoulder elevation (-3.14 to 0.2 rad)
- elbow_flex: Elbow joint (0.0 to 3.14 rad)
- wrist_flex: Wrist flexion (-2.0 to 1.8 rad)
- wrist_roll: Wrist rotation (-3.14 to 3.14 rad)
- gripper: Gripper position (-0.2 to 2.0)

Action Space

The action space consists of continuous values for the robot's joint positions, resulting in a 6-dimensional vector for SO100:

Six values for the arm's joint positions (absolute values within joint limits).

Observation Space

Observations are provided as a dictionary with the following keys:

agent_pos: Current joint positions of the robot arm (6D vector for SO100).
pixels: RGB camera feed (480x640x3) from the robot's perspective.

Rewards

Important: This environment is designed for imitation learning. All rewards are set to 0.0 by default.

For reinforcement learning applications, you must implement custom reward functions by:

Subclassing the task classes (PickandPlace, Navigation)
Overriding the get_reward() method with your domain-specific reward logic

Termination Criteria

Episodes terminate based on task-specific conditions:

PickandPlace:
- Success: Object is placed at target location
- Failure: Object is dropped away from target
Navigation:
- Success: Robot reaches target location
- Failure: Robot collides with obstacles

Starting State

The robot and environment objects start at randomized positions within predefined bounds.

Arguments

>>> import gymnasium as gym
>>> import gym_luckyworld
>>> env = gym.make("gym_luckyworld/LuckyWorld-PickandPlace-v0", obs_type="pixels_agent_pos", render_mode="rgb_array")
>>> env
<TimeLimit<OrderEnforcing<PassiveEnvChecker<LuckyWorldEnv<gym_luckyworld/LuckyWorld-PickandPlace-v0>>>>>

obs_type: (str) The observation type. Currently supports pixels_agent_pos (camera + joint positions). Default is pixels_agent_pos.
render_mode: (str) The rendering mode. Can be human (OpenCV windows) or rgb_array (numpy arrays). Default is human.
scene: (str) The scene to load. Default is kitchen.
robot: (str) The robot type. Currently supports so100. Default is so100.
timeout: (float) Maximum episode duration in seconds. Default is 30.0.

Example Usage for Imitation Learning

import gymnasium as gym
import gym_luckyworld

# Create environment
env = gym.make(
    "gym_luckyworld/LuckyWorld-PickandPlace-v0",
    obs_type="pixels_agent_pos",
    render_mode="rgb_array"
)

# Collect demonstration data
observations = []
actions = []

obs, info = env.reset()
for step in range(100):
    # Your demonstration policy here
    action = your_policy(obs)
    
    observations.append(obs)
    actions.append(action)
    
    obs, reward, terminated, truncated, info = env.step(action)
    
    if terminated or truncated:
        obs, info = env.reset()

env.close()

# Use observations and actions for imitation learning
# (e.g., with ACT, BC, IQL, etc.)

Custom Rewards for RL

If you want to use this environment for reinforcement learning, implement custom rewards:

from gym_luckyworld.task import PickandPlace

class RewardedPickandPlace(PickandPlace):
    def get_reward(self, observation, info):
        # Implement your reward logic here
        object_distance = info.get("object_distance_from_target", float('inf'))
        reward = -object_distance  # Negative distance as reward
        
        # Add success bonus
        if object_distance < self.distance_threshold:
            reward += 100.0
            
        return reward

# Use your custom task
env = gym.make("gym_luckyworld/LuckyWorld-PickandPlace-v0")
env.task = RewardedPickandPlace("kitchen", "pickandplace", "so100", "human")

Contribute

Instead of using pip directly, we use poetry for development purposes to easily track our dependencies. If you don't have it already, follow the instructions to install it.

Install the project with dev dependencies:

poetry install --all-extras

Follow our style

# install pre-commit hooks
pre-commit install

# apply style and linter checks on staged files
pre-commit

Acknowledgment

gym-luckyworld is adapted from gym-aloha and built on top of the LuckyRobots simulation platform.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.0.20

Jul 2, 2025

0.0.19

Jul 2, 2025

0.0.18

Jul 2, 2025

0.0.17

Jul 1, 2025

0.0.16

Jun 26, 2025

0.0.15

Jun 25, 2025

0.0.14

Jun 24, 2025

0.0.13

Jun 23, 2025

0.0.12

Jun 22, 2025

0.0.11

Jun 20, 2025

0.0.10

Jun 19, 2025

This version

0.0.9

Jun 18, 2025

0.0.8

Jun 17, 2025

0.0.7

Jun 15, 2025

0.0.6

Jun 14, 2025

0.0.5

Jun 14, 2025

0.0.4

Jun 13, 2025

0.0.3

Jun 13, 2025

0.0.1

Apr 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gym_luckyworld-0.0.9.tar.gz (7.6 kB view details)

Uploaded Jun 18, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gym_luckyworld-0.0.9-py3-none-any.whl (8.7 kB view details)

Uploaded Jun 18, 2025 Python 3

File details

Details for the file gym_luckyworld-0.0.9.tar.gz.

File metadata

Download URL: gym_luckyworld-0.0.9.tar.gz
Upload date: Jun 18, 2025
Size: 7.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for gym_luckyworld-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`60df8514928f971337a34cdb5071526c4ca07adbbd6db2a9126cbcdb26e13db0`
MD5	`1ed762401d001873f07782bfa6079321`
BLAKE2b-256	`0ec3b8130c72d4113f064754bc496dbbd44690522ee2389a4d410b45e87b0fd7`

See more details on using hashes here.

File details

Details for the file gym_luckyworld-0.0.9-py3-none-any.whl.

File metadata

Download URL: gym_luckyworld-0.0.9-py3-none-any.whl
Upload date: Jun 18, 2025
Size: 8.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for gym_luckyworld-0.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bbe1ebf20b7c88b1c5933f6ae56c3b5ce07b7feaeb486bc071f2ff0facad2001`
MD5	`961d1d1e05953d41badcef6a30cf259f`
BLAKE2b-256	`aeab626798a76113253606621c6e5cd9ad8a0facde52ad59fa963110753c02c9`

See more details on using hashes here.

gym-luckyworld 0.0.9

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

gym-luckyworld

Installation

Quickstart

Description

Robots

Action Space

Observation Space

Rewards

Termination Criteria

Starting State

Arguments

Example Usage for Imitation Learning

Custom Rewards for RL

Contribute

Follow our style

Acknowledgment

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes