A gymnasium environment for SO-ARM100 single-arm manipulation based on gym-aloha

These details have not been verified by PyPI

Project description

Gym SO-ARM

A gymnasium environment for SO-ARM101 single-arm manipulation based on gym-aloha, featuring multi-camera support and advanced simulation capabilities.

demo

Features

SO-ARM101 6DOF Robotic Arm: Complete simulation of the SO-ARM101 robotic manipulator with white color scheme
Multi-Camera System: Three camera views with runtime switching:
- Overview camera: Top-down perspective
- Front camera: Side view of the workspace
- Wrist camera: First-person view from the robot's gripper
Interactive GUI Viewer: OpenCV-based viewer with keyboard controls
Grid-Based Object Placement: 3×3 grid system for randomized object positioning
MP4 Video Recording: Automatic recording of camera observations to timestamped MP4 files
6DOF Joint Control: Direct control of all 6 joints including gripper via action space
Gymnasium Compatible: Full OpenAI Gym/Gymnasium interface compliance
MuJoCo Physics: High-fidelity physics simulation using dm-control

Installation

gym-soarm works with Python 3.10

From Source

# Clone the repository
git clone https://github.com/your-org/gym-soarm.git
cd gym-soarm

# Install in development mode
pip install -e .

# Or install with development dependencies
pip install -e ".[dev,test]"

Using pip

pip install gym-soarm

Quick Start

Basic Usage

import gymnasium as gym
import gym_soarm

# Create environment with human rendering and camera configuration
env = gym.make('SoArm-v0', render_mode='human', obs_type='pixels_agent_pos', camera_config='front_wrist')

# Reset environment with specific cube position
obs, info = env.reset(options={'cube_grid_position': 4})

# The environment automatically records MP4 videos when using example.py
# Access joint positions and camera images
print(f"Joint positions: {obs['agent_pos']}")  # 6 joint values including gripper
print(f"Available cameras: {list(obs['pixels'].keys())}")  # front_camera, wrist_camera

# Run simulation with 6DOF joint control
for _ in range(200):
    action = env.action_space.sample()  # 6D action: [shoulder_pan, shoulder_lift, elbow_flex, wrist_flex, wrist_roll, gripper]
    obs, reward, terminated, truncated, info = env.step(action)
    
    if terminated or truncated:
        obs, info = env.reset()

env.close()

Interactive Joint Control

For real-time joint manipulation using sliders, use the interactive control sample:

# Run the slider control sample
python examples/slider_control_final.py

Features:

Real-time Control: Use trackbars to control each of the 6 joints (shoulder_pan, shoulder_lift, elbow_flex, wrist_flex, wrist_roll, gripper)
Visual Feedback: Live display of joint angles in radians
Reset Functionality: Reset button to return robot to initial position (all joints at 0.0 rad)
Keyboard Controls:
- SPACE: Step simulation forward
- ESC: Exit application
- R: Quick reset shortcut

Usage Instructions:

Adjust joint angles using the trackbars at the top of the control window
Press SPACE to step the simulation and see the robot move
Use the "Reset" trackbar (set to 1) to reset the environment
Press ESC to exit the application

This sample is perfect for:

Understanding joint limits and robot kinematics
Manual robot positioning and pose testing
Interactive exploration of the workspace
Educational demonstrations of robotic arm control

Block Position Control

You can control the initial position of the blue cube using three different methods:

1. Grid Position System (0-8)

Use predefined 3×3 grid positions for consistent object placement:

import gymnasium as gym
import gym_soarm

env = gym.make('SoArm-v0', render_mode='human')

# Place cube at specific grid position (0-8)
obs, info = env.reset(options={'cube_grid_position': 4})  # Center position

# Use random position (default behavior)
obs, info = env.reset(options={'cube_grid_position': None})

Grid Layout (positions 0-8):

0: (-10cm, -7.5cm)  1: (-10cm,  0cm)   2: (-10cm, +7.5cm)
3: ( 0cm,  -7.5cm)  4: ( 0cm,   0cm)   5: ( 0cm,  +7.5cm)  
6: (+10cm, -7.5cm)  7: (+10cm,  0cm)   8: (+10cm, +7.5cm)

2. Custom Coordinates

For precise control, specify exact X,Y coordinates:

import gymnasium as gym
import gym_soarm

env = gym.make('SoArm-v0', render_mode='human')

# Place cube at custom coordinates
options = {
    'cube_grid_position': -1,  # Use -1 to enable custom coordinates
    'cube_x': 0.15,           # X coordinate in meters
    'cube_y': 0.35            # Y coordinate in meters
}
obs, info = env.reset(options=options)

Custom Coordinate Examples:

# Near the front of the table
obs, info = env.reset(options={'cube_grid_position': -1, 'cube_x': 0.0, 'cube_y': 0.3})

# Left side of workspace
obs, info = env.reset(options={'cube_grid_position': -1, 'cube_x': -0.1, 'cube_y': 0.4})

# Right side with precise positioning
obs, info = env.reset(options={'cube_grid_position': -1, 'cube_x': 0.12, 'cube_y': 0.38})

3. Random Placement

Let the environment choose a random position:

# Completely random placement (default)
obs, info = env.reset()

# Explicitly request random placement
obs, info = env.reset(options={'cube_grid_position': None})

Important Notes:

All positioning methods place the cube at Z=0.05m (table surface)
The cube receives a random rotation (0°, 30°, 45°, or 60°) regardless of positioning method
Custom coordinates must be within the robot's workspace bounds
When using custom coordinates, both cube_x and cube_y parameters are required

Error Handling:

# This will raise ValueError - missing cube_y
try:
    obs, info = env.reset(options={'cube_grid_position': -1, 'cube_x': 0.1})
except ValueError as e:
    print(e)  # "cube_x and cube_y must be provided when cube_grid_position is -1"

# This will raise ValueError - invalid grid position
try:
    obs, info = env.reset(options={'cube_grid_position': 10})
except ValueError as e:
    print(e)  # "cube_grid_position must be between 0 and 8 (inclusive)..."

Camera Configuration

You can configure which cameras are included in observations to optimize performance and focus on relevant viewpoints:

import gymnasium as gym
import gym_soarm

# Front camera only (minimal, fastest)
env = gym.make('SoArm-v0', obs_type='pixels', camera_config='front_only')

# Front and wrist cameras (default, balanced)
env = gym.make('SoArm-v0', obs_type='pixels', camera_config='front_wrist')

# All cameras (comprehensive, slower)
env = gym.make('SoArm-v0', obs_type='pixels', camera_config='all')

obs, info = env.reset()
print(f"Available cameras: {list(obs.keys())}")

Camera Configuration Options:

front_only: Only front camera (side view) - fastest, minimal observations
front_wrist: Front camera + wrist camera (first-person view) - balanced performance
all: All three cameras (overview + front + wrist) - comprehensive but slower

Observation Structure by Configuration:

# front_only
obs = {
    'front_camera': np.ndarray(shape=(480, 640, 3))
}

# front_wrist  
obs = {
    'front_camera': np.ndarray(shape=(480, 640, 3)),
    'wrist_camera': np.ndarray(shape=(480, 640, 3))
}

# all
obs = {
    'overview_camera': np.ndarray(shape=(480, 640, 3)),
    'front_camera': np.ndarray(shape=(480, 640, 3)),
    'wrist_camera': np.ndarray(shape=(480, 640, 3))
}

MP4 Video Recording

The example.py script automatically records camera observations to MP4 videos:

import gymnasium as gym
import gym_soarm

# Run the example script with video recording
env = gym.make('SoArm-v0', render_mode='human', obs_type='pixels_agent_pos', camera_config='front_wrist')

# Videos are automatically saved to videos/ directory with timestamps
# - front_camera_20250729_143022.mp4
# - wrist_camera_20250729_143022.mp4

# Manual video recording can be implemented using:
frames_storage = {}
obs, info = env.reset()

# Store frames from each camera
if "pixels" in obs:
    for camera_name, frame in obs['pixels'].items():
        if camera_name not in frames_storage:
            frames_storage[camera_name] = []
        frames_storage[camera_name].append(frame.copy())

# Use save_frames_to_mp4() function from example.py to save videos

Camera Switching

During simulation with render_mode='human', use these keyboard controls:

'1': Switch to overview camera
'2': Switch to front camera
'3': Switch to wrist camera
'q': Quit simulation

Environment Details

Observation Space

The environment provides rich observations including:

Robot Joint Positions: 6DOF joint positions including gripper (6-dimensional)
Camera Images: RGB images from configured cameras (480×640×3 each)
Object Information: Positions and orientations of manipulated objects

# For obs_type='pixels_agent_pos'
obs_space = gym.spaces.Dict({
    'agent_pos': gym.spaces.Box(-np.inf, np.inf, shape=(6,), dtype=np.float64),  # Joint positions
    'pixels': gym.spaces.Dict({
        'front_camera': gym.spaces.Box(0, 255, shape=(480, 640, 3), dtype=np.uint8),
        'wrist_camera': gym.spaces.Box(0, 255, shape=(480, 640, 3), dtype=np.uint8)
    })
})

Action Space

6DOF joint position control for the SO-ARM101:

Dimensions: 6 (shoulder_pan, shoulder_lift, elbow_flex, wrist_flex, wrist_roll, gripper)
Range: Joint-specific limits based on hardware specifications
Control: Direct joint position targets

Workspace Configuration

Table Size: 64cm × 45cm
Object Grid: 3×3 positioning system with ±10cm(X), ±7.5cm(Y) spacing
Cube Size: 3cm × 3cm × 3cm blue cubes
Robot Base: Positioned at (0, 0.15, 0) with 90° rotation
Robot Color: White color scheme with black servo motors for visual clarity

Camera Specifications

Camera	Position	Orientation	FOV	Description
Overview	(0, 0.4, 0.8)	Top-down	90°	Bird's eye view
Front	(0, 0.7, 0.25)	Angled forward	120°	Side perspective
Wrist	(0, -0.04, 0)	30° X-rotation	110°	First-person view

Development

Project Structure

gym-soarm/
├── gym_soarm/              # Main package
│   ├── __init__.py        # Package initialization
│   ├── env.py            # Main environment class
│   ├── constants.py      # Environment constants
│   ├── assets/           # Robot models and scenes
│   │   ├── so101_new_calib.xml    # SO-ARM101 robot model (white color)
│   │   ├── so_arm_main_new.xml    # Scene with table and objects
│   │   └── assets/               # STL mesh files
│   └── tasks/            # Task implementations
│       ├── __init__.py
│       └── sim.py        # Manipulation tasks
├── examples/             # Example scripts and demonstrations
│   ├── example.py        # Basic usage with MP4 recording
│   └── slider_control_final.py  # Interactive joint control with sliders
├── videos/               # Auto-generated MP4 video outputs
├── setup.py             # Package setup
├── pyproject.toml       # Poetry configuration
└── README.md            # This file

Running Tests

# Install test dependencies
pip install -e ".[test]"

# Run comprehensive test suite
pytest tests/ -v

# Run specific test categories
pytest tests/test_e2e.py -v              # End-to-end tests
pytest tests/test_camera_config.py -v    # Camera configuration tests

# Run basic functionality test
python examples/example.py

# Test interactive joint control
python examples/slider_control_final.py

# Test camera configuration features
python test_camera_features.py

Code Style

The project uses Ruff for linting and formatting:

# Install development dependencies
pip install -e ".[dev]"

# Run linting
ruff check gym_soarm/

# Auto-format code
ruff format gym_soarm/

Hardware Requirements

Python: ≥3.10
OpenGL: Required for rendering
Memory: ≥4GB RAM recommended
Storage: ~500MB for assets and dependencies

Troubleshooting

Common Issues

MuJoCo Installation: Ensure MuJoCo ≥2.3.7 is properly installed
OpenGL Context: On headless systems, use xvfb-run for rendering
Asset Loading: Verify all .stl files are present in assets/assets/

Platform-Specific Notes

macOS: May require XQuartz for OpenGL support
Linux: Ensure proper GPU drivers for hardware acceleration
Windows: Use WSL2 for best compatibility

Citation

If you use this environment in your research, please cite:

@software{gym_soarm,
  title={Gym SO-ARM: A Gymnasium Environment for SO-ARM101 Manipulation},
  author={SO-ARM Development Team},
  version={0.1.0},
  year={2024},
  url={https://github.com/your-org/gym-soarm}
}

License

Apache 2.0 License - see LICENSE file for details.

Contributing

Contributions are welcome! Please read our contributing guidelines and submit pull requests to our GitHub repository.

Support

For questions and support:

GitHub Issues: Report bugs or request features
Discussions: Community discussions

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.4.0

Nov 16, 2025

0.3.1

Aug 18, 2025

0.3.0

Aug 15, 2025

0.1.0

Aug 9, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gym_soarm-0.4.0-py3-none-any.whl (6.0 MB view details)

Uploaded Nov 16, 2025 Python 3

File details

Details for the file gym_soarm-0.4.0-py3-none-any.whl.

File metadata

Download URL: gym_soarm-0.4.0-py3-none-any.whl
Upload date: Nov 16, 2025
Size: 6.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.11

File hashes

Hashes for gym_soarm-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`de1c5d63c8268879fa4d3344ba2876996ed5e17cbcdb4a35907fab55ec2c8bf1`
MD5	`fa2f245451a639ec8283eb469c3ff5e1`
BLAKE2b-256	`ac300c6042fd8493501dc4481ee49d88582265217e6f077f949b3833c0fb042b`

See more details on using hashes here.

gym-soarm 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers