Memory-intensive robotic manipulation benchmark for Vision-Language-Action research

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

avanturist322

These details have not been verified by PyPI

Project links

Documentation

Project description

MIKASA-Robo-VLA

A memory-intensive robotic manipulation benchmark for Vision-Language-Action research.

_{Shell Game Touch}	_{Shell Game Shuffle Color Lamp Touch}	_{Remember Color}	_{Remember Shape & Color}
_{Find Imposter Color}	_Intercept	_{Rotate Strict}	_{Take It Back}

What is MIKASA-Robo-VLA?

MIKASA-Robo-VLA extends the MIKASA-Robo memory benchmark to language-conditioned Vision-Language-Action research. It provides tabletop robotic manipulation environments that require an agent to retain and use information across delayed, occluded, temporal, or multi-stage interactions.

The canonical VLA benchmark contains 90 tasks with natural-language instructions, ManiSkill/Gymnasium environments, and released trajectory datasets for training and evaluation. The benchmark task manifest is mikasa_robo_vla_envs.csv.

What changed from MIKASA-Robo (RL release)

Task set grows from 32 → 90 registered environments covering 10 memory types (vs 4 in the RL release).
Every task ships a natural-language LANGUAGE_INSTRUCTION for VLA conditioning.
Episodes are grouped into three horizon splits (Short / Medium / Long) so multi-task training and evaluation are tractable.
22,500 PPO / motion-planning oracle trajectories are released on Hugging Face in RLDS and LeRobotDataset v3 formats — no further conversion needed (6+ million transitions).
Dense and normalised-dense rewards are calibrated for every task, enabling both offline imitation learning and online RL.
The original 32-task RL implementation is available from the mikasa-robo-rl branch and remains under mikasa_robo_suite/rl/ for backwards compatibility.

[!IMPORTANT] For the complete benchmark reference, go to the documentation website:

📚 mikasarobo.github.io

It covers installation, all 90 tasks with descriptions, dataset format, API reference, training recipes, and usage examples. This README contains only a minimal setup summary.

[!NOTE] Looking for the original RL-oriented MIKASA-Robo?

Git: mikasa-robo-rl branch

PyPI: pip install mikasa-robo-suite==0.0.5

Installation

Install from the repository with the locked uv environment:

git clone https://github.com/CognitiveAISystems/MIKASA-Robo.git
cd MIKASA-Robo
uv sync --frozen

[!TIP] The submodule (utils/convert_npz_to_rlds/) is only needed if you plan to collect your own trajectory datasets (.npz) and then convert them to RLDS format. For benchmarking, evaluation, or training on the released datasets, you can skip it. To initialize it when needed:
git submodule update --init --recursive

See the installation guide for system requirements, package-install alternatives, and setup troubleshooting.

Quick Start

Every benchmark environment should be wrapped with apply_mikasa_vla_wrappers immediately after gym.make so its observations and task logic match the released VLA data pipeline.

import gymnasium as gym
import torch

import mikasa_robo_suite.vla.memory_envs  # registers VLA env IDs
from mikasa_robo_suite.vla.utils.apply_wrappers import apply_mikasa_vla_wrappers

env = gym.make(
    "RememberColor3-VLA-v0",
    num_envs=1,
    obs_mode="rgb",
    control_mode="pd_ee_delta_pose",
    reward_mode="normalized_dense",
    render_mode="all",
    sim_backend="gpu",
)
env = apply_mikasa_vla_wrappers(env, include_overlays=False)

obs, info = env.reset(seed=42)
for _ in range(env.max_episode_steps):
    action = torch.as_tensor(env.action_space.sample(), device=env.unwrapped.device)
    obs, reward, terminated, truncated, info = env.step(action)
    if torch.as_tensor(terminated | truncated).any():
        break

env.close()

For task browsing, wrapper behavior, language instructions, and the observation/action contract, use the quick start, environment catalogue, and observation/action reference.

Benchmarking

Run the reference checkpoint-free dummy policy first to smoke-test the evaluation pipeline:

uv run python examples/eval_demo.py \
  --num-episodes 1 --sim-backend gpu \
  --output-dir eval_results/dummy

Canonical evaluation is organized by horizon split and uses the benchmark protocol for task selection, seeds, metrics, and result files. See Benchmarking and the Evaluation Protocol before reporting results.

Datasets

MIKASA-Robo-VLA provides the full 90-task trajectory release on Hugging Face. The data pipeline supports:

NPZ source episodes for local collection and custom preprocessing.
RLDS / TFDS for episodic dataset pipelines.
LeRobotDataset v3 for modern PyTorch and VLA fine-tuning workflows.

Download one LeRobotDataset task with huggingface_hub:

from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="mikasa-robo/mikasa-robo-vla-lerobot",
    repo_type="dataset",
    allow_patterns="remember_color_3_vla_v0/**",
    local_dir="data_mikasa_robo/data_lerobot",
)

allow_patterns matches paths inside the Hugging Face dataset repository. LeRobot task directories use normalized lowercase dataset names, for example RememberColor3-VLA-v0 is stored as remember_color_3_vla_v0/. The downloaded files are placed under data_mikasa_robo/data_lerobot/remember_color_3_vla_v0/.

The dataset guide covers the public RLDS and LeRobot releases, local collection, dataset fields, and export workflows.

Useful Links

Citation

If you use MIKASA-Robo-VLA in your research, please cite:

@inproceedings{cherepanov2026memory,
  title     = {Memory, Benchmark \& Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning},
  author    = {Egor Cherepanov and Nikita Kachaev and Alexey Kovalev and Aleksandr I. Panov},
  booktitle = {The Fourteenth International Conference on Learning Representations},
  year      = {2026},
  url       = {https://openreview.net/forum?id=9cLPurIZMj}
}

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

avanturist322

These details have not been verified by PyPI

Project links

Documentation

Release history Release notifications | RSS feed

This version

1.0.0

May 22, 2026

0.0.5

Apr 10, 2025

0.0.4

Mar 18, 2025

0.0.3

Mar 13, 2025

0.0.2

Mar 13, 2025

0.0.1

Mar 13, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mikasa_robo_suite-1.0.0.tar.gz (271.9 kB view details)

Uploaded May 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mikasa_robo_suite-1.0.0-py3-none-any.whl (412.7 kB view details)

Uploaded May 22, 2026 Python 3

File details

Details for the file mikasa_robo_suite-1.0.0.tar.gz.

File metadata

Download URL: mikasa_robo_suite-1.0.0.tar.gz
Upload date: May 22, 2026
Size: 271.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mikasa_robo_suite-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`5c55b5708403d9606a37a5603aca243f5bacbe96dbfe1a8ccd93ca75b029e37c`
MD5	`10c177d5c0d60ee92501ffd88b9a8e84`
BLAKE2b-256	`c6190c8e948f4500669bc69e983aa4cb30411c3f3ef0a14ba9d7f0ad0b2321c4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mikasa_robo_suite-1.0.0.tar.gz:

Publisher: publish-pypi.yml on CognitiveAISystems/MIKASA-Robo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mikasa_robo_suite-1.0.0.tar.gz
- Subject digest: 5c55b5708403d9606a37a5603aca243f5bacbe96dbfe1a8ccd93ca75b029e37c
- Sigstore transparency entry: 1604768082
- Sigstore integration time: May 22, 2026
Source repository:
- Permalink: CognitiveAISystems/MIKASA-Robo@16634db18bef08128ed79346469c86fc12169aed
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/CognitiveAISystems
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@16634db18bef08128ed79346469c86fc12169aed
- Trigger Event: release

File details

Details for the file mikasa_robo_suite-1.0.0-py3-none-any.whl.

File metadata

Download URL: mikasa_robo_suite-1.0.0-py3-none-any.whl
Upload date: May 22, 2026
Size: 412.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mikasa_robo_suite-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bb6bb4b8cef078b37719a6b2a145c89a92952581d38455f04b5d9e45eb71c5d0`
MD5	`a2e9bcadd7dc625415e3c9ffecaad9e7`
BLAKE2b-256	`687001287720f5770e607f02efcb4274f23d2557eaec7d6b75aec6e9770e5ce4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mikasa_robo_suite-1.0.0-py3-none-any.whl:

Publisher: publish-pypi.yml on CognitiveAISystems/MIKASA-Robo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mikasa_robo_suite-1.0.0-py3-none-any.whl
- Subject digest: bb6bb4b8cef078b37719a6b2a145c89a92952581d38455f04b5d9e45eb71c5d0
- Sigstore transparency entry: 1604768302
- Sigstore integration time: May 22, 2026
Source repository:
- Permalink: CognitiveAISystems/MIKASA-Robo@16634db18bef08128ed79346469c86fc12169aed
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/CognitiveAISystems
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@16634db18bef08128ed79346469c86fc12169aed
- Trigger Event: release

mikasa-robo-suite 1.0.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

MIKASA-Robo-VLA

What is MIKASA-Robo-VLA?

What changed from MIKASA-Robo (RL release)

📚 mikasarobo.github.io

Installation

Quick Start

Benchmarking

Datasets

Useful Links

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance