# RoboReason
RoboReason is a Python package that makes it easy to apply reward models and video-language reasoning models to your robot videos.
## Supported Models
- Robometer (https://robometer.github.io)
- TOPReward (https://topreward.github.io/webpage/)
- RoboReward (https://arxiv.org/abs/2601.00675)
- SOLE-R1 (https://philipmit.github.io/sole-r1/)
- OpenAI models (e.g., "gpt-5")
- Google models (e.g., "gemini-3-pro-preview")
## ToDos
- Enable fine-tuning of reward models on custom datasets
## 📦 File Structure
```
roboreason/
├── roboreason/          # Main package
│   ├── robometer/       # Robometer code
│   ├── sole.py          # SOLE-R1 code
│   ├── roboreward.py    # RoboReward code
│   ├── topreward.py     # TOPReward code
│   └── api_models.py    # OpenAI and Gemini APIs
├── test_videos/         # Example videos to test
├── model_outputs/       # Videos showing model outputs
├── lerobot_examples/    # Examples showing integration with LeRobot datasets
└── pyproject.toml       # Dependencies (uv)
```
## Install
### Option 1: quick pip install

```shell
pip install -U roboreason
```
### Option 2: use uv for dependency management

1. Clone the repository:

   ```shell
   git clone https://github.com/philipmit/roboreason
   ```

2. Install uv:

   ```shell
   pip install uv
   ```

3. Sync the environment:

   ```shell
   uv sync
   ```

4. Activate the environment:

   ```shell
   source .venv/bin/activate
   ```
### Pre-download model checkpoints (optional)

```shell
# SOLE-R1 (8B)
python -c "from roboreason.utils.model_utils import get_model_dir; get_model_dir('sole')"

# Robometer (4B)
python -c "from roboreason.utils.model_utils import get_model_dir; get_model_dir('robometer')"

# TOPReward (based on Qwen3-VL-8B)
python -c "from roboreason.utils.model_utils import get_model_dir; get_model_dir('topreward')"

# RoboReward (8B)
python -c "from roboreason.utils.model_utils import get_model_dir; get_model_dir('roboreward')"
```
### Download all test videos from Google Drive (optional)

```shell
# pip install gdown
# cd /path/to/roboreason/test_videos/
gdown https://drive.google.com/drive/folders/1pXmiN-l8-khC4WABoMAn6saDvHjHGub0?usp=sharing
```
### Download all videos showing example model outputs from Google Drive (optional)

```shell
# pip install gdown
# cd /path/to/roboreason/model_outputs/
gdown https://drive.google.com/drive/folders/1gi-sTk8JssO9_UO6dHTqnkeAMjyW0YZb?usp=sharing
```
## Quick start: example reward generation and plotting

```python
# pip install -U roboreason
import roboreason as rr

video_paths = ['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4']
task_description = "Pick up the cube from the table."

# Robometer
rewards, success_probs = rr.generate(model="robometer", task_description=task_description, video_paths=video_paths, view_type_per_video=['external'])
output_robometer = {"model": "robometer", "rewards": rewards[0]}

# SOLE-R1
rewards, reasoning_traces = rr.generate(model="sole-r1", task_description=task_description, video_paths=video_paths, view_type_per_video=['external and wrist'])
output_sole = {"model": "sole-r1", "rewards": rewards[0], "reasoning_traces": reasoning_traces[0]}

rr.video_plot(
    outputs=[output_sole, output_robometer],
    plot_save_path='model_outputs/combined/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4',
    video_path=video_paths[0]
)
```
## Examples for generating rewards with each model
### Robometer

```python
import roboreason as rr

rewards, success_probs = rr.generate(
    model="robometer",
    task_description="Pick up the cube from the table.",
    video_paths=['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4'],
    view_type_per_video=['external']
)
```
### SOLE-R1

```python
import roboreason as rr

rewards, reasoning_traces = rr.generate(
    model="sole-r1",
    task_description="Pick up the cube from the table.",
    video_paths=['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4'],
    view_type_per_video=['external and wrist']
)
```
### TOPReward

```python
import roboreason as rr

rewards = rr.generate(
    model="topreward",
    task_description="Pick up the cube from the table.",
    video_paths=['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4'],
    view_type_per_video=['external']
)
```
### RoboReward

```python
import roboreason as rr

rewards = rr.generate(
    model="roboreward",
    task_description="Pick up the cube from the table.",
    video_paths=['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4'],
    view_type_per_video=['external']
)
```
### GPT-5 (and other OpenAI models)

```python
import roboreason as rr

# requires an OpenAI API key: https://developers.openai.com/api/docs/quickstart
API_KEY = "..."

rewards, reasoning_traces = rr.generate(
    model="gpt-5",
    task_description="Pick up the cube from the table.",
    video_paths=['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4'],
    view_type_per_video=['external'],
    key=API_KEY
)
```
### Gemini-3-Pro (and other Google models)

```python
import roboreason as rr

# requires a Gemini API key: https://ai.google.dev/gemini-api/docs/api-key
API_KEY = "..."

rewards, reasoning_traces = rr.generate(
    model="gemini-3-pro-preview",
    task_description="Pick up the cube from the table.",
    video_paths=['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4'],
    view_type_per_video=['external'],
    key=API_KEY
)
```
## Video plotting

```python
import roboreason as rr

task_description = "Pick up the cube from the table."
video_paths = ['test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4']

# Robometer
rewards, success_probs = rr.generate(model="robometer", task_description=task_description, video_paths=video_paths, view_type_per_video=['external'])
output_robometer = {"model": "robometer", "rewards": rewards[0]}

# SOLE-R1
rewards, reasoning_traces = rr.generate(model="sole-r1", task_description=task_description, video_paths=video_paths, view_type_per_video=['external and wrist'])
output_sole = {"model": "sole-r1", "rewards": rewards[0], "reasoning_traces": reasoning_traces[0]}

rr.video_plot(
    outputs=[output_sole, output_robometer],
    plot_save_path='model_outputs/combined/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4',
    video_path='test_videos/robosuite/lift/unsuccessful/robosuite_lift_episode_11_unsuccessful_max_reward_37.mp4'
)
```
## `rr.generate`

| Argument | Type | Required | Description |
|---|---|---|---|
| `model` | `str` | ✅ | Name of the model to use. Options include: `"robometer"`, `"sole-r1"`, `"topreward"`, `"roboreward"`, OpenAI models (e.g., `"gpt-5"`), and Google models (e.g., `"gemini-3-pro-preview"`). |
| `task_description` | `str` | ✅ | Natural language description of the task the robot is performing. |
| `video_paths` | `List[str]` | ✅ | List of paths to input video files. |
| `view_type_per_video` | `List[str]` | ✅ | List specifying the camera view(s) used for reward reasoning for each video (e.g., `"external"`, `"wrist"`, or `"external and wrist"`). |
| `key` | `str` | ❌ | API key required for external models (e.g., OpenAI or Gemini). Not needed for local models. |

| Model Type | Return Values |
|---|---|
| SOLE-R1 / GPT / Gemini | `rewards, reasoning_traces` |
| Robometer | `rewards, success_probs` |
| TOPReward / RoboReward | `rewards` |
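Because the return shape varies by model family, downstream code that loops over several models needs a small adapter. The helper below is purely illustrative and not part of the roboreason API; it only mirrors the tuple shapes listed in the return-value table above.

```python
# Hypothetical helper (not part of roboreason): normalize rr.generate
# results into one dict shape, following the return-value table above.

def unpack_outputs(model: str, result) -> dict:
    """Map a model name plus its raw rr.generate result to a uniform dict."""
    out = {"model": model}
    if model == "robometer":
        # Robometer returns (rewards, success_probs)
        out["rewards"], out["success_probs"] = result
    elif model.startswith(("sole", "gpt", "gemini")):
        # Reasoning models return (rewards, reasoning_traces)
        out["rewards"], out["reasoning_traces"] = result
    else:
        # TOPReward / RoboReward return rewards only
        out["rewards"] = result
    return out

# Example with dummy values standing in for real model outputs:
print(unpack_outputs("robometer", ([0.4], [0.9])))
```

Dicts of this shape also match the `outputs` entries passed to `rr.video_plot` in the quick-start example.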
## `rr.video_plot`

| Argument | Type | Required | Description |
|---|---|---|---|
| `outputs` | `List[dict]` | ❌* | List of model outputs (e.g., from `rr.generate`) to visualize together. |
| `plot_save_path` | `str` | ❌ | Path where the output video with overlays will be saved. |
| `video_path` | `str` | ❌ | Path to the original video file being visualized. |
| `view_type` | `str` | ❌ | View type used for visualization (e.g., `"external"`, `"wrist"`, `"external and wrist"`). |
| `show_reasoning_traces` | `bool` | ❌ | Whether to overlay reasoning traces on the video. Default: `False`. |
| `show_all_frames` | `bool` | ❌ | Whether to render all frames instead of sampled frames. Default: `False`. |
| `model` | `str` | ❌** | Model name (used when calling `video_plot` directly instead of passing `outputs`). |
| `task_description` | `str` | ❌** | Task description (used in direct-call mode). |
| `video_paths` | `List[str]` | ❌** | Input videos (used in direct-call mode). |
| `view_type_per_video` | `List[str]` | ❌** | View types per video (used in direct-call mode). |
| `key` | `str` | ❌** | API key (if required for the model). |

\* Required unless the direct-call arguments (`model`, `task_description`, `video_paths`, `view_type_per_video`) are provided instead.
\*\* Used only in direct-call mode, i.e., when `outputs` is not passed.
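The starred arguments imply two calling conventions: pass `outputs` from a prior `rr.generate` call, or pass the generation arguments and let `video_plot` run the model itself. The small checker below is hypothetical (not a roboreason function); it only encodes which argument combinations the table above describes, without invoking `rr.video_plot`.

```python
# Illustrative only: classify which video_plot calling convention a set of
# keyword arguments corresponds to, per the argument table above.

DIRECT_KEYS = {"model", "task_description", "video_paths", "view_type_per_video"}

def video_plot_mode(kwargs: dict) -> str:
    if "outputs" in kwargs:
        return "outputs"   # visualize precomputed rr.generate results
    if DIRECT_KEYS <= kwargs.keys():
        return "direct"    # video_plot generates rewards itself
    raise ValueError("pass either outputs=... or all direct-call arguments")

# Outputs mode: reuse results from rr.generate.
print(video_plot_mode({"outputs": [{"model": "robometer", "rewards": [0.2]}]}))

# Direct-call mode: supply the generation arguments instead.
print(video_plot_mode({
    "model": "sole-r1",
    "task_description": "Pick up the cube from the table.",
    "video_paths": ["episode.mp4"],
    "view_type_per_video": ["external and wrist"],
}))
```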