Package for object detection, segmentation and gripping point detection.

Project description

RAI Perception

RAI Perception brings powerful computer vision capabilities to your ROS2 applications. It integrates GroundingDINO and Grounded-SAM-2 to detect objects, create segmentation masks, and calculate gripping points.

The package includes two ready-to-use ROS2 service nodes (GroundedSamAgent and GroundingDinoAgent) that you can easily add to your applications. It also provides tools that work seamlessly with RAI LLM agents to build conversational robot scenarios.

Prerequisites

Before installing rai-perception, ensure you have:

  1. ROS2 installed (Jazzy recommended, or Humble). If you don't have ROS2 yet, follow the official ROS2 installation guide for Jazzy or Humble.
  2. Python 3.8+ and pip installed (usually pre-installed on Ubuntu).
  3. NVIDIA GPU with CUDA support (required for optimal performance).
  4. wget installed (required for downloading model weights):
    sudo apt install wget
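If you want to sanity-check items 2 and 4 before installing, a quick Python snippet (a hypothetical helper, not part of the package) can confirm the interpreter version and whether wget is on your PATH:

```python
import shutil
import sys

# Item 2: Python 3.8 or newer
python_ok = sys.version_info >= (3, 8)
print(f"Python {sys.version_info.major}.{sys.version_info.minor}: {'OK' if python_ok else 'too old'}")

# Item 4: wget available on PATH (None means it still needs installing)
wget_path = shutil.which("wget")
print(f"wget: {wget_path or 'not found - run: sudo apt install wget'}")
```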
    

Installation

Step 1: Source ROS2 in your terminal:

# For ROS2 Jazzy (recommended)
source /opt/ros/jazzy/setup.bash

# For ROS2 Humble
source /opt/ros/humble/setup.bash

Step 2: Install ROS2 dependencies. rai-perception requires ROS2 packages that need to be installed separately:

# Update package lists first
sudo apt update

# Install rai_interfaces as a debian package
sudo apt install ros-jazzy-rai-interfaces  # or ros-humble-rai-interfaces for Humble

Step 3: Install rai-perception via pip:

pip install rai-perception

[!TIP] It's recommended to install rai-perception in a virtual environment to avoid conflicts with other Python packages.

[!TIP] To avoid sourcing ROS2 in every new terminal, add the source command to your ~/.bashrc file:

echo "source /opt/ros/jazzy/setup.bash" >> ~/.bashrc  # or humble

Getting Started

This section provides a step-by-step guide to get you up and running with RAI Perception.

Quick Start

After installing rai-perception, launch the perception agents:

Step 1: Open a terminal and source ROS2:

source /opt/ros/jazzy/setup.bash  # or humble

Step 2: Launch the perception agents:

python -m rai_perception.scripts.run_perception_agents

[!NOTE] The weights will be downloaded to ~/.cache/rai directory on first use.
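If you want to confirm where the weights landed after the first run, the cache location from the note above can be computed directly:

```python
from pathlib import Path

# Default weights cache location used on first use (per the note above)
cache_dir = Path.home() / ".cache" / "rai"
print(f"{cache_dir} exists: {cache_dir.exists()}")
```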

The agents create two ROS2 nodes, grounding_dino and grounded_sam, using ROS2Connector.

Testing with Example Client

The rai_perception/talker.py example demonstrates how to use the perception services for object detection and segmentation. It shows the complete pipeline: GroundingDINO for object detection followed by GroundedSAM for instance segmentation, with visualization output.

Step 1: Open a terminal and source ROS2:

source /opt/ros/jazzy/setup.bash  # or humble

Step 2: Launch the perception agents:

python -m rai_perception.scripts.run_perception_agents

Step 3: In a different terminal (remember to source ROS2 first), run the example client:

source /opt/ros/jazzy/setup.bash  # or humble
python -m rai_perception.examples.talker --ros-args -p image_path:="<path-to-image>"

You can use any image containing objects like dragons, lizards, or dinosaurs. For example, use the sample.jpg from the package's images folder. The client will detect these objects and save a visualization with bounding boxes and masks to masks.png in the current directory.

[!TIP]

If you wish to integrate open-set vision into your ROS2 launch file, a premade launch file can be found at rai/src/rai_bringup/launch/openset.launch.py

ROS2 Service Interface

The agents can be triggered by ROS2 services:

  • grounding_dino_classify: rai_interfaces/srv/RAIGroundingDino
  • grounded_sam_segment: rai_interfaces/srv/RAIGroundedSam

Dive Deeper: Tools and Integration

This section provides information for developers looking to integrate RAI Perception tools into their applications.

RAI Tools

The rai_perception package contains tools that RAI LLM agents can use to enhance their perception capabilities. For more information on RAI tools, see the Tool use and development tutorial.

GetDetectionTool

This tool calls the GroundingDINO service to detect objects from a comma-separated prompt in the provided camera topic.
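The list of object names passed to the tool is combined into a single comma-separated prompt for GroundingDINO. A minimal sketch of that conversion (the exact separator the service uses internally is an implementation detail; the comma format below is an assumption based on the description above):

```python
def to_detection_prompt(object_names):
    """Join object names into a single comma-separated prompt string."""
    return ", ".join(object_names)

prompt = to_detection_prompt(["bed", "bed pillow", "table lamp", "plant", "desk"])
print(prompt)  # bed, bed pillow, table lamp, plant, desk
```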

[!TIP]

You can try the example below with the rosbotxl demo binary. The binary exposes the /camera/camera/color/image_raw and /camera/camera/depth/image_rect_raw topics.

Example call

import time
from rai_perception.tools import GetDetectionTool
from rai.communication.ros2 import ROS2Connector, ROS2Context

with ROS2Context():
    connector = ROS2Connector(node_name="test_node")

    # Wait for topic discovery to complete
    print("Waiting for topic discovery...")
    time.sleep(3)

    x = GetDetectionTool(connector=connector)._run(
        camera_topic="/camera/camera/color/image_raw",
        object_names=["bed", "bed pillow", "table lamp", "plant", "desk"],
    )
    print(x)

Example output

I have detected the following items in the picture plant, table lamp, table lamp, bed, desk

GetDistanceToObjectsTool

This tool calls the GroundingDINO service to detect objects from a comma-separated prompt in the provided camera topic. Then it utilizes messages from the depth camera to estimate the distance to detected objects.
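The distance estimate can be thought of as aggregating the depth-map values inside each detected bounding box and scaling them by conversion_ratio. The sketch below illustrates that idea with NumPy; it is not the package's actual implementation, and the choice of the median (robust to dropped-pixel zeros and outliers) is an assumption:

```python
import numpy as np

def estimate_distance(depth_map, bbox, conversion_ratio=1.0):
    """Estimate distance to an object from the depth pixels inside its bbox.

    bbox is (x_min, y_min, x_max, y_max) in pixel coordinates; the median
    keeps stray zero/outlier depth readings from dominating the estimate.
    """
    x_min, y_min, x_max, y_max = bbox
    roi = depth_map[y_min:y_max, x_min:x_max]
    valid = roi[roi > 0]  # depth cameras often report 0 for missing data
    return float(np.median(valid)) * conversion_ratio

# A synthetic 4x4 depth map (meters) with a "desk" in the top-left 2x2 region
depth = np.array([
    [2.4, 2.5, 0.0, 9.0],
    [2.4, 2.4, 9.0, 9.0],
    [9.0, 9.0, 9.0, 9.0],
    [9.0, 9.0, 9.0, 9.0],
])
print(f"{estimate_distance(depth, (0, 0, 2, 2)):.2f}m")  # 2.40m
```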

Example call

from rai_perception.tools import GetDistanceToObjectsTool
from rai.communication.ros2 import ROS2Connector, ROS2Context
import time

with ROS2Context():
    connector = ROS2Connector(node_name="test_node")
    connector.node.declare_parameter("conversion_ratio", 1.0)  # scale parameter for the depth map

    # Wait for topic discovery to complete
    print("Waiting for topic discovery...")
    time.sleep(3)

    x = GetDistanceToObjectsTool(connector=connector)._run(
        camera_topic="/camera/camera/color/image_raw",
        depth_topic="/camera/camera/depth/image_rect_raw",
        object_names=["desk"],
    )

    print(x)

Example output

I have detected the following items in the picture desk: 2.43m away
