pyrhubarb

A Python framework for multi-modal document understanding with generative AI

These details have not been verified by PyPI

Project links

Project description

Rhubarb

Rhubarb is a light-weight Python framework that makes it easy to build document and video understanding applications using Multi-modal Large Language Models (LLMs) and Embedding models. Rhubarb is created from the ground up to work with Amazon Bedrock and supports multiple foundation models including Anthropic Claude Multi-modal Language Models and Amazon Nova models for document and video processing, along with Amazon Titan Multi-modal Embedding model for embeddings.

What can I do with Rhubarb?

Visit Rhubarb documentation.

Rhubarb can do multiple document processing tasks such as

✅ Document Q&A
✅ Streaming chat with documents (Q&A)
✅ Document Summarization
- 🚀 Page level summaries
- 🚀 Full summaries
- 🚀 Summaries of specific pages
- 🚀 Streaming Summaries
✅ Structured data extraction
✅ Extraction Schema creation assistance
✅ Named entity recognition (NER)
- 🚀 With 50 built-in common entities
✅ PII recognition with built-in entities
✅ Figure and image understanding from documents
- 🚀 Explain charts, graphs, and figures
- 🚀 Perform table reasoning (as figures)
✅ Large document processing with sliding window approach
✅ Document Classification with vector sampling using multi-modal embedding models
✅ Logs token usage to help keep track of costs

Video Analysis (New!)

✅ Video summarization
✅ Entity extraction from videos
✅ Action and movement analysis
✅ Text extraction from video frames
✅ Streaming video analysis responses

Rhubarb comes with built-in system prompts that makes it easy to use it for a number of different document understanding use-cases. You can customize Rhubarb by passing in your own system prompts. It supports exact JSON schema based output generation which makes it easy to integrate into downstream applications.

Supports PDF, TIFF, PNG, JPG, DOCX files (support for Excel, PowerPoint, CSV, Webp, eml files coming soon)
Supports MP4, AVI, MOV, and other common video formats for video analysis (S3 storage required)
Performs document to image conversion internally to work with the multi-modal models
Works on local files or files stored in S3
Supports specifying page numbers for multi-page documents
Supports chat-history based chat for documents
Supports streaming and non-streaming mode
Supports Converse API
Supports Cross-Region Inference

MCP Server Integration

Rhubarb now includes a built-in FastMCP server that exposes all document and video understanding capabilities through the Model Context Protocol (MCP). This allows seamless integration with MCP-compatible AI assistants like Cline, Claude Desktop, and other MCP clients.

MCP Features

8 Tools: Complete access to all Rhubarb capabilities including document analysis, video processing, entity extraction, and document classification
4 Resources: Built-in discovery for entities, models, schemas, and classification samples
Native Python: Direct integration without external dependencies
Conversation Memory: Maintains chat history across interactions
Flexible Authentication: Support for AWS profiles, access keys, and environment variables

Quick Start with MCP

No installation required - The MCP server auto-installs when first used

Configure in your MCP client (example for Cline):

{
  "rhubarb": {
    "command": "uvx",
    "args": [
      "pyrhubarb-mcp@latest",
      "--aws-profile", "my-profile",
      "--default-model", "claude-sonnet"
    ]
  }
}

Alternative configurations:

{
  "rhubarb": {
    "command": "uvx", 
    "args": [
      "pyrhubarb-mcp@latest",
      "--aws-access-key-id", "AKIA...",
      "--aws-secret-access-key", "your-secret",
      "--aws-region", "us-west-2"
    ]
  }
}

For detailed MCP server documentation, see README_MCP.md.

Installation

Start by installing Rhubarb using pip.

pip install pyrhubarb

Usage

Create a boto3 session.

import boto3
session = boto3.Session()

Call Rhubarb

Local file

from rhubarb import DocAnalysis

da = DocAnalysis(file_path="./path/to/doc/doc.pdf", 
                 boto3_session=session)
resp = da.run(message="What is the employee's name?")
resp

With file in Amazon S3

from rhubarb import DocAnalysis

da = DocAnalysis(file_path="s3://path/to/doc/doc.pdf", 
                 boto3_session=session)
resp = da.run(message="What is the employee's name?")
resp

Video Analysis

from rhubarb import VideoAnalysis
import boto3

session = boto3.Session()

# Initialize video analysis with a video in S3
va = VideoAnalysis(
    file_path="s3://my-bucket/my-video.mp4",
    boto3_session=session
)

# Ask questions about the video
response = va.run(message="What is happening in this video?")
print(response)

Large Document Processing

Rhubarb supports processing documents with more than 20 pages using a sliding window approach. This feature is particularly useful when working with Claude models, which have a limitation of processing only 20 pages at a time.

To enable this feature, set sliding_window_overlap to a value between 1 and 10 when creating a DocAnalysis object:

doc_analysis = DocAnalysis(
    file_path="path/to/large-document.pdf",
    boto3_session=session,
    sliding_window_overlap=2     # Number of pages to overlap between windows (1-10)
)

When the sliding window approach is enabled, Rhubarb will:

Break the document into chunks of 20 pages
Process each chunk separately
Combine the results from all chunks

Note: The sliding window technique is not yet supported for document classification. When using classification with large documents, only the first 20 pages will be considered.

For more details, see the Large Document Processing Cookbook.

For more usage examples see cookbooks.

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.0.8

Mar 23, 2026

0.0.7

Jun 14, 2025

0.0.6

Apr 15, 2025

0.0.5

Mar 17, 2025

0.0.4

Mar 8, 2025

0.0.3

Sep 30, 2024

0.0.2

Sep 11, 2024

0.0.1

Apr 18, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyrhubarb-0.0.8.tar.gz (52.7 kB view details)

Uploaded Mar 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pyrhubarb-0.0.8-py3-none-any.whl (69.5 kB view details)

Uploaded Mar 23, 2026 Python 3

File details

Details for the file pyrhubarb-0.0.8.tar.gz.

File metadata

Download URL: pyrhubarb-0.0.8.tar.gz
Upload date: Mar 23, 2026
Size: 52.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.2 CPython/3.10.20 Linux/6.14.0-1017-azure

File hashes

Hashes for pyrhubarb-0.0.8.tar.gz
Algorithm	Hash digest
SHA256	`f0d59d271ec3f66e15eb8e30e131ab7e80f6590f56ee53e29e396b2160579824`
MD5	`d158d426ed8052faef2184f0cc76770e`
BLAKE2b-256	`f8d1a5fbe49e0b78317ee7e9fc4abde4e7625aa0fdeb39ab3bc12b6d5bed8a23`

See more details on using hashes here.

File details

Details for the file pyrhubarb-0.0.8-py3-none-any.whl.

File metadata

Download URL: pyrhubarb-0.0.8-py3-none-any.whl
Upload date: Mar 23, 2026
Size: 69.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.2 CPython/3.10.20 Linux/6.14.0-1017-azure

File hashes

Hashes for pyrhubarb-0.0.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d6980b3f4e7f18889f8814b862bfa90b0a75d36188c6295807184b23ed8845c9`
MD5	`a72a69638b407842dbaf3fac15d55f15`
BLAKE2b-256	`3d5ade80adf856c343f2109aef203d36237cbe8c37aac38cde307834506f77e1`

See more details on using hashes here.

pyrhubarb 0.0.8

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Rhubarb

What can I do with Rhubarb?

Video Analysis (New!)

MCP Server Integration

MCP Features

Quick Start with MCP

Installation

Usage

Call Rhubarb

Video Analysis

Large Document Processing

Security

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes