TurboPuffer RAG integration for Vision Agents with hybrid search
Project description
TurboPuffer RAG Plugin
Hybrid search RAG (Retrieval Augmented Generation) implementation using TurboPuffer for vector + BM25 full-text search, with Gemini for embeddings.
Features
- Hybrid Search: Combines vector (semantic) and BM25 (keyword) search for better retrieval quality
- Reciprocal Rank Fusion: Merges results from multiple search strategies
- Gemini Embeddings: Uses Google's Gemini embedding model for high-quality vectors
- Low-latency Queries: Supports cache warming for fast query responses
- Implements RAG Interface: Compatible with Vision Agents RAG base class
Installation
uv add vision-agents[turbopuffer]
Usage
from vision_agents.plugins import turbopuffer
# Initialize RAG
rag = turbopuffer.TurboPufferRAG(namespace="my-knowledge")
await rag.add_directory("./knowledge")
# Hybrid search (default)
results = await rag.search("How does the chat API work?")
# Vector-only search
results = await rag.search("How does the chat API work?", mode="vector")
# BM25-only search
results = await rag.search("chat API pricing", mode="bm25")
# Or use convenience function
rag = await turbopuffer.create_rag(
namespace="product-knowledge",
knowledge_dir="./knowledge"
)
Configuration
| Parameter | Description | Default |
|---|---|---|
namespace |
TurboPuffer namespace for storing vectors | Required |
embedding_model |
Gemini embedding model | models/gemini-embedding-001 |
chunk_size |
Size of text chunks for splitting documents | 10000 |
chunk_overlap |
Overlap between chunks for context continuity | 200 |
region |
TurboPuffer region | gcp-us-central1 |
Environment Variables
TURBO_PUFFER_KEY: TurboPuffer API keyGOOGLE_API_KEY: Google API key (for Gemini embeddings)
Dependencies
turbopuffer: TurboPuffer vector database clientlangchain-google-genai: Gemini embeddingslangchain-text-splitters: Text chunking utilities
References
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vision_agents_plugins_turbopuffer-0.3.8.tar.gz.
File metadata
- Download URL: vision_agents_plugins_turbopuffer-0.3.8.tar.gz
- Upload date:
- Size: 5.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
809cdcafe1b784f50c014e69f6007026e8100292e1e1598e458f0b6dcf92475c
|
|
| MD5 |
338ab99467516a4b89d6738d50272f8a
|
|
| BLAKE2b-256 |
cb0b23e203c9435fc14c195299d01631bfaa38154c2b2230d9f07f8ca3e91f26
|
File details
Details for the file vision_agents_plugins_turbopuffer-0.3.8-py3-none-any.whl.
File metadata
- Download URL: vision_agents_plugins_turbopuffer-0.3.8-py3-none-any.whl
- Upload date:
- Size: 10.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4121e55336e7402a3e41ea1b552a48eb3012493f57443257d8eae25685d7650e
|
|
| MD5 |
760eb24866c53058502f8c32a56e38b7
|
|
| BLAKE2b-256 |
04126f69287f4922bda10ac6e21761aae4fb4445e60072447953638cfd8f633d
|