Qdrant RAG integration for Vision Agents with hybrid search
Project description
Qdrant RAG Plugin
Hybrid search RAG (Retrieval Augmented Generation) using Qdrant's built-in fastembed integration for dense and BM25 sparse embeddings.
Features
- Hybrid Search: Dense vector (semantic) + BM25 sparse (keyword) via native Qdrant RRF fusion
- fastembed Native: No external embedding dependencies — Qdrant client handles everything
- Implements RAG Interface: Compatible with Vision Agents RAG base class
Installation
uv add "vision-agents[qdrant]"
# or directly
uv add vision-agents-plugins-qdrant
Usage
from vision_agents.plugins import qdrant
# Initialize RAG (connects to local Qdrant by default)
rag = qdrant.QdrantRAG(collection="my-knowledge")
await rag.add_directory("./knowledge")
# Hybrid search (default)
results = await rag.search("How does the chat API work?")
# Vector-only search
results = await rag.search("How does the chat API work?", mode="vector")
# BM25 search
results = await rag.search("chat API pricing", mode="bm25")
# Or use convenience function
rag = await qdrant.create_rag(
collection="product-knowledge",
knowledge_dir="./knowledge"
)
Configuration
| Parameter | Description | Default |
|---|---|---|
collection |
Qdrant collection name | Required |
url |
Qdrant server URL | http://localhost:6333 |
api_key |
Qdrant API key (for Qdrant Cloud) | QDRANT_API_KEY env var |
dense_model |
fastembed dense model for semantic search | sentence-transformers/all-MiniLM-L6-v2 |
sparse_model |
fastembed sparse model for BM25 search | Qdrant/bm25 |
chunk_size |
Size of text chunks for splitting documents | 10000 |
chunk_overlap |
Overlap between chunks for context continuity | 200 |
cloud_inference |
Use Qdrant Cloud server-side inference instead of local fastembed | False |
Environment Variables
QDRANT_API_KEY: Qdrant API key (for Qdrant Cloud; not needed for local)
Running Qdrant locally
docker run -p 6333:6333 qdrant/qdrant
Dependencies
qdrant-client[fastembed]: Qdrant async client with built-in fastembed supportlangchain-text-splitters: Text chunking utilities
References
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vision_agents_plugins_qdrant-0.6.4.tar.gz.
File metadata
- Download URL: vision_agents_plugins_qdrant-0.6.4.tar.gz
- Upload date:
- Size: 4.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1114c8c5bbbd08ec243d899d82586479cc0e4db082f554ebdfb8b982c581b64b
|
|
| MD5 |
31ccdb861fa6dea3bee9649fde69622c
|
|
| BLAKE2b-256 |
0f574422bc20bb40225b21880ad62aabe7bc13d77dffa064da99083acf59c9c3
|
File details
Details for the file vision_agents_plugins_qdrant-0.6.4-py3-none-any.whl.
File metadata
- Download URL: vision_agents_plugins_qdrant-0.6.4-py3-none-any.whl
- Upload date:
- Size: 4.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
abf73c6ea9f5dc9482d052af3023d03cb2335b087a495f23aa9303edde3cefaf
|
|
| MD5 |
fe3c8fcf3da1aac4800bbada2b33e62d
|
|
| BLAKE2b-256 |
f2c6067aa5cc516bdcb4883e17c7c00eb3e34894a06ac40da88e6f1141f9d952
|