Native Qdrant implementation for Thoth Vector Database
Project description
Thoth Qdrant
A native Qdrant implementation for the Thoth Vector Database system, providing high-performance vector storage and similarity search capabilities without Haystack dependencies.
Features
- Native Qdrant Integration: Direct use of Qdrant client without Haystack
- Full API Compatibility: Same interface as thoth_vdb2 for seamless integration
- External Embeddings: Support for OpenAI, Cohere, Mistral, and HuggingFace
- Document Types: EvidenceDocument, SqlDocument, ColumnNameDocument
- Similarity Search: Native Qdrant search with document type filtering
- Batch Operations: Efficient bulk document insertion
- Caching: Intelligent embedding cache for performance
Installation
# Basic installation
pip install thoth-qdrant
# With OpenAI embeddings support
pip install thoth-qdrant[openai]
# With all embedding providers
pip install thoth-qdrant[all-providers]
Configuration
Environment Variables
# Embedding provider configuration
export EMBEDDING_PROVIDER=openai
export EMBEDDING_MODEL=text-embedding-3-small
export OPENAI_API_KEY=your-api-key
# Or use provider-specific keys
export OPENAI_API_KEY=sk-...
export COHERE_API_KEY=...
export MISTRAL_API_KEY=...
Qdrant Setup
Ensure Qdrant is running locally:
docker run -p 6333:6333 qdrant/qdrant
Usage
from thoth_qdrant import VectorStoreFactory
from thoth_qdrant.core.base import (
ColumnNameDocument,
SqlDocument,
EvidenceDocument,
ThothType,
)
# Create vector store
store = VectorStoreFactory.create(
backend="qdrant",
collection="my_collection",
host="localhost",
port=6333,
embedding_provider="openai",
embedding_model="text-embedding-3-small"
)
# Add documents
column_doc = ColumnNameDocument(
table_name="users",
column_name="email",
original_column_name="email_address",
column_description="User email for authentication",
value_description="Valid email format"
)
doc_id = store.add_column_description(column_doc)
sql_doc = SqlDocument(
question="How to find recent users?",
sql="SELECT * FROM users WHERE created_at > NOW() - INTERVAL '30 days'",
evidence="Filter by date using interval"
)
store.add_sql(sql_doc)
# Search similar documents
results = store.search_similar(
query="user email authentication",
doc_type=ThothType.COLUMN_NAME,
top_k=5,
score_threshold=0.7
)
# Bulk operations
documents = [column_doc, sql_doc]
doc_ids = store.bulk_add_documents(documents)
# Get document by ID
doc = store.get_document(doc_id)
# Delete document
store.delete_document(doc_id)
# Get all documents by type
all_columns = store.get_all_column_documents()
all_sql = store.get_all_sql_documents()
# Collection info
info = store.get_collection_info()
print(info)
API Reference
VectorStoreInterface Methods
add_column_description(doc: ColumnNameDocument) -> stradd_sql(doc: SqlDocument) -> stradd_evidence(doc: EvidenceDocument) -> strsearch_similar(query: str, doc_type: ThothType, top_k: int = 5, score_threshold: float = 0.7) -> List[BaseThothDocument]get_document(doc_id: str) -> Optional[BaseThothDocument]delete_document(doc_id: str) -> Nonebulk_add_documents(documents: List[BaseThothDocument]) -> List[str]delete_collection(thoth_type: ThothType) -> Noneget_all_column_documents() -> List[ColumnNameDocument]get_all_sql_documents() -> List[SqlDocument]get_all_evidence_documents() -> List[EvidenceDocument]get_collection_info() -> Dict[str, Any]
Testing
# Run tests with local Qdrant
pytest tests/
# Run specific test
pytest tests/test_qdrant_adapter.py -v
# With coverage
pytest --cov=thoth_qdrant tests/
Development
# Install development dependencies
pip install -e .[dev,test]
# Format code
black thoth_qdrant tests
isort thoth_qdrant tests
# Type checking
mypy thoth_qdrant
# Linting
ruff thoth_qdrant
License
Apache License 2.0 - See LICENSE.md for details
Compatibility
This library is fully compatible with thoth_vdb2 API, allowing seamless migration from Haystack-based implementations to native Qdrant.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file thoth_qdrant-0.1.4.tar.gz.
File metadata
- Download URL: thoth_qdrant-0.1.4.tar.gz
- Upload date:
- Size: 19.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
55d3f90267b08da3226e1a13c6420598c980c9d0f15eb593db6445f2fc67704a
|
|
| MD5 |
e5779355d5d0b0083f4c7d29ba36c4c5
|
|
| BLAKE2b-256 |
a4df144ddd08c50c388baa7e7c2c61e358c37ff561fa13a08a6aa43323c6f517
|
File details
Details for the file thoth_qdrant-0.1.4-py3-none-any.whl.
File metadata
- Download URL: thoth_qdrant-0.1.4-py3-none-any.whl
- Upload date:
- Size: 17.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ca904760549c44c1554580b2f9968cee75e9e165f7c450630701780393aa054b
|
|
| MD5 |
c281923db53b18d9fdfedd87ddc30f19
|
|
| BLAKE2b-256 |
f6dda7ed023ed6b746fde9153aa1f16deb0c612a51c610d0018109b3aebe0113
|