Fast embedding alternative to Sentence transformers
Project description
FastVector Library
FastVector is a Python library that provides convenient methods for indexing and searching text documents using Qdrant, a high-dimensional vector indexing and search system.
Features
- Batch document insertion with automatic embedding using SentenceTransformers. With support for OpenAI and custom embeddings.
- Efficient batch searching with support for filtering by metadata.
- Automatic generation of unique IDs for documents.
- Convenient alias methods for adding documents and performing queries.
Installation
To install the FastVector library, we install Qdrant client as well with pip:
pip install fastvector qdrant-client
Usage
Here's a simple usage example, which works as is:
from qdrant_client import QdrantClient
# Initialize the client
client = QdrantClient(":memory:") # or QdrantClient(path="path/to/db")
# Prepare your documents, metadata, and IDs
docs = ["Qdrant has Langchain integrations", "Qdrant also has Llama Index integrations"]
metadatas = [
{"source": "Langchain-docs"},
{"source": "Linkedin-docs"},
]
ids = [42, 2]
# Use the new add method
client.add(collection_name="demo_collection", docs={"documents": docs, "metadatas": metadatas, "ids": ids})
search_result = client.query(collection_name="demo_collection", query_texts=["This is a query document"])
print(search_result)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
fastembed-0.0.1a1.tar.gz
(7.7 kB
view hashes)
Built Distribution
Close
Hashes for fastembed-0.0.1a1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3a3e92288b02cadc84e666f956253056415277846a969a84e74f529e4da770ea |
|
MD5 | cd60100be9baf6f6389e7d4457529c49 |
|
BLAKE2b-256 | 1f83931e0b91c94bca7b12fb9d153201bd48b5740958a2ab6f531e7a08a957d2 |