Skip to main content

Fast, State of the Art Quantized Embedding Models

Project description

FastEmbed Library

FastEmbed is a Python library that provides convenient methods for indexing and searching text documents using Qdrant, a high-dimensional vector indexing and search system.

Features

  • Batch document insertion with automatic embedding using SentenceTransformers. With support for OpenAI and custom embeddings.
  • Efficient batch searching with support for filtering by metadata.
  • Automatic generation of unique IDs for documents.
  • Convenient alias methods for adding documents and performing queries.

Installation

To install the FastEmbed library, we install Qdrant client as well with pip:

pip install fastembed qdrant-client

Usage

Here's a simple usage example, which works as is:

from qdrant_client import QdrantClient

# Initialize the client
client = QdrantClient(":memory:")  # or QdrantClient(path="path/to/db")

# Prepare your documents, metadata, and IDs
docs = ["Qdrant has Langchain integrations", "Qdrant also has Llama Index integrations"]
metadatas = [
    {"source": "Langchain-docs"},
    {"source": "Linkedin-docs"},
]
ids = [42, 2]

# Use the new add method
client.add(collection_name="demo_collection", docs={"documents": docs, "metadatas": metadatas, "ids": ids})

search_result = client.query(collection_name="demo_collection", query_texts=["This is a query document"])
print(search_result)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastembed-0.0.2.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fastembed-0.0.2-py3-none-any.whl (10.8 kB view details)

Uploaded Python 3

File details

Details for the file fastembed-0.0.2.tar.gz.

File metadata

  • Download URL: fastembed-0.0.2.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.9 Darwin/22.5.0

File hashes

Hashes for fastembed-0.0.2.tar.gz
Algorithm Hash digest
SHA256 3a1dec12df1dfb1724f46a5a90a0a55bca242195f1d76b122e087dfe6b8ac144
MD5 5e875dc4f5beb3eba128d76a78713f80
BLAKE2b-256 57315a0afd6ff054828cffe8438f92c1112ab44c43f13c49588d80c2278f5a06

See more details on using hashes here.

File details

Details for the file fastembed-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: fastembed-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 10.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.9 Darwin/22.5.0

File hashes

Hashes for fastembed-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 7460266828d8ce59f598fc854917489b6339e8240fbe5da6652a5b799b7dba2a
MD5 ad7ce669dc5db6b828c3ec3bb03a1a1a
BLAKE2b-256 d83573cae7ded0bb31fd0690aab611bdd658b647744954feb243bb43c7df0c09

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page