Fast, light, accurate library built for retrieval embedding generation (IMAGIN.studio fork — temporary pillow 12 fix)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

This is a temporary fork by IMAGIN.studio.

Upstream fastembed 0.7.4 pins pillow<12.0, which blocks Pillow 12.x security fixes (CVE-2026-25990). The fix is merged on main but not yet released (#606).

What changed: only the pillow version constraint — relaxed from <12.0 to <13.0 for Python 3.10+. No logic changes.

Published as: fastembed-imagin-studio on PyPI.

Revert plan: once upstream releases fastembed 0.7.5+ with the pillow fix, imagin-studio-api-docs-mcp will switch back to fastembed and this fork will be archived.

⚡️ What is FastEmbed?

FastEmbed is a lightweight, fast, Python library built for embedding generation. We support popular text models. Please open a GitHub issue if you want us to add a new model.

The default text embedding (TextEmbedding) model is Flag Embedding, presented in the MTEB leaderboard. It supports "query" and "passage" prefixes for the input text. Here is an example for Retrieval Embedding Generation and how to use FastEmbed with Qdrant.

📈 Why FastEmbed?

Light: FastEmbed is a lightweight library with few external dependencies. We don't require a GPU and don't download GBs of PyTorch dependencies, and instead use the ONNX Runtime. This makes it a great candidate for serverless runtimes like AWS Lambda.
Fast: FastEmbed is designed for speed. We use the ONNX Runtime, which is faster than PyTorch. We also use data parallelism for encoding large datasets.
Accurate: FastEmbed is better than OpenAI Ada-002. We also support an ever-expanding set of models, including a few multilingual models.

🚀 Installation

To install the FastEmbed library, pip works best. You can install it with or without GPU support:

pip install fastembed

# or with GPU support

pip install fastembed-gpu

📖 Quickstart

from fastembed import TextEmbedding


# Example list of documents
documents: list[str] = [
    "This is built to be faster and lighter than other embedding libraries e.g. Transformers, Sentence-Transformers, etc.",
    "fastembed is supported by and maintained by Qdrant.",
]

# This will trigger the model download and initialization
embedding_model = TextEmbedding()
print("The model BAAI/bge-small-en-v1.5 is ready to use.")

embeddings_generator = embedding_model.embed(documents)  # reminder this is a generator
embeddings_list = list(embedding_model.embed(documents))
  # you can also convert the generator to a list, and that to a numpy array
len(embeddings_list[0]) # Vector of 384 dimensions

Fastembed supports a variety of models for different tasks and modalities. The list of all the available models can be found here

🎒 Dense text embeddings

from fastembed import TextEmbedding

model = TextEmbedding(model_name="BAAI/bge-small-en-v1.5")
embeddings = list(model.embed(documents))

# [
#   array([-0.1115,  0.0097,  0.0052,  0.0195, ...], dtype=float32),
#   array([-0.1019,  0.0635, -0.0332,  0.0522, ...], dtype=float32)
# ]

Dense text embedding can also be extended with models which are not in the list of supported models.

from fastembed import TextEmbedding
from fastembed.common.model_description import PoolingType, ModelSource

TextEmbedding.add_custom_model(
    model="intfloat/multilingual-e5-small",
    pooling=PoolingType.MEAN,
    normalization=True,
    sources=ModelSource(hf="intfloat/multilingual-e5-small"),  # can be used with an `url` to load files from a private storage
    dim=384,
    model_file="onnx/model.onnx",  # can be used to load an already supported model with another optimization or quantization, e.g. onnx/model_O4.onnx
)
model = TextEmbedding(model_name="intfloat/multilingual-e5-small")
embeddings = list(model.embed(documents))

🔱 Sparse text embeddings

SPLADE++

from fastembed import SparseTextEmbedding

model = SparseTextEmbedding(model_name="prithivida/Splade_PP_en_v1")
embeddings = list(model.embed(documents))

# [
#   SparseEmbedding(indices=[ 17, 123, 919, ... ], values=[0.71, 0.22, 0.39, ...]),
#   SparseEmbedding(indices=[ 38,  12,  91, ... ], values=[0.11, 0.22, 0.39, ...])
# ]

🦥 Late interaction models (aka ColBERT)

from fastembed import LateInteractionTextEmbedding

model = LateInteractionTextEmbedding(model_name="colbert-ir/colbertv2.0")
embeddings = list(model.embed(documents))

# [
#   array([
#       [-0.1115,  0.0097,  0.0052,  0.0195, ...],
#       [-0.1019,  0.0635, -0.0332,  0.0522, ...],
#   ]),
#   array([
#       [-0.9019,  0.0335, -0.0032,  0.0991, ...],
#       [-0.2115,  0.8097,  0.1052,  0.0195, ...],
#   ]),  
# ]

🖼️ Image embeddings

from fastembed import ImageEmbedding

images = [
    "./path/to/image1.jpg",
    "./path/to/image2.jpg",
]

model = ImageEmbedding(model_name="Qdrant/clip-ViT-B-32-vision")
embeddings = list(model.embed(images))

# [
#   array([-0.1115,  0.0097,  0.0052,  0.0195, ...], dtype=float32),
#   array([-0.1019,  0.0635, -0.0332,  0.0522, ...], dtype=float32)
# ]

Late interaction multimodal models (ColPali)

from fastembed import LateInteractionMultimodalEmbedding

doc_images = [
    "./path/to/qdrant_pdf_doc_1_screenshot.jpg",
    "./path/to/colpali_pdf_doc_2_screenshot.jpg",
]

query = "What is Qdrant?"

model = LateInteractionMultimodalEmbedding(model_name="Qdrant/colpali-v1.3-fp16")
doc_images_embeddings = list(model.embed_image(doc_images))
# shape (2, 1030, 128)
# [array([[-0.03353882, -0.02090454, ..., -0.15576172, -0.07678223]], dtype=float32)]
query_embedding = model.embed_text(query)
# shape (1, 20, 128)
# [array([[-0.00218201,  0.14758301, ...,  -0.02207947,  0.16833496]], dtype=float32)]

🔄 Rerankers

from fastembed.rerank.cross_encoder import TextCrossEncoder

query = "Who is maintaining Qdrant?"
documents: list[str] = [
    "This is built to be faster and lighter than other embedding libraries e.g. Transformers, Sentence-Transformers, etc.",
    "fastembed is supported by and maintained by Qdrant.",
]
encoder = TextCrossEncoder(model_name="Xenova/ms-marco-MiniLM-L-6-v2")
scores = list(encoder.rerank(query, documents))

# [-11.48061752319336, 5.472434997558594]

Text cross encoders can also be extended with models which are not in the list of supported models.

from fastembed.rerank.cross_encoder import TextCrossEncoder 
from fastembed.common.model_description import ModelSource

TextCrossEncoder.add_custom_model(
    model="Xenova/ms-marco-MiniLM-L-4-v2",
    model_file="onnx/model.onnx",
    sources=ModelSource(hf="Xenova/ms-marco-MiniLM-L-4-v2"),
)
model = TextCrossEncoder(model_name="Xenova/ms-marco-MiniLM-L-4-v2")
scores = list(model.rerank_pairs(
    [("What is AI?", "Artificial intelligence is ..."), ("What is ML?", "Machine learning is ..."),]
))

⚡️ FastEmbed on a GPU

FastEmbed supports running on GPU devices. It requires installation of the fastembed-gpu package.

pip install fastembed-gpu

Check our example for detailed instructions, CUDA 12.x support and troubleshooting of the common issues.

from fastembed import TextEmbedding

embedding_model = TextEmbedding(
    model_name="BAAI/bge-small-en-v1.5", 
    providers=["CUDAExecutionProvider"]
)
print("The model BAAI/bge-small-en-v1.5 is ready to use on a GPU.")

Usage with Qdrant

Installation with Qdrant Client in Python:

pip install qdrant-client[fastembed]

pip install qdrant-client[fastembed-gpu]

You might have to use quotes pip install 'qdrant-client[fastembed]' on zsh.

from qdrant_client import QdrantClient, models

# Initialize the client
client = QdrantClient("localhost", port=6333) # For production
# client = QdrantClient(":memory:") # For experimentation

model_name = "sentence-transformers/all-MiniLM-L6-v2"
payload = [
    {"document": "Qdrant has Langchain integrations", "source": "Langchain-docs", },
    {"document": "Qdrant also has Llama Index integrations", "source": "LlamaIndex-docs"},
]
docs = [models.Document(text=data["document"], model=model_name) for data in payload]
ids = [42, 2]

client.create_collection(
    "demo_collection",
    vectors_config=models.VectorParams(
        size=client.get_embedding_size(model_name), distance=models.Distance.COSINE)
)

client.upload_collection(
    collection_name="demo_collection",
    vectors=docs,
    ids=ids,
    payload=payload,
)

search_result = client.query_points(
    collection_name="demo_collection",
    query=models.Document(text="This is a query document", model=model_name)
).points
print(search_result)

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

IMAGINstudio

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.7.5.3

Mar 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastembed_imagin_studio-0.7.5.3.tar.gz (76.1 kB view details)

Uploaded Mar 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

fastembed_imagin_studio-0.7.5.3-py3-none-any.whl (117.2 kB view details)

Uploaded Mar 24, 2026 Python 3

File details

Details for the file fastembed_imagin_studio-0.7.5.3.tar.gz.

File metadata

Download URL: fastembed_imagin_studio-0.7.5.3.tar.gz
Upload date: Mar 24, 2026
Size: 76.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fastembed_imagin_studio-0.7.5.3.tar.gz
Algorithm	Hash digest
SHA256	`db5ac29406ba201078aa6f3aec04f9081d9974786f171eb0d363107e5693eea6`
MD5	`f177844cac5bfec43d52ba1df68b1be9`
BLAKE2b-256	`65567e81dea623b189c795e81bac421f7b56a285cf79a0492a66f3d043e35672`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fastembed_imagin_studio-0.7.5.3.tar.gz:

Publisher: publish.yml on IMAGIN-studio/fastembed-imagin-studio

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fastembed_imagin_studio-0.7.5.3.tar.gz
- Subject digest: db5ac29406ba201078aa6f3aec04f9081d9974786f171eb0d363107e5693eea6
- Sigstore transparency entry: 1174457730
- Sigstore integration time: Mar 24, 2026
Source repository:
- Permalink: IMAGIN-studio/fastembed-imagin-studio@b44c11723f78176fe7b6a3daa888d80e8d82b561
- Branch / Tag: refs/tags/v0.7.5.3
- Owner: https://github.com/IMAGIN-studio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b44c11723f78176fe7b6a3daa888d80e8d82b561
- Trigger Event: push

File details

Details for the file fastembed_imagin_studio-0.7.5.3-py3-none-any.whl.

File metadata

Download URL: fastembed_imagin_studio-0.7.5.3-py3-none-any.whl
Upload date: Mar 24, 2026
Size: 117.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for fastembed_imagin_studio-0.7.5.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`925cfacd2d3e34a5e3476785be6f5bab0018caf5b7735b390aca3d085ee909b0`
MD5	`6d9b5cb9538ce385d428074ec731a06a`
BLAKE2b-256	`d0290b2687a4286e8ef49f017ab2db69cf2faf47705d5eb743a2861e1f5003d0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for fastembed_imagin_studio-0.7.5.3-py3-none-any.whl:

Publisher: publish.yml on IMAGIN-studio/fastembed-imagin-studio

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: fastembed_imagin_studio-0.7.5.3-py3-none-any.whl
- Subject digest: 925cfacd2d3e34a5e3476785be6f5bab0018caf5b7735b390aca3d085ee909b0
- Sigstore transparency entry: 1174457762
- Sigstore integration time: Mar 24, 2026
Source repository:
- Permalink: IMAGIN-studio/fastembed-imagin-studio@b44c11723f78176fe7b6a3daa888d80e8d82b561
- Branch / Tag: refs/tags/v0.7.5.3
- Owner: https://github.com/IMAGIN-studio
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b44c11723f78176fe7b6a3daa888d80e8d82b561
- Trigger Event: push

fastembed-imagin-studio 0.7.5.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

⚡️ What is FastEmbed?

📈 Why FastEmbed?

🚀 Installation

📖 Quickstart

🎒 Dense text embeddings

🔱 Sparse text embeddings

🦥 Late interaction models (aka ColBERT)

🖼️ Image embeddings

Late interaction multimodal models (ColPali)

🔄 Rerankers

⚡️ FastEmbed on a GPU

Usage with Qdrant

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance