Sentence Transformers: Embeddings, Retrieval, and Reranking

This framework provides an easy way to access, use, and train state-of-the-art embedding and reranker models. It can be used to compute embeddings using Sentence Transformer models (quickstart), to calculate similarity scores using Cross-Encoder (a.k.a. reranker) models (quickstart), or to generate sparse embeddings using Sparse Encoder models (quickstart). This unlocks a wide range of applications, including semantic search, semantic textual similarity, and paraphrase mining.

Over 15,000 pre-trained Sentence Transformers models are available for immediate use on 🤗 Hugging Face, including many of the state-of-the-art models from the Massive Text Embedding Benchmark (MTEB) leaderboard. It is also easy to train or finetune your own embedding, reranker, or sparse encoder models using Sentence Transformers, so you can create custom models for your specific use cases.

For the full documentation, see www.SBERT.net.

Installation

We recommend Python 3.10+, PyTorch 1.11.0+, and transformers v4.34.0+.

Install with pip

pip install -U sentence-transformers

Install with conda

conda install -c conda-forge sentence-transformers

Install from sources

Alternatively, clone the latest version from the repository and install it directly from the source code:

pip install -e .

PyTorch with CUDA

If you want to use a GPU / CUDA, you must install PyTorch with the matching CUDA version. Follow PyTorch - Get Started for details on how to install PyTorch.

Getting Started

See Quickstart in our documentation.

Embedding Models

First download a pretrained embedding a.k.a. Sentence Transformer model.

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")

Then provide some texts to the model.

sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# => (3, 384)

And that's already it. We now have a NumPy array of embeddings with one row per text. We can use these to compute similarities.

similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.6660, 0.1046],
#         [0.6660, 1.0000, 0.1411],
#         [0.1046, 0.1411, 1.0000]])
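For intuition about what the similarity matrix contains, here is a minimal pure-Python sketch of pairwise cosine similarity. It assumes the model's similarity function is cosine similarity (the function is configurable per model, so treat this as an assumption), and it uses toy 3-dimensional vectors instead of real 384-dimensional embeddings. The helper names `cosine_similarity` and `similarity_matrix` are illustrative and not part of the library.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def similarity_matrix(vectors):
    """Pairwise cosine similarities, one row and one column per vector."""
    return [[cosine_similarity(u, v) for v in vectors] for u in vectors]


# Toy stand-ins for embeddings (hypothetical values, not model output).
toy_embeddings = [
    [1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

matrix = similarity_matrix(toy_embeddings)
for row in matrix:
    print([round(value, 4) for value in row])
```

As in the real output above, the diagonal is 1 (each vector compared with itself) and the matrix is symmetric.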

Reranker Models

First download a pretrained reranker a.k.a. Cross Encoder model.

from sentence_transformers import CrossEncoder

# 1. Load a pretrained CrossEncoder model
model = CrossEncoder("cross-encoder/ms-marco-MiniLM-L6-v2")

Then provide some texts to the model.

# The texts for which to predict similarity scores
query = "How many people live in Berlin?"
passages = [
    "Berlin had a population of 3,520,031 registered inhabitants in an area of 891.82 square kilometers.",
    "Berlin has a yearly total of about 135 million day visitors, making it one of the most-visited cities in the European Union.",
    "In 2013 around 600,000 Berliners were registered in one of the more than 2,300 sport and fitness clubs.",
]

# 2a. predict scores for pairs of texts
scores = model.predict([(query, passage) for passage in passages])
print(scores)
# => [8.607139 5.506266 6.352977]

And we're good to go. You can also use model.rank to avoid having to perform the reranking manually:

# 2b. Rank a list of passages for a query
ranks = model.rank(query, passages, return_documents=True)

print("Query:", query)
for rank in ranks:
    print(f"- #{rank['corpus_id']} ({rank['score']:.2f}): {rank['text']}")
"""
Query: How many people live in Berlin?
- #0 (8.61): Berlin had a population of 3,520,031 registered inhabitants in an area of 891.82 square kilometers.
- #2 (6.35): In 2013 around 600,000 Berliners were registered in one of the more than 2,300 sport and fitness clubs.
- #1 (5.51): Berlin has a yearly total of about 135 million day visitors, making it one of the most-visited cities in the European Union.
"""
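To show what "reranking manually" would involve, the sketch below sorts the scores printed by the predict call above in descending order. It reproduces only the ordering step; the helper name `rank_by_score` is hypothetical, and the exact internals of model.rank are not claimed here.

```python
# Scores copied from the predict() output above, one per passage.
scores = [8.607139, 5.506266, 6.352977]


def rank_by_score(scores):
    """Return [{'corpus_id': i, 'score': s}, ...] sorted by descending score."""
    ranked = [{"corpus_id": i, "score": s} for i, s in enumerate(scores)]
    return sorted(ranked, key=lambda item: item["score"], reverse=True)


manual_ranks = rank_by_score(scores)
for item in manual_ranks:
    print(f"- #{item['corpus_id']} ({item['score']:.2f})")
# - #0 (8.61)
# - #2 (6.35)
# - #1 (5.51)
```

The resulting order of corpus ids (0, 2, 1) matches the model.rank output shown above.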

Sparse Encoder Models

First download a pretrained sparse embedding a.k.a. Sparse Encoder model.

from sentence_transformers import SparseEncoder

# 1. Load a pretrained SparseEncoder model
model = SparseEncoder("naver/splade-cocondenser-ensembledistil")

# The sentences to encode
sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]

# 2. Calculate sparse embeddings by calling model.encode()
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 30522] - sparse representation with vocabulary size dimensions

# 3. Calculate the embedding similarities
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[   35.629,     9.154,     0.098],
#         [    9.154,    27.478,     0.019],
#         [    0.098,     0.019,    29.553]])

# 4. Check sparsity stats
stats = SparseEncoder.sparsity(embeddings)
print(f"Sparsity: {stats['sparsity_ratio']:.2%}")
# Sparsity: 99.84%
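The two numbers above can be illustrated without the library. The sketch below assumes that sparse similarity is a plain dot product (which the unnormalized diagonal values above suggest, but treat it as an assumption) and that the sparsity ratio is the fraction of zero-valued dimensions. Sparse vectors are represented as dicts from dimension index to weight; the helper names and the toy weights are hypothetical.

```python
def sparse_dot(a, b):
    """Dot product of two sparse vectors stored as {dimension_index: weight}."""
    if len(a) > len(b):
        a, b = b, a  # iterate over the smaller dict
    return sum(weight * b.get(index, 0.0) for index, weight in a.items())


def sparsity_ratio(vector, num_dimensions):
    """Fraction of dimensions that are zero, i.e. not stored in the dict."""
    return 1.0 - len(vector) / num_dimensions


VOCAB_SIZE = 30522  # matches the embedding width printed above

# Hypothetical dimension -> weight maps, not real SPLADE output.
doc_a = {101: 1.5, 2054: 0.5, 4633: 2.0}
doc_b = {2054: 1.0, 4633: 0.5, 9999: 3.0}

score = sparse_dot(doc_a, doc_b)  # only dimensions 2054 and 4633 overlap
ratio = sparsity_ratio(doc_a, VOCAB_SIZE)
print(score)
print(f"Sparsity: {ratio:.2%}")
```

Only dimensions that are active in both vectors contribute to the score, which is why unrelated sentences score close to zero.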

Pre-Trained Models

We provide a large list of pretrained models for more than 100 languages. Some are general-purpose models, while others produce embeddings for specific use cases.

Training

This framework allows you to fine-tune your own sentence embedding models so that you get task-specific sentence embeddings. You have various options to choose from to get embeddings well suited to your specific task.

Embedding Models
- Sentence Transformer > Training Overview
- Sentence Transformer > Training Examples or training examples on GitHub.
Reranker Models
- Cross Encoder > Training Overview
- Cross Encoder > Training Examples or training examples on GitHub.
Sparse Embedding Models
- Sparse Encoder > Training Overview
- Sparse Encoder > Training Examples or training examples on GitHub.

Some highlights across the different types of training are:

Support of various transformer networks including BERT, RoBERTa, XLM-R, DistilBERT, Electra, BART, ...
Multi-Lingual and multi-task learning
Evaluation during training to find the optimal model
20+ loss functions for embedding models, 10+ loss functions for reranker models and 10+ loss functions for sparse embedding models, allowing you to tune models specifically for semantic search, paraphrase mining, semantic similarity comparison, clustering, and more, with objectives such as triplet loss and contrastive loss.
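As a concrete picture of one of the objectives named above, here is a minimal pure-Python sketch of the textbook triplet loss, max(0, d(anchor, positive) - d(anchor, negative) + margin), using Euclidean distance. It is not the library's implementation; the function names, the toy vectors, and the margin values are assumptions chosen for illustration.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Zero once the negative is at least `margin` farther away than the positive."""
    gap = euclidean_distance(anchor, positive) - euclidean_distance(anchor, negative)
    return max(0.0, gap + margin)


anchor = [0.0, 0.0]
positive = [0.0, 1.0]  # distance 1 from the anchor
negative = [3.0, 4.0]  # distance 5 from the anchor

loss_small_margin = triplet_loss(anchor, positive, negative)              # 1 - 5 + 1 < 0
loss_large_margin = triplet_loss(anchor, positive, negative, margin=5.0)  # 1 - 5 + 5
print(loss_small_margin)  # 0.0
print(loss_large_margin)  # 1.0
```

Training with such a loss pulls the anchor toward the positive and pushes it away from the negative until the margin is satisfied.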

Application Examples

You can use this framework for:

Computing Sentence Embeddings
- Dense Embeddings
- Sparse Embeddings
Semantic Textual Similarity
- Dense STS
- Sparse STS
Semantic Search
- Dense Search
- Sparse Search
Retrieve & Re-Rank
- Dense only Retrieval
- Sparse/Dense/Hybrid Retrieval
Clustering
Paraphrase Mining
Translated Sentence Mining
Multilingual Image Search, Clustering & Duplicate Detection

and many more use-cases.

For all examples, see examples/sentence_transformer/applications.
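The Retrieve & Re-Rank pattern listed above can be sketched in two stages: a cheap vector search narrows the corpus to a few candidates, and a more expensive scorer re-orders them. The code below is a toy illustration in pure Python. The embeddings and reranker scores are hypothetical hard-coded values standing in for an embedding model and a cross-encoder, and the function names `retrieve` and `rerank` are not library APIs.

```python
def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def retrieve(query_vec, corpus_vecs, top_k):
    """Stage 1: return the corpus indices with the top_k highest dot products."""
    order = sorted(
        range(len(corpus_vecs)),
        key=lambda i: dot(query_vec, corpus_vecs[i]),
        reverse=True,
    )
    return order[:top_k]


def rerank(candidate_ids, score_fn):
    """Stage 2: re-order the candidates with a more expensive scorer."""
    return sorted(candidate_ids, key=score_fn, reverse=True)


# Hypothetical unit-length embeddings for a 4-document corpus.
corpus_vecs = [[1.0, 0.0], [0.8, 0.6], [0.0, 1.0], [-1.0, 0.0]]
query_vec = [1.0, 0.0]

# Hypothetical reranker scores keyed by corpus index; in practice these
# would come from scoring (query, document) pairs with a reranker model.
reranker_scores = {0: 2.1, 1: 7.4, 2: 0.3, 3: -4.0}

candidates = retrieve(query_vec, corpus_vecs, top_k=2)
final = rerank(candidates, score_fn=lambda i: reranker_scores[i])
print(candidates)  # [0, 1]
print(final)       # [1, 0]
```

The reranker only sees the two retrieved candidates, so its cost stays small even when the corpus is large.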

Development setup

After cloning the repo (or a fork) to your machine, in a virtual environment, run:

python -m pip install -e ".[dev]"

pre-commit install

To test your changes, run:

pytest

Citing & Authors

If you find this repository helpful, feel free to cite our publication Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks:

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

If you use one of the multilingual models, feel free to cite our publication Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation:

@inproceedings{reimers-2020-multilingual-sentence-bert,
    title = "Making Monolingual Sentence Embeddings Multilingual using Knowledge Distillation",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2020",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/2004.09813",
}

Please have a look at Publications for our different publications that are integrated into SentenceTransformers.

Maintainers

Maintainer: Tom Aarsen, 🤗 Hugging Face

Don't hesitate to open an issue if something is broken (and it shouldn't be) or if you have further questions.

This project was originally developed by the Ubiquitous Knowledge Processing (UKP) Lab at TU Darmstadt. We're grateful for their foundational work and continued contributions to the field.

This repository contains experimental software and is published for the sole purpose of giving additional background details on the respective publication.

Project details

Release history

Version	Released	Notes
5.3.0	Mar 12, 2026	This version
5.2.3	Feb 17, 2026
5.2.2	Jan 27, 2026
5.2.1	Jan 26, 2026
5.2.0	Dec 11, 2025
5.1.2	Oct 22, 2025
5.1.1	Sep 22, 2025
5.1.0	Aug 6, 2025
5.0.0	Jul 1, 2025
4.1.0	Apr 15, 2025
4.0.2	Apr 3, 2025
4.0.1	Mar 26, 2025
4.0.0	Mar 26, 2025
3.4.1	Jan 29, 2025
3.4.0	Jan 23, 2025
3.3.1	Nov 18, 2024
3.3.0	Nov 11, 2024
3.2.1	Oct 21, 2024
3.2.0	Oct 10, 2024
3.1.1	Sep 19, 2024
3.1.0	Sep 11, 2024
3.0.1	Jun 7, 2024
3.0.0	May 28, 2024
2.7.0	Apr 17, 2024
2.6.1	Mar 26, 2024
2.6.0	Mar 22, 2024
2.5.1	Mar 1, 2024
2.5.0	Feb 29, 2024
2.4.0	Feb 23, 2024
2.3.1	Jan 30, 2024
2.3.0	Jan 29, 2024
2.2.2	Jun 26, 2022
2.2.1	Jun 23, 2022
2.2.0	Feb 10, 2022
2.1.0	Oct 1, 2021
2.0.0	Jun 24, 2021
1.2.1	Jun 24, 2021
1.2.0	May 24, 2021
1.1.1	May 12, 2021
1.1.0	Apr 21, 2021
1.0.4	Apr 1, 2021
1.0.3	Mar 22, 2021
1.0.2	Mar 19, 2021
1.0.1	Mar 18, 2021
1.0.0	Mar 18, 2021
0.4.1.2	Jan 4, 2021
0.4.1.1	Jan 4, 2021
0.4.1	Jan 4, 2021
0.4.0	Dec 22, 2020
0.3.9	Nov 18, 2020
0.3.8	Oct 19, 2020
0.3.7.2	Oct 2, 2020
0.3.7.1	Oct 1, 2020
0.3.7	Sep 29, 2020
0.3.6	Sep 11, 2020
0.3.5.1	Sep 2, 2020
0.3.5	Sep 1, 2020
0.3.4	Aug 24, 2020
0.3.3	Aug 6, 2020
0.3.2	Jul 23, 2020
0.3.1	Jul 22, 2020
0.3.0	Jul 9, 2020
0.2.6.2	Jun 30, 2020
0.2.6.1	Apr 16, 2020
0.2.6	Apr 16, 2020	Yanked. Reason: Bug in the setup.py
0.2.5.1	Mar 13, 2020
0.2.5	Jan 10, 2020
0.2.4.1	Dec 6, 2019
0.2.4	Dec 6, 2019
0.2.3	Aug 20, 2019
0.2.2	Aug 19, 2019
0.2.1	Aug 16, 2019
0.2.0	Aug 16, 2019
0.1.0	Jul 25, 2019

Download files

Source Distribution

sentence_transformers-5.3.0.tar.gz (403.3 kB), uploaded Mar 12, 2026, Source

Built Distribution

sentence_transformers-5.3.0-py3-none-any.whl (512.4 kB), uploaded Mar 12, 2026, Python 3

File details

Details for the file sentence_transformers-5.3.0.tar.gz.

File metadata

Download URL: sentence_transformers-5.3.0.tar.gz
Upload date: Mar 12, 2026
Size: 403.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.6

File hashes

Hashes for sentence_transformers-5.3.0.tar.gz
Algorithm	Hash digest
SHA256	`414a0a881f53a4df0e6cbace75f823bfcb6b94d674c42a384b498959b7c065e2`
MD5	`11f4bc73cbb35350ca77bc963f878e95`
BLAKE2b-256	`fe26448453925b6ce0c29d8b54327caa71ee4835511aef02070467402273079c`

File details

Details for the file sentence_transformers-5.3.0-py3-none-any.whl.

File metadata

Download URL: sentence_transformers-5.3.0-py3-none-any.whl
Upload date: Mar 12, 2026
Size: 512.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.6

File hashes

Hashes for sentence_transformers-5.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dca6b98db790274a68185d27a65801b58b4caf653a4e556b5f62827509347c7d`
MD5	`77b2c9b91d888a54d6ba9971785ca0b4`
BLAKE2b-256	`e29c2fa7224058cad8df68d84bafee21716f30892cecc7ad1ad73bde61d23754`
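
The digests listed above can be used to check a downloaded file. The following is a minimal sketch using Python's standard hashlib module; the helper names `sha256_of_file` and `matches_expected` are illustrative, and the local file path you pass in is whatever location you downloaded the archive to.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Stream a file through SHA-256 and return the hex digest."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """True if the file's SHA-256 equals the published digest (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, calling `matches_expected` with the path of a downloaded wheel and the SHA256 value from the table above should return True for an intact download.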
