LLM2Vec-Gen: Generative Embeddings from Large Language Models

These details have not been verified by PyPI

Project links

Project description

LLM2VEC-GEN: Generative Embeddings from Large Language Models

LLM2Vec-Gen is a recipe to train interpretable, generative embeddings that encode the potential answer of an LLM to a query rather than the query itself.

Installation

Either use Pypi or clone the repository and install in editable mode:

pip install llm2vec-gen  # or pip install -e .

Usage

Load a pretrained model:

import torch
from llm2vec_gen import LLM2VecGenModel

model = LLM2VecGenModel.from_pretrained("McGill-NLP/LLM2Vec-Gen-Qwen3-8B")

As an example, you can use the model for retrieval using the following code snippet:

q_instruction = "Generate a passage that best answers this question: "
d_instruction = "Summarize the following passage: "

queries = [
  "where do polar bears live and what's their habitat",
  "what does disk cleanup mean on a computer"
]
q_reps = model.encode([q_instruction + q for q in queries])

documents = [
  "Polar bears live throughout the circumpolar North in the Arctic, spanning across Canada, Alaska (USA), Russia, Greenland, and Norway. Their primary habitat is sea ice over the continental shelf, which they use for hunting, mating, and traveling. They are marine mammals that rely on this environment to hunt seals.",
  "Disk Cleanup is a built-in Windows tool that frees up hard drive space by scanning for and deleting unnecessary files like temporary files, cached data, Windows updates, and items in the Recycle Bin. It improves computer performance by removing \"junk\" files, which can prevent the system from running slowly due to low storage.",
]
d_reps = model.encode([d_instruction + d for d in documents])

# Compute cosine similarity
q_reps_norm = torch.nn.functional.normalize(q_reps, p=2, dim=1)
d_reps_norm = torch.nn.functional.normalize(d_reps, p=2, dim=1)
cos_sim = torch.mm(q_reps_norm, d_reps_norm.transpose(0, 1))

print(cos_sim)
"""
tensor([[0.8750, 0.1182],
        [0.0811, 0.9336]])
"""

Note that in all examples, the instructions should be as if you are generating the answer to the input.
You may find other examples to try LLM2Vec-Gen in other tasks (e.g., classification and clustering) in the paper's GitHub repository.

LLM2Vec-Gen provides interpretable embeddings. You can use the following code to decode the content embedded in the embeddings:

_, recon_hidden_states = model.encode("what does disk cleanup mean on a computer", get_recon_hidden_states=True)
# recon_hidden_states: torch.Tensor with shape (1, compression token size, hidden_dim)

answer = model.generate(recon_hidden_states=recon_hidden_states, max_new_tokens=55)

print(answer)
"""
* **\n\n**Disk Cleanup** is a built-in utility in Windows that helps you **free up disk space** by **removing unnecessary files** from your computer. It is designed to clean up temporary files, system cache, and other files that are no longer needed.\n\n
"""

This code snippet will return the answer of the LLM2Vec-Gen model generated from the generative embeddings of the input (recon_hidden_states).

Citation

If you use this code, models, or data, please cite the LLM2Vec-Gen paper.

@article{behnamghader2026llm2vecgen,
  title={LLM2Vec-Gen: Generative Embeddings from Large Language Models},
  author={BehnamGhader, Parishad and Adlakha, Vaibhav and Schmidt, Fabian David and Chapados, Nicolas and Mosbach, Marius and Reddy, Siva},
  journal={arXiv preprint: arXiv:2603.10913},
  year={2026}
  url={https://arxiv.org/abs/2603.10913}
}

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.3

Apr 5, 2026

This version

0.1.2

Mar 12, 2026

0.1.1

Mar 12, 2026

0.1.0

Mar 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm2vec_gen-0.1.2.tar.gz (4.6 MB view details)

Uploaded Mar 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm2vec_gen-0.1.2-py3-none-any.whl (23.8 kB view details)

Uploaded Mar 12, 2026 Python 3

File details

Details for the file llm2vec_gen-0.1.2.tar.gz.

File metadata

Download URL: llm2vec_gen-0.1.2.tar.gz
Upload date: Mar 12, 2026
Size: 4.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for llm2vec_gen-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`f20065a64e3a852bc5d37b8a50893e4896693d88bec1e2627be638614cbd0709`
MD5	`de82605463fe309b870e5fe682482373`
BLAKE2b-256	`9779f384bdea1ee6c4620847c68efda0d89e845a184b1b432c88337b4d3ca174`

See more details on using hashes here.

File details

Details for the file llm2vec_gen-0.1.2-py3-none-any.whl.

File metadata

Download URL: llm2vec_gen-0.1.2-py3-none-any.whl
Upload date: Mar 12, 2026
Size: 23.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for llm2vec_gen-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c8a6ed4804179214e92d10dbb2d4bc0a66a49d595cbb79b6a0fba4949a781ced`
MD5	`b516bcf61cc64a56a3953ea9684b11f2`
BLAKE2b-256	`22051edfa7a25743e3639843fa106b8805e8d2ee442d4e28e7044041c91469a0`

See more details on using hashes here.

llm2vec-gen 0.1.2

Navigation

Verified details

Owner

Unverified details

Project links

Meta

Project description

LLM2VEC-GEN: Generative Embeddings from Large Language Models

Installation

Usage

Citation

Project details

Verified details

Owner

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes