llama-index embeddings voyageai integration

These details have not been verified by PyPI

Project description

LlamaIndex Embeddings Integration: VoyageAI

The llama-index-embeddings-voyageai package contains LlamaIndex integrations for building applications with VoyageAI's state-of-the-art embedding models. This integration provides support for text embeddings, multimodal embeddings, and contextual embeddings via the VoyageAI API.

Installation

pip install llama-index-embeddings-voyageai

Setup

1. Get Your API Key

2. Set Environment Variable

Export your API key as an environment variable:

export VOYAGE_API_KEY="your-api-key-here"

Usage

Basic Usage

from llama_index.embeddings.voyageai import VoyageEmbedding

# Initialize the VoyageAI Embedding model
embedding_model = VoyageEmbedding(
    model_name="voyage-3.5",
    voyage_api_key="your-api-key",  # Optional if VOYAGE_API_KEY is set
)

# Get a single embedding
embedding = embedding_model.get_text_embedding("Your text here")
print(f"Embedding dimension: {len(embedding)}")

# Get embeddings for multiple texts
texts = ["Text 1", "Text 2", "Text 3"]
embeddings = embedding_model.get_text_embedding_batch(texts)
print(f"Number of embeddings: {len(embeddings)}")

Query vs Document Embeddings

VoyageAI embeddings distinguish between queries and documents for optimal retrieval performance:

from llama_index.embeddings.voyageai import VoyageEmbedding

embedding_model = VoyageEmbedding(model_name="voyage-3.5")

# Get query embedding (automatically uses input_type="query")
query_embedding = embedding_model.get_query_embedding(
    "What is machine learning?"
)

# Get document embedding (automatically uses input_type="document")
doc_embedding = embedding_model.get_text_embedding("Machine learning is...")

Advanced Parameters

from llama_index.embeddings.voyageai import VoyageEmbedding

embedding_model = VoyageEmbedding(
    model_name="voyage-3.5",
    voyage_api_key="your-api-key",
    truncation=True,  # Enable text truncation
    output_dtype="float",  # Options: "float", "int8", "uint8", "binary", "ubinary"
    output_dimension=512,  # Reduce dimensionality (256, 512, 1024, 2048)
    embed_batch_size=128,  # Batch size for processing
)

# Use general text embedding with custom input type
embedding = embedding_model.get_general_text_embedding(
    "Your text here", input_type="query"
)

Multimodal Embeddings

VoyageAI supports multimodal embeddings for text and images with voyage-multimodal-3, and text, images, and video with voyage-multimodal-3.5. Important: You must set truncation=True when using multimodal models.

from llama_index.embeddings.voyageai import VoyageEmbedding
from io import BytesIO

# Initialize with multimodal model (truncation=True is REQUIRED)
embedding_model = VoyageEmbedding(
    model_name="voyage-multimodal-3",  # or "voyage-multimodal-3.5" for video support
    truncation=True,  # Required for multimodal models
)

# Embed an image from file path (PNG, JPEG, JPG, WEBP, GIF supported)
image_embedding = embedding_model.get_image_embedding("path/to/image.jpg")
print(f"Image embedding dimension: {len(image_embedding)}")  # 1024

# Embed an image from BytesIO
with open("path/to/image.png", "rb") as f:
    image_data = BytesIO(f.read())
    image_embedding = embedding_model.get_image_embedding(image_data)

# The multimodal model also works with text
text_embedding = embedding_model.get_text_embedding("Description of the image")
query_embedding = embedding_model.get_query_embedding(
    "Find images with red color"
)

# Batch text embeddings
batch_embeddings = embedding_model.get_text_embedding_batch(
    ["Image description 1", "Image description 2", "Image description 3"]
)

Video Embeddings (voyage-multimodal-3.5 only)

from llama_index.embeddings.voyageai import VoyageEmbedding

# Initialize with voyage-multimodal-3.5 for video support
embedding_model = VoyageEmbedding(
    model_name="voyage-multimodal-3.5",
    truncation=True,
)

# Embed a single video (max 20MB, supports MP4, MPEG, MOV, AVI, FLV, MPG, WEBM, WMV, 3GP)
video_embedding = embedding_model.get_video_embedding("path/to/video.mp4")
print(f"Video embedding dimension: {len(video_embedding)}")  # 1024

# Embed multiple videos
video_embeddings = embedding_model.get_video_embeddings(
    ["video1.mp4", "video2.mp4", "video3.mp4"]
)

# Async video embedding
video_embedding = await embedding_model.aget_video_embedding(
    "path/to/video.mp4"
)

Contextual Embeddings

For enhanced context-aware embeddings using the voyage-context-3 model:

from llama_index.embeddings.voyageai import VoyageEmbedding

# Initialize with contextual model
embedding_model = VoyageEmbedding(
    model_name="voyage-context-3", output_dtype="float", output_dimension=1024
)

# The model will use contextualized_embed internally
# providing enhanced embeddings with better context understanding
embeddings = embedding_model.get_text_embedding_batch(
    ["First document chunk", "Second document chunk", "Third document chunk"]
)

Async Usage

The integration supports async operations for better performance:

import asyncio
from llama_index.embeddings.voyageai import VoyageEmbedding


async def get_embeddings_async():
    # Regular text embeddings
    embedding_model = VoyageEmbedding(model_name="voyage-3.5")

    # Get async query embedding
    query_embedding = await embedding_model.aget_query_embedding("Your query")

    # Get async text embeddings
    embeddings = await embedding_model.aget_text_embedding_batch(
        ["Text 1", "Text 2", "Text 3"]
    )

    # For multimodal image embeddings
    multimodal_model = VoyageEmbedding(
        model_name="voyage-multimodal-3",
        truncation=True,  # Required for multimodal
    )
    image_embedding = await multimodal_model.aget_image_embedding(
        "path/to/image.jpg"
    )

    return query_embedding, embeddings, image_embedding


# Run async function
results = asyncio.run(get_embeddings_async())

Integration with LlamaIndex

from llama_index.core import VectorStoreIndex, Settings, Document
from llama_index.embeddings.voyageai import VoyageEmbedding
from llama_index.llms.openai import OpenAI

# Configure LlamaIndex settings
Settings.llm = OpenAI()
Settings.embed_model = VoyageEmbedding(
    model_name="voyage-3.5", voyage_api_key="your-api-key"
)

# Create documents
documents = [
    Document(text="LlamaIndex is a data framework for LLM applications."),
    Document(text="VoyageAI provides state-of-the-art embedding models."),
    Document(text="Embeddings convert text into numerical vectors."),
]

# Create vector index
index = VectorStoreIndex.from_documents(documents)

# Query the index
query_engine = index.as_query_engine(similarity_top_k=2)
response = query_engine.query("What is LlamaIndex?")
print(response)

Available Models

VoyageAI offers several specialized embedding models:

Text Embeddings

voyage-4: General-purpose and multilingual retrieval with 1024 dimensions (supports 256, 512, 1024, 2048)
voyage-4-lite: Cost and latency optimized with highest throughput, 1024 dimensions (supports 256, 512, 1024, 2048)
voyage-4-large: Best retrieval quality in voyage-4 series, 1024 dimensions (supports 256, 512, 1024, 2048)
voyage-3.5: Latest general-purpose model with 1024 dimensions (supports 256, 512, 1024, 2048)
voyage-3.5-lite: Cost and latency optimized variant with 1024 dimensions (supports 256, 512, 1024, 2048)
voyage-3-large: Best for general-purpose and multilingual retrieval, 1024 dimensions (supports 256, 512, 1024, 2048)
voyage-code-3: Specialized for code retrieval, 1024 dimensions (supports 256, 512, 1024, 2048)
voyage-3: General-purpose model (1024 dimensions)
voyage-3-lite: Lightweight variant (512 dimensions)

Domain-Specific Models

voyage-finance-2: Optimized for financial documents (1024 dimensions)
voyage-law-2: Specialized for legal documents (1024 dimensions)
voyage-multilingual-2: Enhanced multilingual support (1024 dimensions)

Specialized Models

voyage-multimodal-3: Supports text and image embeddings (1024 dimensions)
voyage-multimodal-3.5: Supports text, image, and video embeddings (1024 dimensions, supports 256, 512, 2048). Currently in preview.
voyage-context-3: Enhanced contextual embeddings with 32K batch token limit (1024 dimensions)

Legacy Models

voyage-2: Earlier generation model (1024 dimensions)
voyage-large-2: Large variant (1536 dimensions)
voyage-large-2-instruct: Large instruct variant (1024 dimensions)
voyage-code-2: Code embedding model (1536 dimensions)

For the latest model information, visit the VoyageAI documentation.

Configuration Options

Parameter	Type	Default	Description
`model_name`	str	Required	The embedding model to use
`voyage_api_key`	str	`None`	VoyageAI API key (falls back to VOYAGE_API_KEY env var)
`embed_batch_size`	int	`1000`	Batch size for embedding calls (max 1000)
`truncation`	bool	`None`	Enable text truncation for long inputs
`output_dtype`	str	`None`	Output format: "float", "int8", "uint8", "binary", "ubinary"
`output_dimension`	int	`None`	Reduce dimensionality (256, 512, 1024, 2048, model-dependent)
`callback_manager`	CallbackManager	`None`	LlamaIndex callback manager for observability

Features

Dynamic Batching: Automatically batches requests based on token limits for each model
Token Management: Respects per-model token limits (ranging from 32K to 1M tokens)
Multimodal Support: Process text, images, and videos with multimodal models
Video Embeddings: Embed video content with voyage-multimodal-3.5 (requires voyageai>=0.3.6)
Contextual Embeddings: Enhanced context-aware embeddings with specialized models
Async Support: Full async/await support for better performance
Flexible Output: Support for various output data types and dimensions
Auto-truncation: Optional text truncation for inputs exceeding model limits

API Batch Token Limits

These limits represent the maximum total tokens that can be sent in a single API request (across all texts in the batch):

Model	Batch Token Limit
voyage-4-lite	1,000,000
voyage-3.5-lite	1,000,000
voyage-4	320,000
voyage-3.5	320,000
voyage-multimodal-3	320,000
voyage-multimodal-3.5	320,000
voyage-2	320,000
voyage-4-large	120,000
voyage-3-large	120,000
voyage-code-3	120,000
voyage-large-2-instruct	120,000
voyage-finance-2	120,000
voyage-multilingual-2	120,000
voyage-law-2	120,000
voyage-large-2	120,000
voyage-3	120,000
voyage-3-lite	120,000
voyage-code-2	120,000
voyage-context-3	32,000

Note: The maximum batch size is 1,000 items per API request. The integration automatically handles batching based on both token limits and batch size.

Environment Variables

Variable	Description
`VOYAGE_API_KEY`	VoyageAI API key (required)

Error Handling

The integration includes proper error handling for:

Missing or invalid API keys
Unsupported image formats (for multimodal models)
Invalid model selection
Network errors and API failures
Token limit violations

Additional Information

For more information about VoyageAI and its embedding models:

License

This project is licensed under the MIT License.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.6.0

Mar 12, 2026

0.5.3

Jan 21, 2026

0.5.2

Dec 22, 2025

0.5.1

Nov 14, 2025

0.5.0

Oct 27, 2025

0.4.2

Sep 8, 2025

0.4.1

Aug 7, 2025

0.4.0

Jul 30, 2025

0.3.6

May 20, 2025

0.3.5

Feb 2, 2025

0.3.4

Dec 18, 2024

0.3.3

Dec 16, 2024

0.3.2

Dec 15, 2024

0.3.1

Dec 10, 2024

0.3.0

Nov 17, 2024

0.2.2

Sep 22, 2024

0.2.1

Aug 23, 2024

0.2.0

Aug 22, 2024

0.1.4

Apr 1, 2024

0.1.3

Mar 8, 2024

0.1.2

Feb 21, 2024

0.1.1

Feb 12, 2024

0.1.0

Feb 10, 2024

0.0.1

Feb 3, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_embeddings_voyageai-0.6.0.tar.gz (9.7 kB view details)

Uploaded Mar 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llama_index_embeddings_voyageai-0.6.0-py3-none-any.whl (9.4 kB view details)

Uploaded Mar 12, 2026 Python 3

File details

Details for the file llama_index_embeddings_voyageai-0.6.0.tar.gz.

File metadata

Download URL: llama_index_embeddings_voyageai-0.6.0.tar.gz
Upload date: Mar 12, 2026
Size: 9.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for llama_index_embeddings_voyageai-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`12c469e651132c84f7da7f3c6173fff805d65d724b4b47bbf39c3e34b939b63b`
MD5	`10287ebd8bf199c6b1b29a6e62714822`
BLAKE2b-256	`18990ed3e06b06ebdd281810df17247646546dbf3e90df8c12834d519c7074c5`

See more details on using hashes here.

File details

Details for the file llama_index_embeddings_voyageai-0.6.0-py3-none-any.whl.

File metadata

Download URL: llama_index_embeddings_voyageai-0.6.0-py3-none-any.whl
Upload date: Mar 12, 2026
Size: 9.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for llama_index_embeddings_voyageai-0.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`daa8cf471b8b0051cead2e43e48d5f09e242328904c5af67d429d30c567bb68c`
MD5	`3d394cf8a5cda014a3c587039a07e156`
BLAKE2b-256	`8cbbc430e20e3ef6bdc7ffc9c23e7eec2f87d9eb7d4642b573363220aca1c668`

See more details on using hashes here.

llama-index-embeddings-voyageai 0.6.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

LlamaIndex Embeddings Integration: VoyageAI

Installation

Setup

1. Get Your API Key

2. Set Environment Variable

Usage

Basic Usage

Query vs Document Embeddings

Advanced Parameters

Multimodal Embeddings

Video Embeddings (voyage-multimodal-3.5 only)

Contextual Embeddings

Async Usage

Integration with LlamaIndex

Available Models

Text Embeddings

Domain-Specific Models

Specialized Models

Legacy Models

Configuration Options

Features

API Batch Token Limits

Environment Variables

Error Handling

Additional Information

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes