Custom hybrid retriever with alpha tuning and routing.

These details have not been verified by PyPI

Project description

Koda Retriever

This retriever is a custom fine-tunable Hybrid Retriever that dynamically determines the optimal alpha for a given query. An LLM is used to categorize the query and therefore determine the optimal alpha value, as each category has a preset/provided alpha value. It is recommended that you run tests on your corpus of data and queries to determine categories and corresponding alpha values for your use case.

koda-retriever-mascot

Disclaimer

The default categories and alpha values are not recommended for production use

Introduction

Alpha tuning in hybrid retrieval for RAG models refers to the process of adjusting the weight (alpha) given to different components of a hybrid search strategy. In RAG, the retrieval component is crucial for fetching relevant context from a knowledge base, which the generation component then uses to produce answers. By fine-tuning the alpha parameter, the balance between the retrieved results from dense vector search methods and traditional sparse methods can be optimized. This optimization aims to enhance the overall performance of the system, ensuring that the retrieval process effectively supports the generation of accurate and contextually relevant responses.

Simply explained

Imagine you're playing a game where someone whispers a sentence to you, and you have to decide whether to draw a picture of exactly what they said, or draw a picture of what you think they mean. Alpha tuning is like finding the best rule for when to draw exactly what's said and when to think deeper about the meaning. It helps us get the best mix, so the game is more fun and everyone understands each other better!

Usage Snapshot

Koda Retriever is compatible with all other retrieval interfaces and objects that would normally be able to interact with an LI-native retriever.

Please see the examples folder for more specific examples.

# Setup
from llama_index.packs.koda_retriever import KodaRetriever
from llama_index.core import VectorStoreIndex
from llama_index.llms.openai import OpenAI
from llama_index.embeddings.openai import OpenAIEmbedding
from llama_index.core.postprocessor import LLMRerank
from llama_index.core import Settings

Settings.llm = OpenAI()
Settings.embed_model = OpenAIEmbedding()
vector_store = PineconeVectorStore(pinecone_index=index, text_key="summary")
vector_index = VectorStoreIndex.from_vector_store(
    vector_store=vector_store, embed_model=Settings.embed_model
)

reranker = LLMRerank(llm=Settings.llm)  # optional
retriever = KodaRetriever(
    index=vector_index, llm=Settings.llm, reranker=reranker, verbose=True
)

# Retrieval
query = "What was the intended business model for the parks in the Jurassic Park lore?"

results = retriever.retrieve(query)

# Query Engine
query_engine = RetrieverQueryEngine.from_args(retriever=retriever)

response = query_engine.query(query)

Prerequisites

Vector Store Index w/ hybrid search enabled
LLM (or any model to route/classify prompts)

Please note that you will also need vector AND text representations of your data for a hybrid retriever to work. It is not uncommon for some vector databases to only store the vectors themselves, in which case an error will occur downstream if you try to run any hybrid queries.

Setup

Citations

Idea & original implementation sourced from the following docs:

Buy me a coffee

Thanks!

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.1

Sep 8, 2025

This version

0.4.0

Jul 31, 2025

0.3.0

Nov 18, 2024

0.2.0

Aug 22, 2024

0.1.1

Mar 15, 2024

0.1.0

Feb 26, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_index_packs_koda_retriever-0.4.0.tar.gz (8.8 kB view details)

Uploaded Jul 31, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llama_index_packs_koda_retriever-0.4.0-py3-none-any.whl (9.6 kB view details)

Uploaded Jul 31, 2025 Python 3

File details

Details for the file llama_index_packs_koda_retriever-0.4.0.tar.gz.

File metadata

Download URL: llama_index_packs_koda_retriever-0.4.0.tar.gz
Upload date: Jul 31, 2025
Size: 8.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.13

File hashes

Hashes for llama_index_packs_koda_retriever-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`e7fa59dd7d5b98e17452b4e8090e4da5882e284a427905c8f429f6046f18043d`
MD5	`c64540657ed30ea5c8f5e2bb1681a90c`
BLAKE2b-256	`15a6237bfada293b0dcf1ae17cf664eda7fe1d3a5ae7b16bf30788bbb207986a`

See more details on using hashes here.

File details

Details for the file llama_index_packs_koda_retriever-0.4.0-py3-none-any.whl.

File metadata

Download URL: llama_index_packs_koda_retriever-0.4.0-py3-none-any.whl
Upload date: Jul 31, 2025
Size: 9.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.13

File hashes

Hashes for llama_index_packs_koda_retriever-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`830d7f5aead65a9c6f88ce148369641ea32edbdffe235e723f07879aecff9ff9`
MD5	`f6451dcbdc43bb32220b8d545fb82e55`
BLAKE2b-256	`717af4977843847a298baf97c39d90a9582efed1e3b83b4b6baa635527f3d08c`

See more details on using hashes here.

llama-index-packs-koda-retriever 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Koda Retriever

Disclaimer

Introduction

Simply explained

Usage Snapshot

Prerequisites

Setup

Citations

Buy me a coffee

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes