A simple, unified interface for RAG across multiple vector store backends (txtai, Chroma, Qdrant, LightRAG)

These details have not been verified by PyPI

Project links

Project description

bibirags

A simple, unified interface for RAG (Retrieval-Augmented Generation) across multiple vector store backends.

bibirags wraps txtai, Chroma, Qdrant, and LightRAG behind a consistent three-function API so you can swap backends without rewriting your pipeline.

save_<backend>(chunks, rag_root, conf, ...)  → index documents
search_<backend>(query, rag_root, conf, ...) → retrieve chunks
query_<backend>(query, rag_root, conf, ...)  → retrieve + answer

All LLM and embedding calls go through LiteLLM via a single LitellmConfDict, meaning any model provider (OpenAI, Anthropic, Cohere, Ollama, etc.) works out of the box.

Installation

Install the core package plus the backends you need:

# Qdrant only
pip install bibirags[qdrant]

# Chroma only
pip install bibirags[chroma]

# txtai only
pip install bibirags[txtai]

# LightRAG only
pip install bibirags[lightrag]

# All backends + document loading helpers
pip install bibirags[all]

Add [docs] to get PDF and TXT loading via LangChain:

pip install bibirags[qdrant,docs]

Quick start

Build a `LitellmConfDict`

Every function takes a single conf dict instead of scattered llm_model / embed_model / api_key arguments:

from bibirags import LitellmConfDict

# OpenAI
conf: LitellmConfDict = {
    "embed_model": "text-embedding-3-small",
    "llm_model": "gpt-4o-mini",
    "api_key": "sk-...",          # falls back to OPENAI_API_KEY env var
}

# Ollama (local)
conf: LitellmConfDict = {
    "embed_model": "ollama/bge-m3:latest",
    "llm_model": "ollama/gemma3:8b",
    "api_base": "http://localhost:11434",
}

# Any LiteLLM-compatible proxy
conf: LitellmConfDict = {
    "embed_model": "openai/text-embedding-3-small",
    "llm_model": "openai/gpt-4o",
    "api_base": "https://my-proxy.example.com/v1",
    "api_key": "proxy-key",
}

Index raw text chunks

from bibirags import save_qdrant, search_qdrant, query_qdrant

chunks = [
    "The Eiffel Tower was completed in 1889.",
    "The Louvre is the world's largest art museum.",
    "Paris is the capital of France.",
]

conf = {"embed_model": "text-embedding-3-small", "llm_model": "gpt-4o-mini"}
rag_root = "./my_rag_index"

# 1. Index
save_qdrant(chunks, rag_root, conf)

# 2. Semantic search
results = search_qdrant("When was the Eiffel Tower built?", rag_root, conf)

# 3. RAG query → answer + source chunks
answer, sources = query_qdrant("When was the Eiffel Tower built?", rag_root, conf)
print(answer)

Load documents from disk

from bibirags import chunk_docs, save_chroma, query_chroma

conf = {"embed_model": "text-embedding-3-small", "llm_model": "gpt-4o"}
chunks = chunk_docs("./my_docs/", chunk_size=800, chunk_overlap=120)

save_chroma(chunks, "./chroma_index", conf)

answer, sources = query_chroma(
    "What does the contract say about termination?",
    rag_root="./chroma_index",
    conf=conf,
)

Using Ollama (local models)

from bibirags import save_txtai, query_txtai

conf = {
    "embed_model": "ollama/bge-m3:latest",
    "llm_model": "ollama/gemma3:8b",
    "api_base": "http://localhost:11434",
}

save_txtai(chunks, "./txtai_index", conf)
answer, sources = query_txtai("What happened in the news?", "./txtai_index", conf)

Backends at a glance

Backend	Best for	Index format	Notes
Qdrant	Production workloads, filtering	Local files or server	Cosine similarity, rich payload filtering
Chroma	LangChain ecosystems	Local SQLite	Easy LangChain integration
txtai	All-in-one HuggingFace pipelines	SQLite + FAISS	Built-in pipeline support
LightRAG	Knowledge-graph RAG	Local JSON + vector	Graph-enhanced hybrid retrieval

API reference

`LitellmConfDict`

class LitellmConfDict(TypedDict, total=False):
    embed_model: str   # required for save/search/query
    llm_model:   str   # required for query
    api_base:    str   # optional – custom API endpoint
    api_key:     str   # optional – falls back to env vars

`chunk_docs`

chunk_docs(docs_path, chunk_size=800, chunk_overlap=120) → list[str]

Recursively loads .pdf and .txt files from docs_path and returns text chunks.

`save_<backend>`

save_qdrant(chunks, rag_root, conf)
save_chroma(chunks, rag_root, conf)
save_txtai(chunks, rag_root, conf)
save_lightrag(chunks, rag_root, conf)

`search_<backend>`

results: list[str] = search_qdrant(query, rag_root, conf, top_k=3)

Returns the top_k most relevant chunk texts.

`query_<backend>`

answer, sources = query_qdrant(query, rag_root, conf, top_k=3)

Returns (answer_string, list_of_source_chunks).

Contributing

git clone https://github.com/yourname/bibirags
cd bibirags
pip install -e ".[dev]"
pre-commit install
pytest

License

MIT – see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.3

Jun 8, 2026

This version

0.1.2

Jun 4, 2026

0.1.1

May 30, 2026

0.1.0

May 30, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bibirags-0.1.2.tar.gz (17.8 kB view details)

Uploaded Jun 4, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bibirags-0.1.2-py3-none-any.whl (16.9 kB view details)

Uploaded Jun 4, 2026 Python 3

File details

Details for the file bibirags-0.1.2.tar.gz.

File metadata

Download URL: bibirags-0.1.2.tar.gz
Upload date: Jun 4, 2026
Size: 17.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.14

File hashes

Hashes for bibirags-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`a006ee774bb622061aaa613d4d5a7da5b2cc0af9313e0687a7df2e89586f88ee`
MD5	`5d0c7124f5516451ca66909a57a42a0a`
BLAKE2b-256	`167e06b4108402f2deccb45b942d9dc381bdad4e9ceaa2b4ce7e6e769de9572c`

See more details on using hashes here.

File details

Details for the file bibirags-0.1.2-py3-none-any.whl.

File metadata

Download URL: bibirags-0.1.2-py3-none-any.whl
Upload date: Jun 4, 2026
Size: 16.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.6.14

File hashes

Hashes for bibirags-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4e44619195869f83250228c6b6b388b8d8c5e0b07b418d94835904a2aa262726`
MD5	`096a4585f9b4433db9dd27ced613a8e7`
BLAKE2b-256	`a94a8559d8c87eb4d2de784e4d3fd9d52bf30b0311ea6e572a44112ab83bab46`

See more details on using hashes here.

bibirags 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

bibirags

Installation

Quick start

Build a LitellmConfDict

Index raw text chunks

Load documents from disk

Using Ollama (local models)

Backends at a glance

API reference

LitellmConfDict

chunk_docs

save_<backend>

search_<backend>

query_<backend>

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Build a `LitellmConfDict`

`LitellmConfDict`

`chunk_docs`

`save_<backend>`

`search_<backend>`

`query_<backend>`