
raglib-py

raglib-py is a modular, production-grade Retrieval-Augmented Generation (RAG) library for Python.

This README is the complete user guide: everything you need to get started is on this page.

Important package note:

  • PyPI package name: raglib-py
  • Python import name: raglib

What You Can Do

  • Load documents from files, folders, URLs, or raw text
  • Use one clean entry point: RAG(...)
  • Switch between local and cloud LLM providers
  • Choose embedding provider and vector store backend
  • Run 12 RAG strategies out of the box
  • Use interactive terminal chat mode

Installation

Install core package:

pip install raglib-py

Common optional extras:

# Local Ollama chat/embedding support
pip install "raglib-py[ollama]"

# Chroma vector DB backend
pip install "raglib-py[chroma]"

# DOCX/PDF/PPTX document loading
pip install "raglib-py[docx,pdf,pptx]"

# All major runtime extras
pip install "raglib-py[all]"

Quick import check:

python -c "from raglib import RAG; print('OK')"

Quickstart (Zero API Keys)

from raglib import RAG

rag = RAG("RAG improves grounded generation using retrieved context.")
result = rag.query("What does RAG improve?")

print(result.answer)
print([doc.id for doc in result.sources])

In this mode, raglib uses offline defaults automatically:

  • chat model: MockLLMClient
  • embeddings: MockEmbedding

Real Local Stack (Ollama)

If you have local models in Ollama, this is a strong default setup:

from raglib import RAG

rag = RAG(
	source=r"C:\path\to\your\document.docx",
	chat_llm="ollama",
	chat_model="gemma3:4b",
	embedding_llm="ollama",
	embedding_model="nomic-embed-text:latest",
	vector_db="chroma",
	rag_type="corrective",
	top_k=5,
)

result = rag.query("What is the core concept of this paper?")
print(result.answer)

Cloud Chat + Local Embeddings Example

from raglib import RAG

rag = RAG(
	source="Service test content.",
	chat_llm="groq",
	llm_key="YOUR_GROQ_API_KEY",
	chat_model="qwen/qwen3-32b",
	embedding_llm="ollama",
	embedding_model="nomic-embed-text:latest",
	vector_db="memory",
	rag_type="naive",
)

print(rag.query("Reply in one line that service is available.").answer)

Supported Input Sources

source accepts:

  • File path: .txt, .md, .docx, .pptx, .pdf
  • Folder path: recursive load
  • URL: web page text extraction
  • Raw text string
  • List of any mix of the above
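As a mental model for how these source kinds differ, the sketch below shows one way a mixed source value could be categorized before loading. This is purely illustrative; it is not raglib's actual loader logic:

```python
import os
from urllib.parse import urlparse

def classify_source(source):
    """Rough sketch of how a mixed `source` value could be categorized.
    Not raglib's internal code -- just an illustration of the accepted kinds."""
    if isinstance(source, list):
        return [classify_source(item) for item in source]
    if not isinstance(source, str):
        raise TypeError("expected str or list of str")
    if urlparse(source).scheme in ("http", "https"):
        return "url"
    if os.path.isdir(source):
        return "folder"
    if os.path.splitext(source)[1].lower() in {".txt", ".md", ".docx", ".pptx", ".pdf"}:
        return "file"
    return "raw text"

print(classify_source(["notes.md", "https://example.com/post", "Plain text content."]))
# → ['file', 'url', 'raw text']
```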

You can ingest more data later:

rag.add("new_notes.md")
rag.add("https://example.com/post")

RAG API (Main Constructor)

RAG(
	source=None,
	chat_llm=None,
	embedding_llm=None,
	vision_llm=None,
	llm_key=None,
	rag_type="corrective",
	top_k=5,
	chunk_size=400,
	chunk_overlap=50,
	output_dir=None,
	chat_model=None,
	embedding_model=None,
	embedding_base_url=None,
	vision_model=None,
	vector_db=None,
	vector_db_kwargs=None,
)

Key methods:

  • query(question): ask one question and get GenerationResult
  • add(source): add more documents to existing index
  • chat(): start terminal interactive Q/A session
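A note on chunk_size and chunk_overlap: they control a sliding-window splitter over the ingested text. As a rough mental model (a sketch counting characters for simplicity, not raglib's actual splitter):

```python
def chunk_text(text, chunk_size=400, chunk_overlap=50):
    """Sliding-window splitter: each chunk starts chunk_size - chunk_overlap
    characters after the previous one, so consecutive chunks share an overlap."""
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size]
            for i in range(0, max(len(text) - chunk_overlap, 1), step)]

chunks = chunk_text("x" * 1000, chunk_size=400, chunk_overlap=50)
print(len(chunks))      # → 3 chunks for 1000 characters
print(len(chunks[0]))   # → 400 (each full chunk is chunk_size characters)
```

Larger overlap reduces the chance that an answer is split across a chunk boundary, at the cost of some duplicated text in the index.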

How Many RAG Strategies Are Included?

raglib currently provides 12 built-in RAG strategies:

  1. naive
  2. advanced
  3. corrective
  4. self
  5. agentic
  6. hybrid
  7. multi_query
  8. multi_hop
  9. routing
  10. memory
  11. web
  12. tool

Use them by setting rag_type:

rag = RAG(source="docs/", rag_type="multi_hop")

When to use what:

  • naive: fastest baseline
  • advanced: better quality via rerank/reduce/dedup
  • corrective: retries when context quality is weak
  • self: decision + reflection-driven behavior
  • agentic: planner-based sub-query execution
  • hybrid: local + web retrieval blending
  • multi_query: query variant expansion
  • multi_hop: multi-step reasoning retrieval
  • routing: automatic retrieval route selection
  • memory: conversation-memory-aware answering
  • web: web-first retrieval
  • tool: retrieval plus tool output injection
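The guide above can be compressed into a rough rule of thumb. The helper below is purely illustrative: the return values match raglib's rag_type names, but the selection logic is just one reasonable reading of the list, not part of the library:

```python
def suggest_rag_type(needs_web=False, multi_step=False, conversational=False,
                     uses_tools=False, quality_first=False):
    """Map common requirements to a rag_type, following the guide above."""
    if uses_tools:
        return "tool"
    if conversational:
        return "memory"
    if needs_web:
        return "web"
    if multi_step:
        return "multi_hop"
    if quality_first:
        return "corrective"
    return "naive"

print(suggest_rag_type(multi_step=True))  # → multi_hop
```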

Provider Support

Chat providers:

  • openai
  • anthropic
  • groq
  • google
  • ollama
  • custom BaseLLMClient or LangChain-style invoke() model
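The last bullet means any object exposing a LangChain-style invoke(prompt) method should be usable as a chat model. A minimal sketch (the class name and behavior are made up for illustration; raglib's exact duck-typing contract may differ):

```python
class EchoLLM:
    """Toy LangChain-style model: any object with an invoke() method.
    Useful for offline tests; replace with a real client in practice."""
    def invoke(self, prompt):
        return f"echo: {prompt}"

llm = EchoLLM()
print(llm.invoke("ping"))  # → echo: ping

# Hypothetical usage, assuming raglib duck-types on invoke():
# rag = RAG(source="Some text.", chat_llm=llm)
```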

Embedding providers:

  • openai
  • google
  • ollama
  • huggingface (aliases: free, local)
  • mock

Vision providers (for scanned PDF fallback):

  • openai
  • anthropic
  • google
  • mock

Vector backends:

  • chroma (default selection)
  • memory
  • custom BaseVectorStore instance

Interactive Terminal Chat

from raglib import RAG

rag = RAG("my_docs/")
rag.chat()

Commands inside session:

  • help
  • history
  • clear
  • exit / quit / q
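The session commands behave roughly like the dispatcher below. This is an illustration of the observable behavior, not raglib's actual REPL code:

```python
def handle_command(line, history):
    """Return True if `line` was a session command, False if it is a question."""
    cmd = line.strip().lower()
    if cmd in ("exit", "quit", "q"):
        raise SystemExit
    if cmd == "help":
        print("commands: help, history, clear, exit/quit/q")
    elif cmd == "history":
        print("\n".join(history) or "(empty)")
    elif cmd == "clear":
        history.clear()
    else:
        return False  # not a command -> treat as a question for rag.query()
    return True

history = ["What is RAG?"]
print(handle_command("history", history))           # → True (after printing history)
print(handle_command("How does chunking work?", history))  # → False
```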

Output Saving

Set output_dir to save each query result as JSON:

rag = RAG(source="docs/", output_dir="outputs")
rag.query("Summarize this")
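If you prefer to control serialization yourself, saving a result manually looks roughly like this. The answer/sources fields mirror the GenerationResult attributes used earlier in this README; the on-disk JSON schema that raglib itself writes may differ:

```python
import json
from pathlib import Path

def save_result(question, answer, source_ids, output_dir="outputs"):
    """Write one query result as a JSON file, one file per question."""
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)
    path = out / f"result_{abs(hash(question)) % 10**8}.json"
    path.write_text(json.dumps(
        {"question": question, "answer": answer, "sources": source_ids},
        indent=2))
    return path

path = save_result("Summarize this", "A short summary.", ["doc-1"])
print(json.loads(path.read_text())["answer"])  # → A short summary.
```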

Troubleshooting

Import confusion:

  • Install: pip install raglib-py
  • Import: from raglib import RAG

Chroma issues:

  • If Chroma is unavailable or unstable, use vector_db="memory"
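One defensive pattern is to detect whether the chromadb package is importable and fall back automatically. A sketch (the parameter name matches the RAG constructor above):

```python
from importlib.util import find_spec

# Prefer Chroma when the package is installed; otherwise use the in-memory store.
vector_db = "chroma" if find_spec("chromadb") is not None else "memory"
print(vector_db)

# Hypothetical usage:
# rag = RAG(source="docs/", vector_db=vector_db)
```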

Missing provider package:

  • Install needed extra (for example: pip install "raglib-py[groq]")

Complete Local Test Script

A ready-to-run local test script is included at:

  • examples/local_ollama_chroma_test.py

An all-strategy comparison script is included at:

  • examples/check_all_rag_types.py

License

MIT
