Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling

These details have not been verified by PyPI

Project description

Haiku RAG

Agentic RAG built on LanceDB, Pydantic AI, and Docling.

Features

Hybrid search — Vector + full-text with Reciprocal Rank Fusion
Question answering — QA agents with citations (page numbers, section headings)
Reranking — MxBAI, Cohere, Zero Entropy, or vLLM
Research agents — Multi-agent workflows via pydantic-graph: plan, search, evaluate, synthesize
RLM agent — Complex analytical tasks via sandboxed Python code execution (aggregation, computation, multi-document analysis)
Conversational RAG — Chat TUI and web application for multi-turn conversations with session memory
Document structure — Stores full DoclingDocument, enabling structure-aware context expansion
Multiple providers — Embeddings: Ollama, OpenAI, VoyageAI, LM Studio, vLLM. QA/Research: any model supported by Pydantic AI
Local-first — Embedded LanceDB, no servers required. Also supports S3, GCS, Azure, and LanceDB Cloud
CLI & Python API — Full functionality from command line or code
MCP server — Expose as tools for AI assistants (Claude Desktop, etc.)
Visual grounding — View chunks highlighted on original page images
File monitoring — Watch directories and auto-index on changes
Time travel — Query the database at any historical point with --before
Inspector — TUI for browsing documents, chunks, and search results

Installation

Python 3.12 or newer required

Full Package (Recommended)

pip install haiku.rag

Includes all features: document processing, all embedding providers, and rerankers.

Using uv? uv pip install haiku.rag

Slim Package (Minimal Dependencies)

pip install haiku.rag-slim

Install only the extras you need. See the Installation documentation for available options.

Quick Start

Note: Requires an embedding provider (Ollama, OpenAI, etc.). See the Tutorial for setup instructions.

# Index a PDF
haiku-rag add-src paper.pdf

# Search
haiku-rag search "attention mechanism"

# Ask questions with citations
haiku-rag ask "What datasets were used for evaluation?" --cite

# Deep QA — decomposes complex questions into sub-queries
haiku-rag ask "How does the proposed method compare to the baseline on MMLU?" --deep

# Research mode — iterative planning and search
haiku-rag research "What are the limitations of the approach?"

# RLM mode — complex analytical tasks via code execution
haiku-rag rlm "How many documents mention transformers?"

# Interactive chat — multi-turn conversations with memory
haiku-rag chat

# Watch a directory for changes
haiku-rag serve --monitor

See Configuration for customization options.

Python API

from haiku.rag.client import HaikuRAG

async with HaikuRAG("research.lancedb", create=True) as rag:
    # Index documents
    await rag.create_document_from_source("paper.pdf")
    await rag.create_document_from_source("https://arxiv.org/pdf/1706.03762")

    # Search — returns chunks with provenance
    results = await rag.search("self-attention")
    for result in results:
        print(f"{result.score:.2f} | p.{result.page_numbers} | {result.content[:100]}")

    # QA with citations
    answer, citations = await rag.ask("What is the complexity of self-attention?")
    print(answer)
    for cite in citations:
        print(f"  [{cite.chunk_id}] p.{cite.page_numbers}: {cite.content[:80]}")

For research agents and chat, see the Agents docs.

MCP Server

Use with AI assistants like Claude Desktop:

haiku-rag serve --mcp --stdio

Add to your Claude Desktop configuration:

{
  "mcpServers": {
    "haiku-rag": {
      "command": "haiku-rag",
      "args": ["serve", "--mcp", "--stdio"]
    }
  }
}

Provides tools for document management, search, QA, and research directly in your AI assistant.

Examples

See the examples directory for working examples:

Docker Setup - Complete Docker deployment with file monitoring and MCP server
Web Application - Full-stack conversational RAG with CopilotKit frontend

Documentation

Full documentation at: https://ggozad.github.io/haiku.rag/

Installation - Provider setup
Architecture - System overview
Configuration - YAML configuration
CLI - Command reference
Python API - Complete API docs
Agents - QA, chat, and research agents
RLM Agent - Complex analytical tasks via code execution
Applications - Chat TUI, web app, and inspector
Server - File monitoring and MCP
MCP - Model Context Protocol integration
Benchmarks - Performance benchmarks
Changelog - Version history

License

This project is licensed under the MIT License.

mcp-name: io.github.ggozad/haiku-rag

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.44.0

Apr 29, 2026

0.43.1

Apr 25, 2026

0.43.0

Apr 24, 2026

0.42.1

Apr 22, 2026

0.42.0

Apr 22, 2026

0.41.0

Apr 20, 2026

0.40.1

Apr 17, 2026

0.40.0

Apr 16, 2026

0.39.0

Apr 9, 2026

0.38.0

Apr 8, 2026

0.37.0

Apr 7, 2026

0.36.3

Apr 1, 2026

0.36.2

Mar 28, 2026

0.36.1

Mar 27, 2026

0.36.0

Mar 26, 2026

0.35.1

Mar 24, 2026

0.35.0

Mar 24, 2026

0.34.1

Mar 16, 2026

0.34.0

Mar 13, 2026

0.33.3

Mar 12, 2026

0.33.2

Mar 11, 2026

0.33.1

Mar 6, 2026

0.33.0

Mar 4, 2026

0.32.3

Mar 3, 2026

0.32.2

Feb 28, 2026

0.32.0

Feb 24, 2026

0.31.1

Feb 20, 2026

0.31.0

Feb 20, 2026

0.30.2

Feb 19, 2026

0.30.1

Feb 17, 2026

This version

0.30.0

Feb 16, 2026

0.29.1

Feb 10, 2026

0.29.0

Feb 6, 2026

0.28.0

Jan 31, 2026

0.27.2

Jan 29, 2026

0.27.1

Jan 27, 2026

0.27.0

Jan 26, 2026

0.26.9

Jan 22, 2026

0.26.8

Jan 22, 2026

0.26.7

Jan 20, 2026

0.26.6

Jan 19, 2026

0.26.5

Jan 16, 2026

0.26.4

Jan 15, 2026

0.26.3

Jan 15, 2026

0.26.2

Jan 13, 2026

0.26.1

Jan 13, 2026

0.26.0

Jan 13, 2026

0.25.0

Jan 12, 2026

0.24.2

Jan 8, 2026

0.24.1

Jan 8, 2026

0.24.0

Jan 7, 2026

0.23.2

Jan 5, 2026

0.23.1

Dec 29, 2025

0.23.0

Dec 26, 2025

0.22.0

Dec 19, 2025

0.21.0

Dec 18, 2025

0.20.2

Dec 12, 2025

0.20.1

Dec 11, 2025

0.20.0

Dec 10, 2025

0.19.6

Dec 3, 2025

0.19.5

Dec 1, 2025

0.19.4

Nov 28, 2025

0.19.3

Nov 27, 2025

0.19.2

Nov 27, 2025

0.19.1

Nov 26, 2025

0.19.0

Nov 25, 2025

0.18.0

Nov 21, 2025

0.17.2

Nov 19, 2025

0.17.1

Nov 18, 2025

0.17.0

Nov 17, 2025

0.16.1

Nov 14, 2025

0.16.0

Nov 13, 2025

0.15.0

Nov 7, 2025

0.14.1

Nov 6, 2025

0.14.0

Nov 5, 2025

0.13.3

Oct 30, 2025

0.13.2

Oct 29, 2025

0.13.1

Oct 28, 2025

0.13.0

Oct 27, 2025

0.12.1

Oct 16, 2025

0.12.0

Oct 14, 2025

0.11.4

Oct 7, 2025

0.11.3

Oct 1, 2025

0.11.2

Sep 30, 2025

0.11.1

Sep 25, 2025

0.11.0

Sep 24, 2025

0.10.2

Sep 23, 2025

0.10.1

Sep 22, 2025

0.10.0

Sep 19, 2025

0.9.3

Sep 19, 2025

0.9.2

Sep 17, 2025

0.9.1

Sep 17, 2025

0.9.0

Sep 17, 2025

0.8.1

Sep 15, 2025

0.8.0

Sep 10, 2025

0.7.7

Sep 9, 2025

0.7.6

Sep 8, 2025

0.7.5

Sep 5, 2025

0.7.4

Sep 5, 2025

0.7.3

Sep 5, 2025

0.7.2

Sep 4, 2025

0.7.1

Sep 3, 2025

0.7.0

Sep 3, 2025

0.6.0

Aug 17, 2025

0.5.5

Aug 12, 2025

0.5.4

Aug 12, 2025

0.5.2

Aug 11, 2025

0.5.1

Aug 8, 2025

0.5.0

Aug 1, 2025

0.4.3

Aug 1, 2025

0.4.2

Jul 26, 2025

0.4.1

Jul 23, 2025

0.4.0

Jul 20, 2025

0.3.4

Jul 16, 2025

0.3.3

Jul 9, 2025

0.3.2

Jul 4, 2025

0.3.1

Jul 4, 2025

0.3.0

Jun 28, 2025

0.2.0

Jun 20, 2025

0.1.0

Jun 19, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

haiku_rag-0.30.0.tar.gz (392.4 kB view details)

Uploaded Feb 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

haiku_rag-0.30.0-py3-none-any.whl (7.3 kB view details)

Uploaded Feb 16, 2026 Python 3

File details

Details for the file haiku_rag-0.30.0.tar.gz.

File metadata

Download URL: haiku_rag-0.30.0.tar.gz
Upload date: Feb 16, 2026
Size: 392.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for haiku_rag-0.30.0.tar.gz
Algorithm	Hash digest
SHA256	`d283e438409c3a440de67e76345fb886dc0b3ed211c2512b397050b5f82271c2`
MD5	`dfa91e883476f64da677034c32d8fa4d`
BLAKE2b-256	`abda9bba87c54a171d6665248130bc1b8192fe4597d9c93a13596d3838183700`

See more details on using hashes here.

File details

Details for the file haiku_rag-0.30.0-py3-none-any.whl.

File metadata

Download URL: haiku_rag-0.30.0-py3-none-any.whl
Upload date: Feb 16, 2026
Size: 7.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for haiku_rag-0.30.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`20e4783ae135693d262c0b5149d6292376bf93871a8f5797d8995779edcf8276`
MD5	`c4d67d1201c57b27e3fc09b4a34ee7af`
BLAKE2b-256	`1b07a3c4a6cb5d41a7b838172b35cebed4eabab167413db9756a725f76431674`

See more details on using hashes here.

haiku.rag 0.30.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Haiku RAG

Features

Installation

Full Package (Recommended)

Slim Package (Minimal Dependencies)

Quick Start

Python API

MCP Server

Examples

Documentation

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes