100% offline RAG storage and MCP server for querying local document knowledge bases

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

RAG Knowledge Base

100% offline RAG storage and MCP server for querying local document knowledge bases. Designed for high-quality retrieval over large corpora (5–10 GB+) with GPU acceleration, hybrid search, and cross-encoder reranking.

Features

Core

Fully offline — no internet required after initial setup
30+ file formats — Markdown, PDF, DOCX, PPTX, XLSX, CSV, JSON, YAML, XML, HTML, RST, RTF, ODT/ODS/ODP, EPUB, images (OCR), source code, logs, and plain text
Multiple RAG databases — create, switch, and manage independent knowledge bases
Shareable — export a RAG as a single .rag file; import on another machine
MCP server — query from VS Code Copilot, Claude Desktop, or any MCP client
Web UI — Gradio dashboard for search, status, and management
File watching — automatic re-indexing on file changes
Cross-platform — Windows, macOS, and Linux

Search Quality

Hybrid search — combines vector similarity (cosine) with BM25 keyword matching via configurable score fusion
Cross-encoder reranking — second-stage reranking with cross-encoder/ms-marco-MiniLM-L-6-v2 for precision
Structure-aware chunking — preserves Markdown heading hierarchy, PDF page boundaries, and code function/class boundaries
Contextual prefixes — each chunk carries its document title and section path for better embedding quality
Min-score filtering — automatically filters low-relevance noise from results

Indexing Performance

GPU acceleration — auto-detects CUDA and Apple MPS; falls back to CPU
Parallel file parsing — multi-threaded document parsing with configurable worker count
Batched embedding — encodes chunks in large batches (default 256) for optimal throughput
Incremental indexing — xxHash-based file manifest tracks changes at O(1) cost; unchanged files are skipped instantly
Tuned HNSW — configurable ef_construction and M parameters for ChromaDB's vector index

Quick Start

Install from PyPI

pip install rag-knowledge-base

Install from source

pip install -e .

GPU support: Install PyTorch with CUDA before installing this package:
pip install torch --index-url https://download.pytorch.org/whl/cu126

Create and Index a RAG

# Create a knowledge base pointing at your document folders
rag-kb create my-docs \
  --folders /path/to/docs /path/to/more/docs \
  --description "My documentation"

# Set as active
rag-kb use my-docs

# Index (auto-detects GPU, uses 4 parallel workers by default)
rag-kb index

# Re-run index any time — only changed files are processed
rag-kb index

Query via MCP (VS Code Copilot)

# Start MCP server (stdio mode for VS Code)
rag-kb serve

Add to your .vscode/mcp.json:

{
  "servers": {
    "rag-knowledge-base": {
      "command": "rag-kb",
      "args": ["serve"]
    }
  }
}

Query via MCP (HTTP)

# Start MCP server on HTTP
rag-kb serve --http --port 8080

Web Dashboard

# Launch Gradio UI
rag-kb ui

Share a RAG

# Export
rag-kb export my-docs --output my-docs.rag

# Import (on another machine)
rag-kb import my-docs.rag --name received-docs

Supported File Formats

Category	Extensions
Documents	`.md`, `.txt`, `.pdf`, `.docx`, `.pptx`, `.rst`, `.rtf`, `.epub`
Spreadsheets	`.xlsx`, `.xls`, `.csv`, `.tsv`
Data	`.json`, `.jsonl`, `.yaml`, `.yml`, `.xml`
Web	`.html`, `.htm`
OpenDocument	`.odt`, `.ods`, `.odp`
Images (OCR)	`.png`, `.jpg`, `.jpeg`, `.gif`, `.bmp`, `.tiff`, `.tif`, `.webp`
Code	`.py`, `.js`, `.ts`, `.java`, `.c`, `.cpp`, `.go`, `.rs`, `.rb`, `.php`, `.cs`, `.kt`, `.swift`, `.scala`, `.sh`, `.bash`, `.zsh`, `.ps1`, `.bat`, `.cmd`, `.sql`, `.r`, `.m`, `.lua`, `.pl`, `.ex`, `.exs`, `.hs`, `.clj`, `.lisp`, `.erl`, `.elm`, `.v`, `.sv`, `.vhdl`, `.zig`, `.nim`, `.dart`, `.groovy`, `.tf`, `.hcl`, `.toml`, `.ini`, `.cfg`, `.conf`, `.env`, `.dockerfile`, `.makefile`
Logs	`.log`

CLI Reference

Command	Description
`rag-kb create NAME --folders ...`	Create a new RAG database
`rag-kb list`	List all RAG databases
`rag-kb use NAME`	Set the active RAG
`rag-kb delete NAME [-y]`	Delete a RAG database
`rag-kb detach NAME`	Detach a RAG from source files (read-only mode)
`rag-kb attach NAME`	Re-attach a detached RAG to its source files
`rag-kb index [--rag NAME] [--workers N]`	Index documents (parallel, incremental)
`rag-kb export NAME --output FILE`	Export RAG to a `.rag` file
`rag-kb import FILE [--name NAME]`	Import RAG from a `.rag` file
`rag-kb serve [--http] [--port N]`	Start MCP server
`rag-kb ui [--port N]`	Launch web dashboard
`rag-kb config`	Show current configuration

Use -v for verbose output: rag-kb -v index

MCP Tools

When connected via MCP, the following tools are available:

Tool	Description
`search_knowledge_base`	Hybrid search with optional cross-encoder reranking
`get_document_content`	Retrieve all chunks from a specific document
`list_indexed_files`	List all indexed files with metadata
`get_index_status`	Current indexing state and statistics
`reindex`	Trigger re-indexing of the active RAG
`list_rags`	List all available RAG databases
`switch_rag`	Switch the active RAG database
`export_rag`	Export a RAG to a shareable file
`import_rag`	Import a RAG from a shared file

Configuration

Config file location (platform-dependent):

Windows: %LOCALAPPDATA%\rag-kb\config.yaml
macOS: ~/Library/Application Support/rag-kb/config.yaml
Linux: ~/.local/share/rag-kb/config.yaml

# Embedding
embedding_model: paraphrase-multilingual-MiniLM-L12-v2
chunk_size: 1024
chunk_overlap: 128

# Search quality
reranking_enabled: true
reranker_model: cross-encoder/ms-marco-MiniLM-L-6-v2
hybrid_search_enabled: true
hybrid_search_alpha: 0.7        # 0.0 = pure BM25, 1.0 = pure vector
min_score_threshold: 0.15

# Indexing performance
indexing_workers: 4              # parallel file-parsing threads
embedding_batch_size: 256        # texts per encode() call

# ChromaDB HNSW tuning
hnsw_ef_construction: 200
hnsw_m: 32

# File types to index
supported_extensions:
  - .md
  - .txt
  - .pdf
  - .docx
  - .pptx
  - .xlsx
  - .csv
  - .json
  - .yaml
  - .xml
  - .html
  - .htm
  - .rst
  - .rtf
  - .odt
  - .epub
  - .png
  - .jpg
  # ... see config for full list

# Server
host: 127.0.0.1
port: 8080

Architecture

Documents  ──►  Parsers (30+ formats)
                    │
                    ▼
             Structure-aware Chunker
             (headings / pages / functions)
                    │
                    ▼
             Embedding Model (CUDA / MPS / CPU)
             + Contextual Prefixes
                    │
                    ▼
             ChromaDB (HNSW cosine index)
                    │
          ┌─────────┼─────────┐
          ▼         ▼         ▼
       MCP Server  Web UI   CLI
          │
    ┌─────┼─────┐
    ▼     ▼     ▼
  Vector  BM25  Hybrid Fusion
    │           │
    ▼           ▼
  Cross-encoder Reranking
    │
    ▼
  Results

License

MIT License — see LICENSE for details.

All dependencies use MIT, Apache 2.0, or BSD licenses.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

yohasacura

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.0.1

Mar 1, 2026

This version

1.0.0

Feb 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rag_knowledge_base-1.0.0.tar.gz (73.0 kB view details)

Uploaded Feb 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rag_knowledge_base-1.0.0-py3-none-any.whl (91.1 kB view details)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file rag_knowledge_base-1.0.0.tar.gz.

File metadata

Download URL: rag_knowledge_base-1.0.0.tar.gz
Upload date: Feb 26, 2026
Size: 73.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rag_knowledge_base-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`f2156e9ddd4192fc1b70ab1da8a8913c76c2b6b86ddabfd4e9dde0e076570e43`
MD5	`f6befdb71351076b7a23c49719514845`
BLAKE2b-256	`0846dcb033d7aa025795b67bced3086db8e41326c5dbea9444af0cdb21720836`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rag_knowledge_base-1.0.0.tar.gz:

Publisher: release.yml on yohasacura/rag-knowledge-base

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rag_knowledge_base-1.0.0.tar.gz
- Subject digest: f2156e9ddd4192fc1b70ab1da8a8913c76c2b6b86ddabfd4e9dde0e076570e43
- Sigstore transparency entry: 995800950
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: yohasacura/rag-knowledge-base@719f2f22547c5102f5a2aefd586631507162c83f
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/yohasacura
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@719f2f22547c5102f5a2aefd586631507162c83f
- Trigger Event: push

File details

Details for the file rag_knowledge_base-1.0.0-py3-none-any.whl.

File metadata

Download URL: rag_knowledge_base-1.0.0-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 91.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rag_knowledge_base-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`aca1c570a51751d0f463f707f6b2483a70acd8545651dbe124e513e503fdc2e6`
MD5	`4a6711b953164e225b339db92d1397c2`
BLAKE2b-256	`0bc3ffe07b9bed91e8f9f29a3b48ce26eb0248829276e7210a2ab8b77ded3876`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rag_knowledge_base-1.0.0-py3-none-any.whl:

Publisher: release.yml on yohasacura/rag-knowledge-base

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rag_knowledge_base-1.0.0-py3-none-any.whl
- Subject digest: aca1c570a51751d0f463f707f6b2483a70acd8545651dbe124e513e503fdc2e6
- Sigstore transparency entry: 995800953
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: yohasacura/rag-knowledge-base@719f2f22547c5102f5a2aefd586631507162c83f
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/yohasacura
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@719f2f22547c5102f5a2aefd586631507162c83f
- Trigger Event: push

rag-knowledge-base 1.0.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

RAG Knowledge Base

Features

Core

Search Quality

Indexing Performance

Quick Start

Install from PyPI

Install from source

Create and Index a RAG

Query via MCP (VS Code Copilot)

Query via MCP (HTTP)

Web Dashboard

Share a RAG

Supported File Formats

CLI Reference

MCP Tools

Configuration

Architecture

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance