Skip to main content

MCP Server for rapid-rag - Local RAG with semantic search and LLM queries

Project description

mcp-server-rapid-rag

MCP Server for rapid-rag - Local RAG with semantic search and LLM queries.

Search your documents with AI, no cloud needed! Works with Ollama for local LLM inference.

Installation

pip install mcp-server-rapid-rag

Configuration

Add to your Claude Desktop config (~/.config/claude/claude_desktop_config.json):

{
  "mcpServers": {
    "rapid-rag": {
      "command": "mcp-server-rapid-rag"
    }
  }
}

Or with uvx:

{
  "mcpServers": {
    "rapid-rag": {
      "command": "uvx",
      "args": ["mcp-server-rapid-rag"]
    }
  }
}

Tools

rag_add

Add files or directories to the RAG collection. Supports .txt, .md, .pdf.

"Add my docs folder to RAG: ~/Documents/notes"

rag_add_text

Add raw text directly to the collection.

"Store this meeting notes in RAG: [text content]"

rag_search

Semantic search - find the most relevant documents.

"Search my documents for: Python async patterns"

rag_query

Full RAG pipeline - search documents and get an AI-generated answer.

"Based on my documents, how do I configure logging?"

rag_info

Get collection statistics.

"Show me the RAG collection info"

rag_list

List all available collections.

"List my RAG collections"

rag_clear

Clear a collection (requires confirmation).

"Clear the 'old_project' RAG collection"

Example Usage

Ask Claude:

"Add all the markdown files from ~/projects/docs to my RAG"

Claude will:

  1. Index all .md files in the directory
  2. Split them into chunks with embeddings
  3. Store them in ChromaDB locally

Then ask:

"Based on my docs, how do I set up authentication?"

Claude will:

  1. Search the indexed documents
  2. Pass relevant chunks to Ollama
  3. Generate an answer with source citations

Requirements

  • rapid-rag: Core RAG library with ChromaDB
  • Ollama (optional): For rag_query - local LLM inference

Install Ollama

# macOS/Linux
curl -fsSL https://ollama.ai/install.sh | sh
ollama pull qwen2.5:7b

Collections

Documents are organized into collections. Each collection has:

  • Separate vector database
  • Persistent storage in ./rapid_rag_data/{collection}/
  • Own embedding cache

Default collection is "default", but you can create multiple:

"Add ~/work/project-a to the 'project-a' collection"
"Search 'project-a' for: API endpoints"

Links

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_server_rapid_rag-0.1.0.tar.gz (5.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mcp_server_rapid_rag-0.1.0-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file mcp_server_rapid_rag-0.1.0.tar.gz.

File metadata

  • Download URL: mcp_server_rapid_rag-0.1.0.tar.gz
  • Upload date:
  • Size: 5.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for mcp_server_rapid_rag-0.1.0.tar.gz
Algorithm Hash digest
SHA256 37c79b4fea64b7b27563f50489d120f28f93b1d4bbb19d4ce2564a92b3b1f09c
MD5 7893dcedeacda016fb780329ca5f267f
BLAKE2b-256 602c7fb3a1b7de1f8d23d5ce361318a958293f33cc3feebf343d76337f5532ef

See more details on using hashes here.

File details

Details for the file mcp_server_rapid_rag-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for mcp_server_rapid_rag-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 3101b41635cc1c4d990a150e7d09c0edf01a91e4c7f02c74d70ac909bc8eb9d2
MD5 fe4d52f7908edaf8b5ded6dfa2a5c28c
BLAKE2b-256 2dbddbaaace8984d8020e5bdfbc3065e46421dd9185ee768e773ab7dadc86573

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page