Skip to main content

Vector database for software files with MCP interface

Project description

MseeP.ai Security Assessment Badge

Files-DB-MCP: Vector Search for Code Projects

A local vector database system that provides LLM coding agents with fast, efficient search capabilities for software projects via the Message Control Protocol (MCP).

Features

  • Zero Configuration - Auto-detects project structure with sensible defaults
  • Real-Time Monitoring - Continuously watches for file changes
  • Vector Search - Semantic search for finding relevant code
  • MCP Interface - Compatible with Claude Code and other LLM tools
  • Open Source Models - Uses Hugging Face models for code embeddings

Installation

Option 1: Clone and Setup (Recommended)

# Using SSH (recommended if you have SSH keys set up with GitHub)
git clone git@github.com:randomm/files-db-mcp.git ~/.files-db-mcp && bash ~/.files-db-mcp/install/setup.sh

# Using HTTPS (if you don't have SSH keys set up)
git clone https://github.com/randomm/files-db-mcp.git ~/.files-db-mcp && bash ~/.files-db-mcp/install/setup.sh

Option 2: Automated Installation Script

curl -fsSL https://raw.githubusercontent.com/randomm/files-db-mcp/main/install/install.sh | bash

Usage

After installation, run in any project directory:

files-db-mcp

The service will:

  1. Detect your project files
  2. Start indexing in the background
  3. Begin responding to MCP search queries immediately

Requirements

  • Docker
  • Docker Compose

Configuration

Files-DB-MCP works without configuration, but you can customize it with environment variables:

  • EMBEDDING_MODEL - Change the embedding model (default: 'jinaai/jina-embeddings-v2-base-code' or project-specific model)
  • FAST_STARTUP - Set to 'true' to use a smaller model for faster startup (default: 'false')
  • QUANTIZATION - Enable/disable quantization (default: 'true')
  • BINARY_EMBEDDINGS - Enable/disable binary embeddings (default: 'false')
  • IGNORE_PATTERNS - Comma-separated list of files/dirs to ignore

First-Time Startup

On first run, Files-DB-MCP will download embedding models which may take several minutes depending on:

  • The size of the selected model (300-500MB for high-quality models)
  • Your internet connection speed

Subsequent startups will be much faster as models are cached in a persistent Docker volume. For faster initial startup, you can:

# Use a smaller, faster model (90MB)
EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L6-v2 files-db-mcp

# Or enable fast startup mode
FAST_STARTUP=true files-db-mcp

Model Caching

Files-DB-MCP automatically persists downloaded embedding models, so you only need to download them once:

  • Models are stored in a Docker volume called model_cache
  • This volume persists between container restarts and across different projects
  • The cache is shared for all projects using Files-DB-MCP on your machine
  • You don't need to download the model again for each project

Claude Code Integration

Add to your Claude Code configuration:

{
  "mcpServers": {
    "files-db-mcp": {
      "command": "python",
      "args": ["/path/to/src/claude_mcp_server.py", "--host", "localhost", "--port", "6333"]
    }
  }
}

For details, see Claude MCP Integration.

Documentation

Repository Structure

  • /src - Source code
  • /tests - Unit and integration tests
  • /docs - Documentation
  • /scripts - Utility scripts
  • /install - Installation scripts
  • /.docker - Docker configuration
  • /config - Configuration files
  • /ai-assist - AI assistance files

License

MIT License

Contributing

Contributions welcome! Please feel free to submit a pull request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iflow_mcp_files_db_mcp-0.1.0.tar.gz (160.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

iflow_mcp_files_db_mcp-0.1.0-py3-none-any.whl (37.9 kB view details)

Uploaded Python 3

File details

Details for the file iflow_mcp_files_db_mcp-0.1.0.tar.gz.

File metadata

File hashes

Hashes for iflow_mcp_files_db_mcp-0.1.0.tar.gz
Algorithm Hash digest
SHA256 dd53447a404258eeaba431150453bff69cd50438b6a428834282f09cf289005e
MD5 fe21018ddd2ff0e69443f8817b4daa68
BLAKE2b-256 ad15fe221fb3a0b4e8c726913a61ecd11fe74903e8a5949207a86d76c704d994

See more details on using hashes here.

File details

Details for the file iflow_mcp_files_db_mcp-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for iflow_mcp_files_db_mcp-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9fcc883f929ea6af4c0e5246c18f2304b851f8084c4f662f218c1b420f7942a1
MD5 0b4c0ee6523792782c95fe15d9b837de
BLAKE2b-256 f3963fd6156dd53b83ac1da5ded1ddb170d2569890a9382213ad309a91ed9396

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page