W3 MCP Qdrant Server

A Python MCP server for vector search using the Qdrant vector database and Ollama embeddings.

Status: ✅ Working with Qdrant vector search and Ollama embeddings

Features

  • qdrant_search - Search for similar documents using text queries (auto-embedded via Ollama)
  • qdrant_list_collections - List all Qdrant collections with metadata

Supports flexible output formats (Markdown or JSON) with configurable similarity thresholds.

Quick Start

1. Prerequisites Setup

Qdrant Server

# Using Docker (Recommended)
docker run -p 6333:6333 qdrant/qdrant:latest

Or install locally: see the Qdrant Quick Start guide.

Ollama Server

# Install: https://ollama.ai
ollama pull nomic-embed-text
ollama serve

Available embedding models (see the dimension check after this list):

  • nomic-embed-text (768 dims) - recommended, lightweight
  • mxbai-embed-large (1024 dims) - higher quality
  • all-minilm (384 dims) - ultra-lightweight
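
Each model produces vectors of a fixed size, and that size must match the vector_size of any Qdrant collection you search. To confirm a model's output dimensionality, you can call Ollama's embeddings endpoint directly (a quick check; the /api/embeddings request shape is Ollama's documented one, and the python one-liner just counts the returned vector's length):

# Embed a test string and count the returned vector's dimensions
curl -s http://localhost:11434/api/embeddings \
  -d '{"model": "nomic-embed-text", "prompt": "test"}' \
  | python -c "import json,sys; print(len(json.load(sys.stdin)['embedding']))"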

2. Clean Setup (Important!)

cd /path/to/w3-mcp-server-qdrant

# Remove old lockfile and venv
rm -rf uv.lock .venv venv

# Unset old environment variable
unset VIRTUAL_ENV

3. Install Dependencies

# Install Python dependencies (using uv)
uv sync

# Install MCP CLI dependencies
uv pip install 'mcp[cli]'

4. Configure Environment

Create a .env file or export environment variables:

# Qdrant
export QDRANT_URL=http://localhost:6333
export QDRANT_API_KEY=  # Only needed if your Qdrant instance requires API key auth

# Ollama
export OLLAMA_BASE_URL=http://localhost:11434
export OLLAMA_MODEL=nomic-embed-text

5. Verify Installation

# Check Qdrant
curl http://localhost:6333/healthz

# Check Ollama
curl http://localhost:11434/api/tags

# Check Python env
uv run python -c "from mcp.server.fastmcp import FastMCP; print('✓ MCP ready')"

6. Test with MCP Inspector

# Start MCP Inspector (interactive web UI)
uv run mcp dev server.py

Opens a URL like:

http://localhost:6274/?MCP_PROXY_AUTH_TOKEN=...

Features:

  • ✅ Available tools listed in sidebar
  • ✅ Test each tool interactively with JSON input
  • ✅ Real-time request/response viewing
  • ✅ Server logs and debugging
  • ✅ No extra dependencies needed

Usage

Option A: MCP Inspector (Development)

Best way to test and debug:

cd /path/to/w3-mcp-server-qdrant

# Start inspector
uv run mcp dev server.py

Opens the web UI (the exact URL, including an auth token, is printed in the terminal):

  • See available tools
  • Test each tool with JSON input
  • View request/response in real-time
  • See server logs

Option B: Direct Python

# Run server (stdio mode)
uv run python server.py

Option C: Claude Code Integration

Method 1: From PyPI (Recommended)

Install from PyPI (optional if you use the uv --with config below, which fetches the package on demand):

pip install w3-mcp-server-qdrant
# or
uv pip install w3-mcp-server-qdrant

Edit ~/.claude/claude_config.json or ~/.mcp.json:

{
  "mcpServers": {
    "qdrant": {
      "type": "stdio",
      "command": "uv",
      "args": ["run", "--with", "w3-mcp-server-qdrant", "w3-mcp-server-qdrant"],
      "env": {
        "QDRANT_URL": "http://localhost:6333",
        "OLLAMA_BASE_URL": "http://localhost:11434",
        "OLLAMA_MODEL": "nomic-embed-text"
      }
    }
  }
}

Advantages:

  • ✅ No need to clone the repo
  • ✅ Easy version management
  • ✅ Automatic dependency isolation

Method 2: From Local Source

Edit ~/.claude/claude_config.json:

{
  "mcpServers": {
    "qdrant": {
      "type": "stdio",
      "command": "uv",
      "args": ["run", "server.py"],
      "cwd": "/path/to/w3-mcp-server-qdrant",
      "env": {
        "QDRANT_URL": "http://localhost:6333",
        "OLLAMA_BASE_URL": "http://localhost:11434",
        "OLLAMA_MODEL": "nomic-embed-text"
      }
    }
  }
}

Then restart Claude Code.

Tools Documentation

qdrant_search

Search for similar documents in a collection using a text query (auto-embedded via Ollama).

Parameters:

  • collection_name (string, required): Name of the collection to search
  • query_text (string, required): Text to search for (will be embedded automatically)
  • limit (integer, 1-100): Max results to return (default: 5)
  • score_threshold (float, 0.0-1.0): Minimum similarity threshold (default: 0.0)
  • response_format (string): "markdown" or "json" (default: "markdown")

Example:

{
  "collection_name": "tech_docs",
  "query_text": "How do vector databases work?",
  "limit": 10,
  "score_threshold": 0.6,
  "response_format": "markdown"
}

Output:

Returns matching documents with similarity scores:

Found 3 results for "How do vector databases work?":

1. [tech_docs.pdf:42](0.89) - Vector databases store embeddings...
2. [tutorial.md:15](0.76) - Getting started with vectors...
3. [paper.pdf:8](0.71) - Advanced vector indexing techniques...

qdrant_list_collections

List all collections in Qdrant with metadata.

Parameters:

  • response_format (string): "markdown" or "json" (default: "markdown")

Example:

{
  "response_format": "json"
}

Output:

{
  "collections": [
    {
      "name": "tech_docs",
      "points_count": 1250,
      "vector_size": 768
    },
    {
      "name": "papers",
      "points_count": 3840,
      "vector_size": 1024
    }
  ]
}
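
For reference, a rough equivalent of this listing using qdrant-client directly (an illustrative sketch, not the tool's actual implementation; per-collection point counts require a second lookup):

# Sketch: list collections and their point counts via qdrant-client
from qdrant_client import QdrantClient

client = QdrantClient(url="http://localhost:6333")
for c in client.get_collections().collections:
    info = client.get_collection(c.name)
    print(f"{c.name}: {info.points_count} points")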

Configuration

QDRANT_URL

Specifies the URL of your Qdrant server.

Set via:

  1. Environment variable:

    export QDRANT_URL=http://localhost:6333
    uv run python server.py
    
  2. .env file:

    QDRANT_URL=http://localhost:6333
    
  3. In claude_config.json:

    "env": {
      "QDRANT_URL": "http://localhost:6333"
    }
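
However the value is supplied, resolution presumably comes down to standard environment lookups with the documented defaults. A minimal sketch (variable handling here is illustrative, not the package's actual code; the QDRANT_URL default is simply the value used throughout this README):

import os

# Explicit environment variables win; otherwise fall back to the
# defaults documented in this README.
QDRANT_URL = os.getenv("QDRANT_URL", "http://localhost:6333")
QDRANT_API_KEY = os.getenv("QDRANT_API_KEY") or None  # optional
OLLAMA_BASE_URL = os.getenv("OLLAMA_BASE_URL", "http://localhost:11434")
OLLAMA_MODEL = os.getenv("OLLAMA_MODEL", "nomic-embed-text")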
    

OLLAMA_BASE_URL

Specifies the URL of your Ollama server.

Default: http://localhost:11434

OLLAMA_MODEL

Specifies which embedding model to use.

Default: nomic-embed-text

Recommended models:

  • nomic-embed-text (768 dims) - balanced, good for most use cases
  • all-minilm (384 dims) - fast, lightweight
  • mxbai-embed-large (1024 dims) - higher quality, slower
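
Whichever model you choose, its dimensionality must match the vector_size of every collection you query; 768-dim embeddings cannot be searched against a 1024-dim collection. You can check a collection's configured size through Qdrant's REST API (the response path shown assumes a single unnamed vector configuration):

# Inspect a collection's configured vector size
curl -s http://localhost:6333/collections/tech_docs \
  | python -c "import json,sys; print(json.load(sys.stdin)['result']['config']['params']['vectors']['size'])"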

Project Structure

w3-mcp-server-qdrant/
├── server.py              # MCP server entry point
├── pyproject.toml         # Project config
├── .env.example           # Environment variables template
├── README.md              # This file
└── tests/
    └── test_mcp_server.py # Integration tests

How It Works

Architecture

MCP Client (Claude, IDE, etc.)
    ↓
MCP Server (server.py)
    ├── Ollama: text → embedding vector
    └── Qdrant: vector search

Search Flow

  1. User provides text query
  2. Ollama embeds query → embedding vector
  3. Qdrant searches for similar vectors
  4. Results returned with scores and metadata (see the sketch below)
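
A minimal sketch of this flow using httpx and qdrant-client (illustrative only; the package's actual server code may differ):

# Sketch of the documented search flow: embed with Ollama, then search Qdrant
import httpx
from qdrant_client import QdrantClient

def search(collection_name: str, query_text: str,
           limit: int = 5, score_threshold: float = 0.0):
    # Steps 1-2: embed the text query via Ollama
    resp = httpx.post(
        "http://localhost:11434/api/embeddings",
        json={"model": "nomic-embed-text", "prompt": query_text},
    )
    vector = resp.json()["embedding"]

    # Step 3: search Qdrant for the nearest stored vectors
    client = QdrantClient(url="http://localhost:6333")
    hits = client.search(
        collection_name=collection_name,
        query_vector=vector,
        limit=limit,
        score_threshold=score_threshold,
    )

    # Step 4: return scores plus payload metadata
    return [(hit.score, hit.payload) for hit in hits]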

Examples

Search documents

# Via Claude/MCP interface
qdrant_search(
    collection_name="tech_docs",
    query_text="machine learning algorithms",
    limit=5,
    score_threshold=0.6,
    response_format="markdown"
)

List collections

# Via Claude/MCP interface
qdrant_list_collections(response_format="json")

Development

Run tests

pytest tests/

Code formatting

black server.py
ruff check server.py

Testing with MCP Inspector

uv run mcp dev server.py

The web UI (URL printed in the terminal) shows:

  • Available tools and schemas
  • Real-time request/response
  • Server logs
  • Interactive testing

Performance Tips

  • Score threshold: Use score_threshold to filter low-relevance results and reduce noise
  • Result limit: Adjust limit parameter (1-100) to balance quality vs. speed
  • Embedding model: Choose based on quality vs. speed tradeoff:
    • nomic-embed-text: balanced (recommended)
    • all-minilm: fast, lightweight
    • mxbai-embed-large: higher quality but slower

Troubleshooting

Qdrant connection error

# Check if Qdrant is running
curl http://localhost:6333/healthz

# Start Qdrant with Docker
docker run -p 6333:6333 qdrant/qdrant:latest

Ollama embedding failed

# Check if Ollama is running
curl http://localhost:11434/api/tags

# Pull embedding model
ollama pull nomic-embed-text

# Start Ollama
ollama serve

Collection not found

  • Ensure collection exists in Qdrant
  • Create the collection through the Qdrant UI or external tools (see the sketch below)
  • Verify collection name matches exactly
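
A minimal creation sketch with qdrant-client (the collection name and distance metric are illustrative; the vector size must match your embedding model, e.g. 768 for nomic-embed-text):

# Sketch: create a collection sized for nomic-embed-text embeddings
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, VectorParams

client = QdrantClient(url="http://localhost:6333")
client.create_collection(
    collection_name="tech_docs",
    vectors_config=VectorParams(size=768, distance=Distance.COSINE),
)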

MCP module not found

# Install dependencies
uv sync

# Or manually
pip install mcp pydantic qdrant-client

Server hangs on startup

  • Check if Qdrant server is running and accessible
  • Check if Ollama server is running
  • Try: curl http://localhost:6333/healthz and curl http://localhost:11434/api/tags

Future Enhancements

  • Support for additional embedding models
  • Batch vector operations
  • Collection creation/deletion tools
  • Vector update and delete operations
  • Semantic search filters

License

MIT
