Skip to main content

MCP Server for Research Assistant with Vector Database Management

Project description

BINATI AI Logo

MCP Research Assistant Server

By BINATI AInalytics

A Model Context Protocol (MCP) server that provides intelligent research data management using vector embeddings and semantic search. This server enables you to save, organize, and retrieve research content using ChromaDB and OpenAI embeddings.

Screenshots

UI Screenshot 1 UI Screenshot 2 UI Screenshot 3 UI Screenshot 4 UI Screenshot 5

Features

  • Vector Storage: Uses ChromaDB for efficient storage and retrieval
  • Topic Organization: Organize research content by topics
  • Deduplication: Automatic content deduplication using hashing
  • Semantic Search: Query research content using natural language
  • Multiple Topics: Manage multiple research topics simultaneously
  • OpenAI Embeddings: Uses OpenAI's text-embedding-3-small model

Installation

Using uvx (Recommended)

uvx mcp-research-assistant

Using uv

uv pip install mcp-research-assistant

Using pip

pip install mcp-research-assistant

From Source

git clone https://github.com/CyprianFusi/mcp-research-assistant.git
cd mcp-research-assistant
uv pip install -e .

Configuration

Environment Variables

Required:

  • OPENAI_API_KEY - Your OpenAI API key for embeddings
  • RESEARCH_DB_PATH - Base path for storing research databases
    • A research_chroma_dbs directory will be created inside this path
    • Example: /path/to/data (will create /path/to/data/research_chroma_dbs)
    • Example: ~/.research_assistant_mcp (will create ~/.research_assistant_mcp/research_chroma_dbs)

Create a .env file with your configuration:

OPENAI_API_KEY=your-api-key-here
RESEARCH_DB_PATH=/path/to/data

Claude Desktop Configuration

MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "research-assistant": {
      "command": "uvx",
      "args": ["mcp-research-assistant"],
      "env": {
        "OPENAI_API_KEY": "your-api-key-here",
        "RESEARCH_DB_PATH": "/path/to/data"
      }
    }
  }
}

Note: Both OPENAI_API_KEY and RESEARCH_DB_PATH are required. The database will be stored in RESEARCH_DB_PATH/research_chroma_dbs/.

Available Tools

1. save_research_data

Save research content to vector database for future retrieval.

Parameters:

  • content (List[str]): List of text content to save
  • topic (str): Topic name for organizing the data (creates separate DB)

Example:

Save these research findings about AI to the "artificial-intelligence" topic

2. query_research_data

Query saved research content using natural language.

Parameters:

  • query (str): Natural language query
  • topic (str): Topic to search in (default: "default")
  • k (int): Number of results to return (default: 5)

Example:

Query the "artificial-intelligence" topic for information about transformers

3. list_topics

List all available research topics and their document counts.

Example:

List all available research topics

4. delete_topic

Delete a research topic and all its associated data.

Parameters:

  • topic (str): Topic name to delete

Example:

Delete the "old-research" topic

5. get_topic_info

Get detailed information about a specific topic.

Parameters:

  • topic (str): Topic name

Example:

Get information about the "artificial-intelligence" topic

Usage Examples

Once configured with Claude Desktop or another MCP client, you can:

  • "Save this article about machine learning to my 'ml-research' topic"
  • "Query my 'ml-research' for information about neural networks"
  • "List all my research topics"
  • "Get information about the 'quantum-computing' topic"
  • "Delete the 'old-notes' topic"

Technical Details

  • Protocol: Model Context Protocol (MCP)
  • Transport: stdio
  • Vector Database: ChromaDB
  • Embeddings: OpenAI text-embedding-3-small
  • Storage: Local filesystem at RESEARCH_DB_PATH/research_chroma_dbs/

Requirements

  • Python 3.11 or higher
  • OpenAI API key
  • Dependencies: chromadb, langchain, fastmcp, openai

Development

Setup Development Environment

# Clone the repository
git clone https://github.com/CyprianFusi/mcp-research-assistant.git
cd mcp-research-assistant

# Install with development dependencies
uv pip install -e .

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

Cyprian Fusi

Acknowledgments

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_research_assistant-0.1.5.tar.gz (1.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mcp_research_assistant-0.1.5-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file mcp_research_assistant-0.1.5.tar.gz.

File metadata

  • Download URL: mcp_research_assistant-0.1.5.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for mcp_research_assistant-0.1.5.tar.gz
Algorithm Hash digest
SHA256 321767654288109972c7ebfa08819ccef95f9b4e1c557c96ba3f0ba10f190177
MD5 e0e18a7e6f5598ceb1f76af72955ec3b
BLAKE2b-256 2107b3d1128b16d3b2e02e6169deec01af6420b84821f8e67a8a049fc8640a17

See more details on using hashes here.

File details

Details for the file mcp_research_assistant-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for mcp_research_assistant-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 b620e358a458a72d5ebcf57a906a14d458b5fd6b180ef9d8d48c8c18083d4245
MD5 65922afbb835b4539e1f747c6ee02285
BLAKE2b-256 66d98b49c450b6562b8d54007d1676f1940a680ebe0580641907d5e70b049d2b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page