A Model Context Protocol server for research assistance with ChromaDB vector storage
Project description
Research Assistant MCP Server
A Model Context Protocol (MCP) server that provides research assistance capabilities with ChromaDB vector storage. This server enables AI assistants to save, retrieve, and manage research content efficiently using vector embeddings.
Features
- Vector Storage: Uses ChromaDB for efficient storage and retrieval
- Topic Organization: Organize research content by topics
- Deduplication: Automatic content deduplication using hashing
- Semantic Search: Query research content using natural language
- Multiple Topics: Manage multiple research topics simultaneously
- OpenAI Embeddings: Uses OpenAI's text-embedding-3-small model
Installation
Using uvx (Recommended)
uvx research-assistant-mcp
Using uv
uv pip install research-assistant-mcp
Using pip
pip install research-assistant-mcp
From Source
git clone https://github.com/laxmimerit/research-assistant-mcp.git
cd research-assistant-mcp
uv pip install -e .
Configuration
Environment Variables
Required:
OPENAI_API_KEY- Your OpenAI API key for embeddingsRESEARCH_DB_PATH- Base path for storing research databases- A
research_chroma_dbsdirectory will be created inside this path - Example:
/path/to/data(will create/path/to/data/research_chroma_dbs) - Example:
~/.research_assistant_mcp(will create~/.research_assistant_mcp/research_chroma_dbs)
- A
Create a .env file with your configuration:
OPENAI_API_KEY=your-api-key-here
RESEARCH_DB_PATH=/path/to/data
Claude Desktop Configuration
MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
{
"mcpServers": {
"research-assistant": {
"command": "uvx",
"args": ["research-assistant-mcp"],
"env": {
"OPENAI_API_KEY": "your-api-key-here",
"RESEARCH_DB_PATH": "/path/to/data"
}
}
}
}
Note: Both OPENAI_API_KEY and RESEARCH_DB_PATH are required. The database will be stored in RESEARCH_DB_PATH/research_chroma_dbs/.
Available Tools
1. save_research_data
Save research content to vector database for future retrieval.
Parameters:
content(List[str]): List of text content to savetopic(str): Topic name for organizing the data (creates separate DB)
Example:
Save these research findings about AI to the "artificial-intelligence" topic
2. query_research_data
Query saved research content using natural language.
Parameters:
query(str): Natural language querytopic(str): Topic to search in (default: "default")k(int): Number of results to return (default: 5)
Example:
Query the "artificial-intelligence" topic for information about transformers
3. list_topics
List all available research topics and their document counts.
Example:
List all available research topics
4. delete_topic
Delete a research topic and all its associated data.
Parameters:
topic(str): Topic name to delete
Example:
Delete the "old-research" topic
5. get_topic_info
Get detailed information about a specific topic.
Parameters:
topic(str): Topic name
Example:
Get information about the "artificial-intelligence" topic
Usage Examples
Once configured with Claude Desktop or another MCP client, you can:
- "Save this article about machine learning to my 'ml-research' topic"
- "Query my 'ml-research' for information about neural networks"
- "List all my research topics"
- "Get information about the 'quantum-computing' topic"
- "Delete the 'old-notes' topic"
Technical Details
- Protocol: Model Context Protocol (MCP)
- Transport: stdio
- Vector Database: ChromaDB
- Embeddings: OpenAI text-embedding-3-small
- Storage: Local filesystem at
RESEARCH_DB_PATH/research_chroma_dbs/
Requirements
- Python 3.11 or higher
- OpenAI API key
- Dependencies: chromadb, langchain, fastmcp, openai
Development
Setup Development Environment
# Clone the repository
git clone https://github.com/laxmimerit/research-assistant-mcp.git
cd research-assistant-mcp
# Install with development dependencies
uv pip install -e .
License
This project is licensed under the MIT License - see the LICENSE file for details.
Author
Laxmi Kant Tiwari
- Email: info@kgptalkie.com
- GitHub: https://github.com/laxmimerit
Acknowledgments
- Built with FastMCP
- Uses ChromaDB for vector storage
- Powered by LangChain
- Implements the Model Context Protocol
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file research_assistant_mcp-0.1.0.tar.gz.
File metadata
- Download URL: research_assistant_mcp-0.1.0.tar.gz
- Upload date:
- Size: 8.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d6442817a3ec202809ee9d7d74ef328e5f1a270555e2b74958081cdfc9f147ed
|
|
| MD5 |
51b2dd8e57de26b134ba184801214d14
|
|
| BLAKE2b-256 |
7ab6a4f3afa8402040fecbfc30e0df40a9d2912e65f7e0bf36b5e48322f28296
|
File details
Details for the file research_assistant_mcp-0.1.0-py3-none-any.whl.
File metadata
- Download URL: research_assistant_mcp-0.1.0-py3-none-any.whl
- Upload date:
- Size: 7.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ad799cf77429a299d66ee838baff1b63f47217dcbee275b2e3693599431acf69
|
|
| MD5 |
28cc2a86467c38bb76e6170af13f4759
|
|
| BLAKE2b-256 |
81c3e6f1a58e1bef8fc63527c5760f46320f211daf0a6b719d5cf4b0337f4dec
|