Skip to main content

A MCP server for searching and downloading academic papers from multiple sources.

Project description

📚 Browse MCP

English | 中文

🔬 browse-mcp is a Python-based MCP server that enables users to search, download, and read academic papers from various platforms. It provides three main tools:

  • 🔎 paper_search: Search papers across multiple academic databases
  • 📥 paper_download: Download paper PDFs, return paths of downloaded files
  • 📖 paper_read: Extract and read text content from papers

PyPI License Python


📑 Table of Contents


✨ Features

  • 🌐 Multi-Source Support: Search and download papers from 19+ academic databases including arXiv, PubMed, PubMed Central, bioRxiv, medRxiv, Google Scholar, IACR ePrint Archive, Semantic Scholar, CrossRef, Science Direct, Springer, IEEE Xplore, Scopus, CORE, and more.
  • 🎯 Unified Interface: All platforms accessible through consistent paper_search, paper_download, and paper_read tools.
  • 📊 Standardized Output: Papers are returned in a consistent dictionary format via the Paper class.
  • Asynchronous Operations: Efficiently handles concurrent searches and downloads using httpx and async/await.
  • 🔌 MCP Integration: Compatible with MCP clients for LLM context enhancement.
  • 🧩 Extensible Design: Easily add new academic platforms by extending the sources module.

🎬 Screenshot

Screenshot

📝 Supported Academic Platforms

✅ Fully Implemented (19 sources)

Free & Open Access:

  • arXiv - Pre-print repository for physics, mathematics, CS, and more
  • PubMed - Biomedical literature database
  • PubMed Central (PMC) - Free full-text biomedical and life sciences articles
  • bioRxiv - Pre-print server for biology
  • medRxiv - Pre-print server for health sciences
  • Semantic Scholar - AI-powered research tool
  • CrossRef - DOI registration agency and metadata provider
  • Google Scholar - Academic search engine
  • IACR ePrint Archive - Cryptology pre-prints
  • CORE - Open access research papers aggregator

API Key Required:

  • Science Direct - Elsevier's full-text scientific database (requires Elsevier API key)
  • Springer Link - Springer's scientific publications (requires Springer API key)
  • IEEE Xplore - IEEE's digital library (requires IEEE API key)
  • Scopus - Elsevier's abstract and citation database (requires Scopus API key)

Institutional Access Required:

  • ACM Digital Library - ACM's computing literature (no public API)
  • Web of Science - Clarivate's citation database (requires subscription)
  • JSTOR - Digital library of academic journals (no public API)
  • ResearchGate - Academic social network (no official API)

Retired Services:

  • Microsoft Academic - Service retired December 31, 2021 (placeholder implementation)

📦 Installation

browse-mcp can be installed using uv or pip. Below are detailed installation guides for different scenarios.

⚡ Quick Install

Install the package:

pip install browse-mcp

Start the MCP server:

browse-mcp

🔧 MCP Client Configuration

Choose your MCP client and follow the configuration steps:

1️⃣ Claude Desktop

Location:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json

Configuration:

{
  "mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}
2️⃣ Claude Code (CLI)

Location: ~/.config/claude/config.json

Configuration:

{
  "mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}

Verify Installation:

# Check if browse-mcp is loaded
claude mcp list

# Test the server
claude mcp test browse-mcp
3️⃣ Cline (VS Code Extension)

Location: VS Code Settings → Extensions → Cline → MCP Settings

Method 1: Through VS Code Settings UI

  1. Open VS Code Settings (Cmd/Ctrl + ,)
  2. Search for "Cline MCP"
  3. Click "Edit in settings.json"
  4. Add the configuration:
{
  "cline.mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}

Method 2: Direct settings.json Edit

Edit ~/.config/Code/User/settings.json (Linux/macOS) or %APPDATA%\Code\User\settings.json (Windows):

{
  "cline.mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}
4️⃣ Zed Editor

Location: ~/.config/zed/settings.json

Configuration:

{
  "context_servers": {
    "browse-mcp": {
      "command": {
        "path": "python",
        "args": ["-m", "browse_mcp"]
      },
      "settings": {
        "env": {
          "SEMANTIC_SCHOLAR_API_KEY": "",
          "SCIENCEDIRECT_API_KEY": "",
          "SPRINGER_API_KEY": "",
          "IEEE_API_KEY": "",
          "SCOPUS_API_KEY": "",
          "CORE_API_KEY": "",
          "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
          "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
          "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
        }
      }
    }
  }
}
5️⃣ Custom MCP Client

For other MCP clients, use the standard MCP server configuration:

Server Command:

python -m browse_mcp

Environment Variables:

  • SEMANTIC_SCHOLAR_API_KEY: Optional API key for Semantic Scholar
  • SCIENCEDIRECT_API_KEY: Optional API key for Science Direct
  • SPRINGER_API_KEY: Optional API key for Springer Link
  • IEEE_API_KEY: Optional API key for IEEE Xplore
  • SCOPUS_API_KEY: Optional API key for Scopus
  • CORE_API_KEY: Optional API key for CORE
  • BROWSE_MCP_DOWNLOAD_PATH: Download directory (default: ./downloads)

Server Capabilities:

  • Tools: paper_search, paper_download, paper_read
  • Transport: stdio
  • Protocol: MCP 1.0

⚙️ Environment Variables

API Keys (optional - only for premium services):

  • SEMANTIC_SCHOLAR_API_KEY: Semantic Scholar (Get API Key)
  • SCIENCEDIRECT_API_KEY: Elsevier Science Direct (Get API Key)
  • SPRINGER_API_KEY: Springer Nature (Get API Key)
  • IEEE_API_KEY: IEEE Xplore (Get API Key)
  • SCOPUS_API_KEY: Elsevier Scopus (Get API Key)
  • CORE_API_KEY: CORE aggregator (Get API Key)
  • WOS_API_KEY: Web of Science (requires institutional subscription)

General Settings:

  • BROWSE_MCP_DOWNLOAD_PATH: Directory for downloaded PDFs (default: ./downloads)

Source Control:

  • BROWSE_MCP_ENABLED_SOURCES: Comma-separated list to enable specific sources (whitelist)
  • BROWSE_MCP_DISABLED_SOURCES: Comma-separated list to disable specific sources (blacklist)
  • If both are set, BROWSE_MCP_ENABLED_SOURCES takes precedence
  • If neither is set, all 18 sources are enabled by default

Available Source Names (18 total):

Source Name Type API Key Required Description
arxiv Free - Preprint repository for physics, mathematics, computer science
pubmed Free - Biomedical literature from MEDLINE
pmc Free - PubMed Central full-text archive
biorxiv Free - Preprint server for biology
medrxiv Free - Preprint server for health sciences
google_scholar Free - Google Scholar search
iacr Free - International Association for Cryptologic Research
semantic Free SEMANTIC_SCHOLAR_API_KEY (optional)
Get API Key
Semantic Scholar AI-powered search (higher rate limits with API key)
crossref Free - Crossref DOI metadata
core Free CORE_API_KEY
Get API Key
CORE aggregator of open access papers
ieee Premium IEEE_API_KEY
Get API Key
IEEE Xplore digital library
scopus Premium SCOPUS_API_KEY
Get API Key
Elsevier Scopus database
springer Premium SPRINGER_API_KEY
Get API Key
Springer publications
sciencedirect Premium SCIENCEDIRECT_API_KEY
Get API Key
Elsevier ScienceDirect
wos Premium WOS_API_KEY
Institutional Access
Web of Science (requires institutional subscription)
acm Premium - ACM Digital Library
jstor Premium - JSTOR archive
researchgate Free - ResearchGate social network

🚀 Usage

Once configured, browse-mcp provides three main tools accessible through Claude Desktop or any MCP-compatible client.

1. Search Papers (paper_search)

Search for academic papers across multiple sources:

Basic Search Examples:

# Search arXiv for machine learning papers
paper_search([
    {"searcher": "arxiv", "query": "machine learning", "max_results": 5}
])

# Search PubMed Central for biomedical papers
paper_search([
    {"searcher": "pmc", "query": "cancer treatment", "max_results": 5}
])

# Search CORE for open access papers
paper_search([
    {"searcher": "core", "query": "climate change", "max_results": 5}
])

Multi-Platform Search:

# Search multiple platforms simultaneously
paper_search([
    {"searcher": "arxiv", "query": "deep learning", "max_results": 5},
    {"searcher": "pubmed", "query": "cancer immunotherapy", "max_results": 3},
    {"searcher": "pmc", "query": "diabetes treatment", "max_results": 3},
    {"searcher": "semantic", "query": "climate change", "max_results": 4, "year": "2020-2023"}
])

Premium Sources (require API keys):

# Search IEEE Xplore (requires IEEE_API_KEY)
paper_search([
    {"searcher": "ieee", "query": "neural networks", "max_results": 5}
])

# Search Springer Link (requires SPRINGER_API_KEY)
paper_search([
    {"searcher": "springer", "query": "quantum computing", "max_results": 5}
])

# Search Scopus (requires SCOPUS_API_KEY)
paper_search([
    {"searcher": "scopus", "query": "artificial intelligence", "max_results": 5}
])

Search All Platforms:

# Search all platforms (omit "searcher" parameter)
paper_search([
    {"query": "quantum computing", "max_results": 10}
])

2. Download Papers (paper_download)

Download paper PDFs using their identifiers:

# Download from free sources
paper_download([
    {"searcher": "arxiv", "paper_id": "2106.12345"},
    {"searcher": "pubmed", "paper_id": "32790614"},
    {"searcher": "pmc", "paper_id": "PMC7419405"},
    {"searcher": "biorxiv", "paper_id": "10.1101/2020.01.01.123456"},
    {"searcher": "semantic", "paper_id": "DOI:10.18653/v1/N18-3011"}
])

# Download from CORE (open access)
paper_download([
    {"searcher": "core", "paper_id": "123456789"}
])

Note: Premium sources (IEEE, Springer, Science Direct, Scopus) require institutional access or subscriptions for PDF downloads.

3. Read Papers (paper_read)

Extract and read text content from papers:

# Read papers from free sources
paper_read(searcher="arxiv", paper_id="2106.12345")
paper_read(searcher="pubmed", paper_id="32790614")
paper_read(searcher="pmc", paper_id="PMC7419405")
paper_read(searcher="biorxiv", paper_id="10.1101/2020.01.01.123456")
paper_read(searcher="semantic", paper_id="DOI:10.18653/v1/N18-3011")
paper_read(searcher="core", paper_id="123456789")

🛠️ For Development

For developers who want to modify the code or contribute:

  1. Setup Environment:

    # Install uv if not installed
    curl -LsSf https://astral.sh/uv/install.sh | sh
    
    # Clone repository
    git clone https://github.com/LinXueyuanStdio/browse-mcp.git
    cd browse-mcp
    
    # Create and activate virtual environment
    uv venv
    source .venv/bin/activate  # On Windows: .venv\Scripts\activate
    
  2. Install Dependencies:

    # Install dependencies (recommended)
    uv pip install -e .
    
    # Add development dependencies (optional)
    uv pip install pytest flake8
    

🤝 Contributing

We welcome contributions! Here's how to get started:

  1. Fork the Repository: Click "Fork" on GitHub.

  2. Clone and Set Up:

    git clone https://github.com/yourusername/browse-mcp.git
    cd browse-mcp
    uv pip install -e .  # Install in development mode
    
  3. Make Changes:

    • Add new platforms in browse_mcp/sources/.
    • Update tests in tests/.
  4. Submit a Pull Request: Push changes and create a PR on GitHub.

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.


Happy researching with browse-mcp! If you encounter issues, open a GitHub issue.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

browse_mcp-0.1.8.tar.gz (39.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

browse_mcp-0.1.8-py3-none-any.whl (57.0 kB view details)

Uploaded Python 3

File details

Details for the file browse_mcp-0.1.8.tar.gz.

File metadata

  • Download URL: browse_mcp-0.1.8.tar.gz
  • Upload date:
  • Size: 39.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.11.5 Darwin/25.2.0

File hashes

Hashes for browse_mcp-0.1.8.tar.gz
Algorithm Hash digest
SHA256 f14cf55c909252118e3a4be968442bd6721b8ffccfbfdc03b55b865f9066a8cd
MD5 92a3478bdec249a73a5ae96943a552f4
BLAKE2b-256 3e2567daeee28f2e15f2bf8ea6fc09009ed7a047665d22fc32c0475a4b31097a

See more details on using hashes here.

File details

Details for the file browse_mcp-0.1.8-py3-none-any.whl.

File metadata

  • Download URL: browse_mcp-0.1.8-py3-none-any.whl
  • Upload date:
  • Size: 57.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.11.5 Darwin/25.2.0

File hashes

Hashes for browse_mcp-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 a9ed3f20c19101cc05355bb7a351f55314a3e201e7cc56310c3c49a14f8a4a80
MD5 0de6c798aeb0e78b183af76c19f86a75
BLAKE2b-256 ce52e25c238fa27b20c3a27258a7591ef1a1f964e4e17496ae874ca728749725

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page