Skip to main content

A MCP server for searching and downloading academic papers from multiple sources.

Project description

📚 Browse MCP

English | 中文

🔬 browse-mcp is a Python-based MCP server that enables users to search, download, and read academic papers from various platforms. It provides three main tools:

  • 🔎 paper_search: Search papers across multiple academic databases
  • 📥 paper_download: Download paper PDFs, return paths of downloaded files
  • 📖 paper_read: Extract and read text content from papers

PyPI License Python


📑 Table of Contents


✨ Features

  • 🌐 Multi-Source Support: Search and download papers from 19+ academic databases including arXiv, PubMed, PubMed Central, bioRxiv, medRxiv, Google Scholar, IACR ePrint Archive, Semantic Scholar, CrossRef, Science Direct, Springer, IEEE Xplore, Scopus, CORE, and more.
  • 🎯 Unified Interface: All platforms accessible through consistent paper_search, paper_download, and paper_read tools.
  • 📊 Standardized Output: Papers are returned in a consistent dictionary format via the Paper class.
  • Asynchronous Operations: Efficiently handles concurrent searches and downloads using httpx and async/await.
  • 🔌 MCP Integration: Compatible with MCP clients for LLM context enhancement.
  • 🧩 Extensible Design: Easily add new academic platforms by extending the sources module.

🎬 Screenshot

Screenshot

📝 Supported Academic Platforms

✅ Fully Implemented (19 sources)

Free & Open Access:

  • arXiv - Pre-print repository for physics, mathematics, CS, and more
  • PubMed - Biomedical literature database
  • PubMed Central (PMC) - Free full-text biomedical and life sciences articles
  • bioRxiv - Pre-print server for biology
  • medRxiv - Pre-print server for health sciences
  • Semantic Scholar - AI-powered research tool
  • CrossRef - DOI registration agency and metadata provider
  • Google Scholar - Academic search engine
  • IACR ePrint Archive - Cryptology pre-prints
  • CORE - Open access research papers aggregator

API Key Required:

  • Science Direct - Elsevier's full-text scientific database (requires Elsevier API key)
  • Springer Link - Springer's scientific publications (requires Springer API key)
  • IEEE Xplore - IEEE's digital library (requires IEEE API key)
  • Scopus - Elsevier's abstract and citation database (requires Scopus API key)

Institutional Access Required:

  • ACM Digital Library - ACM's computing literature (no public API)
  • Web of Science - Clarivate's citation database (requires subscription)
  • JSTOR - Digital library of academic journals (no public API)
  • ResearchGate - Academic social network (no official API)

Retired Services:

  • Microsoft Academic - Service retired December 31, 2021 (placeholder implementation)

📦 Installation

browse-mcp can be installed using uv or pip. Below are detailed installation guides for different scenarios.

⚡ Quick Install

Install the package:

pip install browse-mcp

Start the MCP server:

browse-mcp

🔧 MCP Client Configuration

Choose your MCP client and follow the configuration steps:

1️⃣ Claude Desktop

Location:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json

Configuration:

{
  "mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}
2️⃣ Claude Code (CLI)

Location: ~/.config/claude/config.json

Configuration:

{
  "mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}

Verify Installation:

# Check if browse-mcp is loaded
claude mcp list

# Test the server
claude mcp test browse-mcp
3️⃣ Cline (VS Code Extension)

Location: VS Code Settings → Extensions → Cline → MCP Settings

Method 1: Through VS Code Settings UI

  1. Open VS Code Settings (Cmd/Ctrl + ,)
  2. Search for "Cline MCP"
  3. Click "Edit in settings.json"
  4. Add the configuration:
{
  "cline.mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}

Method 2: Direct settings.json Edit

Edit ~/.config/Code/User/settings.json (Linux/macOS) or %APPDATA%\Code\User\settings.json (Windows):

{
  "cline.mcpServers": {
    "browse-mcp": {
      "command": "python",
      "args": ["-m", "browse_mcp"],
      "env": {
        "SEMANTIC_SCHOLAR_API_KEY": "",
        "SCIENCEDIRECT_API_KEY": "",
        "SPRINGER_API_KEY": "",
        "IEEE_API_KEY": "",
        "SCOPUS_API_KEY": "",
        "CORE_API_KEY": "",
        "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
        "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
        "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
      }
    }
  }
}
4️⃣ Zed Editor

Location: ~/.config/zed/settings.json

Configuration:

{
  "context_servers": {
    "browse-mcp": {
      "command": {
        "path": "python",
        "args": ["-m", "browse_mcp"]
      },
      "settings": {
        "env": {
          "SEMANTIC_SCHOLAR_API_KEY": "",
          "SCIENCEDIRECT_API_KEY": "",
          "SPRINGER_API_KEY": "",
          "IEEE_API_KEY": "",
          "SCOPUS_API_KEY": "",
          "CORE_API_KEY": "",
          "BROWSE_MCP_ENABLED_SOURCES": "arxiv,pubmed,pmc,biorxiv,medrxiv,semantic,core,crossref,google_scholar,iacr",
          "BROWSE_MCP_DISABLED_SOURCES": "ieee,scopus,springer,sciencedirect,wos,acm,jstor",
          "BROWSE_MCP_DOWNLOAD_PATH": "./downloads"
        }
      }
    }
  }
}
5️⃣ Custom MCP Client

For other MCP clients, use the standard MCP server configuration:

Server Command:

python -m browse_mcp

Environment Variables:

  • SEMANTIC_SCHOLAR_API_KEY: Optional API key for Semantic Scholar
  • SCIENCEDIRECT_API_KEY: Optional API key for Science Direct
  • SPRINGER_API_KEY: Optional API key for Springer Link
  • IEEE_API_KEY: Optional API key for IEEE Xplore
  • SCOPUS_API_KEY: Optional API key for Scopus
  • CORE_API_KEY: Optional API key for CORE
  • BROWSE_MCP_DOWNLOAD_PATH: Download directory (default: ./downloads)

Server Capabilities:

  • Tools: paper_search, paper_download, paper_read
  • Transport: stdio
  • Protocol: MCP 1.0

⚙️ Environment Variables

API Keys (optional - only for premium services):

  • SEMANTIC_SCHOLAR_API_KEY: Semantic Scholar (Get API Key)
  • SCIENCEDIRECT_API_KEY: Elsevier Science Direct (Get API Key)
  • SPRINGER_API_KEY: Springer Nature (Get API Key)
  • IEEE_API_KEY: IEEE Xplore (Get API Key)
  • SCOPUS_API_KEY: Elsevier Scopus (Get API Key)
  • CORE_API_KEY: CORE aggregator (Get API Key)
  • WOS_API_KEY: Web of Science (requires institutional subscription)

General Settings:

  • BROWSE_MCP_DOWNLOAD_PATH: Directory for downloaded PDFs (default: ./downloads)

Source Control:

  • BROWSE_MCP_ENABLED_SOURCES: Comma-separated list to enable specific sources (whitelist)
  • BROWSE_MCP_DISABLED_SOURCES: Comma-separated list to disable specific sources (blacklist)
  • If both are set, BROWSE_MCP_ENABLED_SOURCES takes precedence
  • If neither is set, all 18 sources are enabled by default

Available Source Names (18 total):

Source Name Type API Key Required Description
arxiv Free - Preprint repository for physics, mathematics, computer science
pubmed Free - Biomedical literature from MEDLINE
pmc Free - PubMed Central full-text archive
biorxiv Free - Preprint server for biology
medrxiv Free - Preprint server for health sciences
google_scholar Free - Google Scholar search
iacr Free - International Association for Cryptologic Research
semantic Free SEMANTIC_SCHOLAR_API_KEY (optional)
Get API Key
Semantic Scholar AI-powered search (higher rate limits with API key)
crossref Free - Crossref DOI metadata
core Free CORE_API_KEY
Get API Key
CORE aggregator of open access papers
ieee Premium IEEE_API_KEY
Get API Key
IEEE Xplore digital library
scopus Premium SCOPUS_API_KEY
Get API Key
Elsevier Scopus database
springer Premium SPRINGER_API_KEY
Get API Key
Springer publications
sciencedirect Premium SCIENCEDIRECT_API_KEY
Get API Key
Elsevier ScienceDirect
wos Premium WOS_API_KEY
Institutional Access
Web of Science (requires institutional subscription)
acm Premium - ACM Digital Library
jstor Premium - JSTOR archive
researchgate Free - ResearchGate social network

🚀 Usage

Once configured, browse-mcp provides three main tools accessible through Claude Desktop or any MCP-compatible client.

1. Search Papers (paper_search)

Search for academic papers across multiple sources:

Basic Search Examples:

# Search arXiv for machine learning papers
paper_search([
    {"searcher": "arxiv", "query": "machine learning", "max_results": 5}
])

# Search PubMed Central for biomedical papers
paper_search([
    {"searcher": "pmc", "query": "cancer treatment", "max_results": 5}
])

# Search CORE for open access papers
paper_search([
    {"searcher": "core", "query": "climate change", "max_results": 5}
])

Multi-Platform Search:

# Search multiple platforms simultaneously
paper_search([
    {"searcher": "arxiv", "query": "deep learning", "max_results": 5},
    {"searcher": "pubmed", "query": "cancer immunotherapy", "max_results": 3},
    {"searcher": "pmc", "query": "diabetes treatment", "max_results": 3},
    {"searcher": "semantic", "query": "climate change", "max_results": 4, "year": "2020-2023"}
])

Premium Sources (require API keys):

# Search IEEE Xplore (requires IEEE_API_KEY)
paper_search([
    {"searcher": "ieee", "query": "neural networks", "max_results": 5}
])

# Search Springer Link (requires SPRINGER_API_KEY)
paper_search([
    {"searcher": "springer", "query": "quantum computing", "max_results": 5}
])

# Search Scopus (requires SCOPUS_API_KEY)
paper_search([
    {"searcher": "scopus", "query": "artificial intelligence", "max_results": 5}
])

Search All Platforms:

# Search all platforms (omit "searcher" parameter)
paper_search([
    {"query": "quantum computing", "max_results": 10}
])

2. Download Papers (paper_download)

Download paper PDFs using their identifiers:

# Download from free sources
paper_download([
    {"searcher": "arxiv", "paper_id": "2106.12345"},
    {"searcher": "pubmed", "paper_id": "32790614"},
    {"searcher": "pmc", "paper_id": "PMC7419405"},
    {"searcher": "biorxiv", "paper_id": "10.1101/2020.01.01.123456"},
    {"searcher": "semantic", "paper_id": "DOI:10.18653/v1/N18-3011"}
])

# Download from CORE (open access)
paper_download([
    {"searcher": "core", "paper_id": "123456789"}
])

Note: Premium sources (IEEE, Springer, Science Direct, Scopus) require institutional access or subscriptions for PDF downloads.

3. Read Papers (paper_read)

Extract and read text content from papers:

# Read papers from free sources
paper_read(searcher="arxiv", paper_id="2106.12345")
paper_read(searcher="pubmed", paper_id="32790614")
paper_read(searcher="pmc", paper_id="PMC7419405")
paper_read(searcher="biorxiv", paper_id="10.1101/2020.01.01.123456")
paper_read(searcher="semantic", paper_id="DOI:10.18653/v1/N18-3011")
paper_read(searcher="core", paper_id="123456789")

🛠️ For Development

For developers who want to modify the code or contribute:

  1. Setup Environment:

    # Install uv if not installed
    curl -LsSf https://astral.sh/uv/install.sh | sh
    
    # Clone repository
    git clone https://github.com/LinXueyuanStdio/browse-mcp.git
    cd browse-mcp
    
    # Create and activate virtual environment
    uv venv
    source .venv/bin/activate  # On Windows: .venv\Scripts\activate
    
  2. Install Dependencies:

    # Install dependencies (recommended)
    uv pip install -e .
    
    # Add development dependencies (optional)
    uv pip install pytest flake8
    

🤝 Contributing

We welcome contributions! Here's how to get started:

  1. Fork the Repository: Click "Fork" on GitHub.

  2. Clone and Set Up:

    git clone https://github.com/yourusername/browse-mcp.git
    cd browse-mcp
    uv pip install -e .  # Install in development mode
    
  3. Make Changes:

    • Add new platforms in browse_mcp/sources/.
    • Update tests in tests/.
  4. Submit a Pull Request: Push changes and create a PR on GitHub.

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.


Happy researching with browse-mcp! If you encounter issues, open a GitHub issue.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

browse_mcp-0.1.9.tar.gz (42.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

browse_mcp-0.1.9-py3-none-any.whl (60.5 kB view details)

Uploaded Python 3

File details

Details for the file browse_mcp-0.1.9.tar.gz.

File metadata

  • Download URL: browse_mcp-0.1.9.tar.gz
  • Upload date:
  • Size: 42.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.11.5 Darwin/25.2.0

File hashes

Hashes for browse_mcp-0.1.9.tar.gz
Algorithm Hash digest
SHA256 455410d2a62274e926f2a49b111a96290b60a9a9f93ffd1e3b5b80416852ea82
MD5 364b47002502ec19c56d1db7bffbef17
BLAKE2b-256 f11a28286b22fd6da799148a6361c92a7cf1389342c3c641c691d78b52ff7b47

See more details on using hashes here.

File details

Details for the file browse_mcp-0.1.9-py3-none-any.whl.

File metadata

  • Download URL: browse_mcp-0.1.9-py3-none-any.whl
  • Upload date:
  • Size: 60.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.11.5 Darwin/25.2.0

File hashes

Hashes for browse_mcp-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 a9ae7135b74af61384cd71816d3e9f75293cf9e29b15872e65717cb8ad1df163
MD5 be9ba87e4c42f7dc963241a92262d50d
BLAKE2b-256 e6ba2ca12953e1e9a342b72fa9e46f10ecdd79e800aa2e51ec5db424c251e3a4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page