Skip to main content

A Model Context Protocol server for arXiv paper search and retrieval

Project description

arXiv MCP Server

A Model Context Protocol (MCP) server that provides arXiv paper search and retrieval capabilities. This server enables LLMs to search for academic papers on arXiv and get cleaned titles, abstracts, authors, and content without dealing with complex HTML parsing.

Features

  • Search papers by query, author, category, and date
  • Advanced search with specific field filters
  • Get detailed paper metadata (title, abstract, authors, categories)
  • Retrieve full paper content via Jina Reader
  • Browse recent papers by category
  • List all arXiv categories
  • Pagination support for search results

Available Tools

search

Search arXiv for papers matching a query.

Argument Type Required Description
query string Yes Search query (e.g., 'LLM', 'transformer')
category string No Filter by category (e.g., 'cs.AI', 'cs.LG')
author string No Filter by author name
sort_by string No Sort order: 'relevance', 'date_desc', 'date_asc'
page int No Page number (default: 1)
page_size int No Results per page, max 50 (default: 25)

searchAdvanced

Advanced search with specific field filters.

Argument Type Required Description
title string No Search in paper titles
abstract string No Search in abstracts
author string No Search by author name
category string No Filter by category
id_arxiv string No Search by arXiv ID pattern
date_from string No Start date (YYYY-MM-DD)
date_to string No End date (YYYY-MM-DD)
sort_by string No Sort order
page int No Page number
page_size int No Results per page

getPaper

Get detailed information about a specific arXiv paper.

Argument Type Required Description
id_or_url string Yes arXiv ID (e.g., '2301.00001') or full URL

getContent

Get the full text content of an arXiv paper using Jina Reader.

Argument Type Required Description
id_or_url string Yes arXiv ID or full URL

getRecent

Get recent papers from a specific arXiv category.

Argument Type Required Description
category string No Category code (default: 'cs.AI')
count int No Number of papers, max 50 (default: 10)

listCategories

List all common arXiv categories with their codes and names.

Installation

Using uv (Recommended)

# Clone the repository
git clone https://github.com/Ray0907/arXiv-mcp.git
cd arXiv-mcp

# Install with uv
uv sync

Using pip

# Clone the repository
git clone https://github.com/Ray0907/arXiv-mcp.git
cd arXiv-mcp

# Create virtual environment
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

# Install
pip install -e .

Configuration

Claude Desktop

Add to your Claude Desktop configuration (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):

{
  "mcpServers": {
    "arxiv": {
      "command": "uv",
      "args": [
        "--directory",
        "/path/to/arXiv-mcp",
        "run",
        "arxiv-mcp"
      ]
    }
  }
}

Claude Code

Add to your Claude Code MCP settings:

{
  "mcpServers": {
    "arxiv": {
      "command": "uv",
      "args": [
        "--directory",
        "/path/to/arXiv-mcp",
        "run",
        "arxiv-mcp"
      ]
    }
  }
}

Usage Examples

Search for papers about LLMs

Search for recent papers about "large language models"

Find papers by a specific author

Search for papers by "Yann LeCun" in the machine learning category

Get paper details

Get the details of arXiv paper 2301.00001

Browse recent papers

Show me the 10 most recent papers in cs.AI

Development

Run tests

uv run pytest

Run the server locally

uv run arxiv-mcp

Common arXiv Categories

Code Name
cs.AI Artificial Intelligence
cs.CL Computation and Language
cs.CV Computer Vision and Pattern Recognition
cs.LG Machine Learning
cs.NE Neural and Evolutionary Computing
stat.ML Machine Learning (Statistics)

Use listCategories tool to get the full list.

Changelog

v0.2.0

Breaking Changes:

  • Renamed entry point from arxiv-server.py to arxiv-mcp command
  • Renamed get tool to getContent for clarity

New Features:

  • searchAdvanced - Advanced search with title, abstract, date range filters
  • getPaper - Get detailed paper metadata (authors, categories, dates, PDF URL)
  • getRecent - Browse recent papers by category
  • listCategories - List 33 common arXiv categories
  • Pagination support (page, page_size parameters)
  • Sort options (relevance, date_desc, date_asc)
  • Filter by author and category in basic search

Improvements:

  • Migrated to pyproject.toml with uv for dependency management
  • Replaced requests with httpx (async-ready)
  • Added Pydantic models for type-safe data structures
  • Reduced dependencies from 33 to 4 core packages
  • Added proper timeout handling (30s)
  • Modular project structure (src/arxiv_mcp/)

v0.1.0

  • Initial release
  • Basic search and get tools

License

MIT License - see LICENSE for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iflow_mcp_ray0907_arxiv_mcp-0.2.0.tar.gz (59.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

iflow_mcp_ray0907_arxiv_mcp-0.2.0-py3-none-any.whl (10.1 kB view details)

Uploaded Python 3

File details

Details for the file iflow_mcp_ray0907_arxiv_mcp-0.2.0.tar.gz.

File metadata

  • Download URL: iflow_mcp_ray0907_arxiv_mcp-0.2.0.tar.gz
  • Upload date:
  • Size: 59.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_ray0907_arxiv_mcp-0.2.0.tar.gz
Algorithm Hash digest
SHA256 fb9e68011286a24164dc49209bb6b1b9ed8b39793983e8b65729162a153240a4
MD5 b913cdccdbff249dda76841a3a168659
BLAKE2b-256 8f8263953a97e879ec428ebe08ef94ca2d1d1d1fa2672910b651c0e02e9a8f62

See more details on using hashes here.

File details

Details for the file iflow_mcp_ray0907_arxiv_mcp-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: iflow_mcp_ray0907_arxiv_mcp-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 10.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_ray0907_arxiv_mcp-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d9f18886af2ba1c3bdd6c4943c9728c061e000bf1d8566edee79471bbfd6383d
MD5 513c8c8b3e05cc9dbe753b7f930c11bc
BLAKE2b-256 c7df8697778fd546b00dbc021648554c0d15e777d77fc3a3503fa5765f027882

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page