A flexible arXiv search and analysis service with MCP protocol support

Project description

ArXiv MCP Server

🔍 Enable AI assistants to search and access arXiv papers through a simple MCP interface.

The ArXiv MCP Server provides a bridge between AI assistants and arXiv's research repository through the Model Context Protocol (MCP). It allows AI models to search for papers and access their content in a programmatic way.

🤝 Contribute • 📝 Report Bug

✨ Core Features

🔎 Paper Search: Query arXiv papers with filters for date ranges and categories
📄 Paper Access: Download and read paper content
📋 Paper Listing: View all downloaded papers
🗃️ Local Storage: Papers are saved locally for faster access
📝 Prompts: A Set of Research Prompts

💼 Pro Features

🧠 Semantic Search: semantic_search finds conceptually similar papers using local embeddings.
🔁 Index Rebuild: reindex rebuilds the local semantic index from downloaded papers.
🕸️ Citation Graph: citation_graph returns references and citation backlinks via Semantic Scholar.
🔔 Research Alerts: watch_topic and check_alerts track topics and return newly published papers.
🧩 Advanced Prompts: summarize_paper, compare_papers, and literature_review for deeper workflows.

Install pro extras in development environments:

uv pip install -e ".[pro]"

🚀 Quick Start

Installing via Smithery

To install ArXiv Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install arxiv-mcp-server --client claude

Installing Manually

Important — use uv tool install, not uv pip install

Running uv pip install arxiv-mcp-server installs the package into the current virtual environment but does not place the arxiv-mcp-server executable on your PATH. You must use uv tool install so that uv creates an isolated environment and exposes the executable globally:

uv tool install arxiv-mcp-server

After this, the arxiv-mcp-server command will be available on your PATH.

PDF fallback (older papers): Most arXiv papers have an HTML version which the base install handles automatically. For older papers that only have a PDF, the server needs the [pdf] extra (pymupdf4llm). Install it with:
uv tool install 'arxiv-mcp-server[pdf]'

You can verify it with:

arxiv-mcp-server --help

If you previously ran uv pip install arxiv-mcp-server and the command is missing, uninstall it and re-install with uv tool install as shown above.

For development:

# Clone and set up development environment
git clone https://github.com/blazickjp/arxiv-mcp-server.git
cd arxiv-mcp-server

# Create and activate virtual environment
uv venv
source .venv/bin/activate

# Install with test dependencies (development only — no global executable)
uv pip install -e ".[test]"

🔌 MCP Integration

Add this configuration to your MCP client config file:

{
    "mcpServers": {
        "arxiv-mcp-server": {
            "command": "uv",
            "args": [
                "tool",
                "run",
                "arxiv-mcp-server",
                "--storage-path", "/path/to/paper/storage"
            ]
        }
    }
}

For Development:

{
    "mcpServers": {
        "arxiv-mcp-server": {
            "command": "uv",
            "args": [
                "--directory",
                "path/to/cloned/arxiv-mcp-server",
                "run",
                "arxiv-mcp-server",
                "--storage-path", "/path/to/paper/storage"
            ]
        }
    }
}

🔒 Security Note

arXiv papers are user-generated, untrusted content. Paper text returned by this server may contain prompt injection attempts — crafted text designed to manipulate an AI assistant's behavior. Treat all paper content as untrusted input.

In production environments, apply appropriate sandboxing and avoid feeding raw paper content into agentic pipelines that have access to sensitive tools or data without review. See SECURITY.md for the full security policy.

💡 Available Tools

Core Workflow

The typical workflow for deep paper research is:

search_papers → download_paper → read_paper

list_papers shows what you have locally. semantic_search searches across your local collection.

1. Paper Search

Search arXiv with optional category, date, and boolean filters. Enforces arXiv's 3-second rate limit automatically. If rate limited, wait 60 seconds before retrying.

result = await call_tool("search_papers", {
    "query": "\"KAN\" OR \"Kolmogorov-Arnold Networks\"",
    "max_results": 10,
    "date_from": "2024-01-01",
    "categories": ["cs.LG", "cs.AI"],
    "sort_by": "date"   # or "relevance" (default)
})

Supported categories include cs.AI, cs.LG, cs.CL, cs.CV, cs.NE, stat.ML, math.OC, quant-ph, eess.SP, and more. See tool description for the full list.

2. Paper Download

Download a paper by its arXiv ID. Tries HTML first, falls back to PDF. Stores the paper locally for read_paper and semantic_search.

result = await call_tool("download_paper", {
    "paper_id": "2401.12345"
})

For older papers that only have a PDF, install the [pdf] extra: uv tool install 'arxiv-mcp-server[pdf]'

3. List Papers

List all papers downloaded locally. Returns arXiv IDs only — use read_paper to access content.

result = await call_tool("list_papers", {})

4. Read Paper

Read the full text of a locally downloaded paper in markdown. Requires download_paper to be called first.

result = await call_tool("read_paper", {
    "paper_id": "2401.12345"
})

5. Semantic Search (Pro)

Semantic similarity search over your locally downloaded papers only. Returns empty results if no papers have been downloaded yet.

result = await call_tool("semantic_search", {
    "query": "test-time adaptation in multimodal transformers",
    "max_results": 5
})
# or find papers similar to a known paper:
result = await call_tool("semantic_search", {
    "paper_id": "2404.19756",
    "max_results": 5
})

6. Citation Graph (Pro)

Fetch references and citing papers via Semantic Scholar. Works on any arXiv ID — no local download required.

result = await call_tool("citation_graph", {
    "paper_id": "2401.12345"
})

7. Research Alerts (Pro)

Save topic watches and poll for newly published papers since last check. Uses the same query syntax as search_papers.

# Register a watch (idempotent — calling again updates the existing watch)
await call_tool("watch_topic", {
    "topic": "\"multi-agent reinforcement learning\"",
    "categories": ["cs.AI", "cs.LG"],
    "max_results": 10
})

# Check all watches — returns only papers published since last check
result = await call_tool("check_alerts", {})

# Check a single watch
result = await call_tool("check_alerts", {"topic": "\"multi-agent reinforcement learning\""})

📝 Research Prompts

The server offers specialized prompts to help analyze academic papers:

Paper Analysis Prompt

A comprehensive workflow for analyzing academic papers that only requires a paper ID:

result = await call_prompt("deep-paper-analysis", {
    "paper_id": "2401.12345"
})

This prompt includes:

Detailed instructions for using available tools (list_papers, download_paper, read_paper, search_papers)
A systematic workflow for paper analysis
Comprehensive analysis structure covering:
- Executive summary
- Research context
- Methodology analysis
- Results evaluation
- Practical and theoretical implications
Future research directions
Broader impacts

Pro Prompt Pack

summarize_paper: concise structured summary for one paper.
compare_papers: side-by-side technical comparison across paper IDs.
literature_review: thematic synthesis across a topic and optional paper set.

⚙️ Configuration

Configure through environment variables:

Variable	Purpose	Default
`ARXIV_STORAGE_PATH`	Paper storage location	~/.arxiv-mcp-server/papers

🧪 Testing

Run the test suite:

python -m pytest

📄 License

Released under the MIT License. See the LICENSE file for details.

Made with ❤️ by the Pearl Labs Team

Project details

Release history Release notifications | RSS feed

0.4.11

Apr 3, 2026

0.4.10

Apr 3, 2026

0.4.9

Apr 3, 2026

0.4.8

Apr 3, 2026

0.4.7

Apr 3, 2026

0.4.6

Apr 3, 2026

0.4.5

Apr 3, 2026

This version

0.4.4

Apr 3, 2026

0.4.3

Apr 3, 2026

0.4.1

Apr 3, 2026

0.4.0

Apr 2, 2026

0.3.2

Jan 26, 2026

0.3.1

Aug 19, 2025

0.3.0

Aug 11, 2025

0.2.11

Jun 6, 2025

0.2.10

Apr 22, 2025

0.2.9

Apr 4, 2025

0.2.8

Mar 12, 2025

0.2.7

Mar 11, 2025

0.2.6

Jan 21, 2025

0.2.5

Jan 8, 2025

0.2.4

Dec 8, 2024

0.2.3

Dec 7, 2024

0.2.2

Dec 4, 2024

0.2.1

Dec 4, 2024

0.2.0

Dec 4, 2024

0.1.0

Dec 3, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

arxiv_mcp_server-0.4.4.tar.gz (212.4 kB view details)

Uploaded Apr 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

arxiv_mcp_server-0.4.4-py3-none-any.whl (43.2 kB view details)

Uploaded Apr 3, 2026 Python 3

File details

Details for the file arxiv_mcp_server-0.4.4.tar.gz.

File metadata

Download URL: arxiv_mcp_server-0.4.4.tar.gz
Upload date: Apr 3, 2026
Size: 212.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.4

File hashes

Hashes for arxiv_mcp_server-0.4.4.tar.gz
Algorithm	Hash digest
SHA256	`e36f52ac18221a836b74beea67a2136d909f47dee26d47de7d7ad981b276162e`
MD5	`84319f727b0b66b6e4529cb14f637c9b`
BLAKE2b-256	`c1dbb83b57d153802ec1725a98c0d1235b8848e45cb6bf6b66591b441a4bef53`

See more details on using hashes here.

File details

Details for the file arxiv_mcp_server-0.4.4-py3-none-any.whl.

File metadata

Download URL: arxiv_mcp_server-0.4.4-py3-none-any.whl
Upload date: Apr 3, 2026
Size: 43.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.4

File hashes

Hashes for arxiv_mcp_server-0.4.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c305e5dd44f069c8f1a6f4cc418a326391ee30573d17747dccb87f706eee6f5a`
MD5	`b6acb3f7c51082924d49c94e24046152`
BLAKE2b-256	`4d89dd3dfcdaaf08bb7fd5a056ce7516f684cdeeaa51cbd4f6d3c9791bcdc901`

See more details on using hashes here.

arxiv-mcp-server 0.4.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

ArXiv MCP Server

✨ Core Features

💼 Pro Features

🚀 Quick Start

Installing via Smithery

Installing Manually

🔌 MCP Integration

🔒 Security Note

💡 Available Tools

Core Workflow

1. Paper Search

2. Paper Download

3. List Papers

4. Read Paper

5. Semantic Search (Pro)

6. Citation Graph (Pro)

7. Research Alerts (Pro)

📝 Research Prompts

Paper Analysis Prompt

Pro Prompt Pack

⚙️ Configuration

🧪 Testing

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes