MCP server for searching Google Scholar — papers, authors, citations, and BibTeX

google-scholar-mcp

An MCP (Model Context Protocol) server for searching Google Scholar: papers, authors, citations, and BibTeX entries. Designed to give AI assistants (Claude, etc.) access to academic literature search.

Features
Installation
Configuration
Usage
Examples
Rate Limiting
Troubleshooting
Contributing

Features

Paper Search: Query Google Scholar by keyword with filtering, sorting, and pagination
Author Lookup: Find researcher profiles with publication lists and h-index metrics
Citation Tracking: Retrieve papers that cite a given work
Paper Details: Get full metadata, citations-per-year graphs, and public access info
BibTeX Export: Generate citation entries in BibTeX format
Bulk Search: Batch search multiple queries with automatic rate limiting
Rate Limiting: Built-in delays between requests to avoid being blocked
Proxy Support: Optional proxy configuration (free, single, or ScraperAPI)

Installation

Requirements

Python 3.11 or later
Dependencies: mcp[cli]>=1.4.0, scholarly>=1.7.11, pydantic>=2.0 (see pyproject.toml)

Build from Source

git clone https://github.com/LWaetzig/google-scholar-mcp.git
cd google-scholar-mcp
pip install -e .

Note: This server uses the scholarly library to access Google Scholar. Respect Google's Terms of Service and use rate limiting appropriately to avoid being blocked.

Configuration

Configure the MCP server via environment variables:

Variable	Default	Description
`GS_MIN_DELAY`	`5.0`	Minimum seconds between requests
`GS_MAX_DELAY`	`15.0`	Maximum seconds between requests
`GS_MAX_RETRIES`	`3`	Number of retries on failure
`GS_PROXY_TYPE`	`none`	Proxy mode: `none`, `free`, `single`, `scraperapi`
`GS_PROXY_HTTP`	—	HTTP proxy URL (for `single` mode)
`GS_PROXY_HTTPS`	—	HTTPS proxy URL (for `single` mode)
`GS_SCRAPERAPI_KEY`	—	ScraperAPI key (for `scraperapi` mode)
`GS_TIMEOUT`	`30`	Request timeout in seconds
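
As a rough illustration of how these variables could be consumed, the sketch below reads them into a small settings object and falls back to the defaults from the table. The `Settings` class and `load_settings` function are hypothetical names used for illustration only; they are not taken from the server's source.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    """Hypothetical container mirroring the configuration table."""
    min_delay: float = 5.0
    max_delay: float = 15.0
    max_retries: int = 3
    proxy_type: str = "none"
    proxy_http: Optional[str] = None
    proxy_https: Optional[str] = None
    scraperapi_key: Optional[str] = None
    timeout: float = 30.0


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Build Settings from GS_* variables, using the table defaults when unset."""
    settings = Settings(
        min_delay=float(env.get("GS_MIN_DELAY", "5.0")),
        max_delay=float(env.get("GS_MAX_DELAY", "15.0")),
        max_retries=int(env.get("GS_MAX_RETRIES", "3")),
        proxy_type=env.get("GS_PROXY_TYPE", "none"),
        proxy_http=env.get("GS_PROXY_HTTP"),
        proxy_https=env.get("GS_PROXY_HTTPS"),
        scraperapi_key=env.get("GS_SCRAPERAPI_KEY"),
        timeout=float(env.get("GS_TIMEOUT", "30")),
    )
    # Sanity check added for this sketch; the server may behave differently.
    if settings.min_delay > settings.max_delay:
        raise ValueError("GS_MIN_DELAY must not exceed GS_MAX_DELAY")
    return settings
```

Passing a plain dict instead of `os.environ` makes the function easy to try out, for example `load_settings({"GS_PROXY_TYPE": "free"})`.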

Proxy Configuration Examples

No Proxy (Default)

export GS_PROXY_TYPE=none

Free Proxy

export GS_PROXY_TYPE=free

Single Proxy

export GS_PROXY_TYPE=single
export GS_PROXY_HTTP=http://proxy.example.com:8080
export GS_PROXY_HTTPS=https://proxy.example.com:8080

ScraperAPI

export GS_PROXY_TYPE=scraperapi
export GS_SCRAPERAPI_KEY=your_key_here
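
Before starting the server, it can help to check that the variables a proxy mode depends on are actually set. The sketch below is one way to do that. The helper name `missing_proxy_vars` is hypothetical, and the rule that `single` mode needs both the HTTP and HTTPS URL is an assumption based on the examples above, not confirmed server behaviour.

```python
from typing import List, Mapping

# Variables each mode relies on, following the configuration table.
# Assumption: "single" needs both proxy URLs.
REQUIRED_BY_MODE = {
    "none": [],
    "free": [],
    "single": ["GS_PROXY_HTTP", "GS_PROXY_HTTPS"],
    "scraperapi": ["GS_SCRAPERAPI_KEY"],
}


def missing_proxy_vars(env: Mapping[str, str]) -> List[str]:
    """Return the names of required variables that are unset or empty."""
    mode = env.get("GS_PROXY_TYPE", "none")
    if mode not in REQUIRED_BY_MODE:
        raise ValueError(f"Unknown GS_PROXY_TYPE: {mode!r}")
    return [name for name in REQUIRED_BY_MODE[mode] if not env.get(name)]
```

Calling `missing_proxy_vars(os.environ)` (after `import os`) returns an empty list when the environment is complete for the selected mode.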

Usage

Running the Server

# Start the MCP server (communicates via stdio)
google-scholar-mcp

Integration with Claude Desktop

Add the server to your Claude Desktop configuration:

On macOS: ~/Library/Application Support/Claude/claude_desktop_config.json (on Linux: ~/.config/Claude/claude_desktop_config.json)

{
  "mcpServers": {
    "google-scholar": {
      "command": "python",
      "args": ["-m", "google_scholar_mcp.server"],
      "env": {
        "GS_MIN_DELAY": "5.0",
        "GS_MAX_DELAY": "15.0",
        "GS_PROXY_TYPE": "none"
      }
    }
  }
}

On Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "google-scholar": {
      "command": "python",
      "args": ["-m", "google_scholar_mcp.server"],
      "env": {
        "GS_MIN_DELAY": "5.0",
        "GS_MAX_DELAY": "15.0",
        "GS_PROXY_TYPE": "none"
      }
    }
  }
}

After updating the config, restart Claude Desktop. The Google Scholar tools will appear in the MCP Tools panel.
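
If the config file already lists other MCP servers, the new entry has to be merged in rather than pasted over the file. The following is a minimal sketch of that merge on an in-memory dictionary; `add_server` and `SERVER_ENTRY` are hypothetical names, and reading and writing the actual file is left out.

```python
import json
from typing import Any, Dict

# Same entry as in the JSON configuration shown above.
SERVER_ENTRY = {
    "command": "python",
    "args": ["-m", "google_scholar_mcp.server"],
    "env": {
        "GS_MIN_DELAY": "5.0",
        "GS_MAX_DELAY": "15.0",
        "GS_PROXY_TYPE": "none",
    },
}


def add_server(config: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of config with the google-scholar entry added."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))  # keep existing servers
    servers["google-scholar"] = SERVER_ENTRY
    merged["mcpServers"] = servers
    return merged


# "other-server" stands in for whatever is already configured.
existing = {"mcpServers": {"other-server": {"command": "other"}}}
print(json.dumps(add_server(existing), indent=2))
```

The input dictionary is not modified, so the original configuration can be kept as a backup before writing the merged result.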

Integration with Other MCP Clients

Any MCP client (e.g., Cline, Continue, or custom tools) can use this server. Configure the connection to:

Command: python -m google_scholar_mcp.server
Transport: stdio

Examples

Example 1: Search for Recent Papers in Machine Learning

Query: "Find 5 papers on deep learning from 2023-2024"

Claude Request:

Using the search_papers tool, find papers on "deep learning" published between 2023 and 2024, 
sorted by relevance, limiting to 5 results.
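
Under the hood, an MCP client turns a request like this into a JSON-RPC 2.0 `tools/call` message. The sketch below builds such a message for `search_papers`. The argument names (`query`, `year_from`, `year_to`, `num_results`) are assumptions for illustration, since the tool's parameter schema is not documented here; a client can discover the real parameters with the MCP `tools/list` method.

```python
import json
from typing import Any, Dict


def build_tool_call(request_id: int, tool: str, arguments: Dict[str, Any]) -> str:
    """Serialize an MCP tools/call request as a JSON-RPC 2.0 message."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool, "arguments": arguments},
    }
    return json.dumps(message)


# Hypothetical argument names; check the tool schema before relying on them.
request = build_tool_call(
    1,
    "search_papers",
    {"query": "deep learning", "year_from": 2023, "year_to": 2024, "num_results": 5},
)
print(request)
```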

Example 2: Find an Author and Their Work

Query: "Find papers by Yann LeCun"

Claude Request:

Use search_author to find the scholar profile for "Yann LeCun" and show me their top 10 publications.

Example 3: Citation Chain Analysis

Query: "What papers cite 'Attention is All You Need'?"

Claude Request:

Use get_citations to find papers that cite "Attention is All You Need" and show me the top 5.

Example 4: Literature Review

Query: "Search for papers on: reinforcement learning, deep Q-learning, policy gradients"

Claude Request:

Use bulk_search with queries ["reinforcement learning", "deep Q-learning", "policy gradients"],
returning 5 papers per query to help me build a literature review.

Example 5: Get BibTeX for Citation Management

Query: "Get BibTeX for 'ImageNet-21k Pretraining for the Masses'"

Claude Request:

Use get_bibtex to get the citation entry for "ImageNet-21k Pretraining for the Masses" 
so I can add it to my bibliography.

Rate Limiting

The server automatically enforces rate limiting between requests to avoid overloading Google Scholar's servers:

Min Delay (default 5s): Minimum wait between consecutive requests
Max Delay (default 15s): Maximum wait (randomized to avoid patterns)
Max Retries (default 3): Retry failed requests up to this many times

These settings help prevent being blocked by Google Scholar. Adjust via environment variables if needed:

export GS_MIN_DELAY=10.0
export GS_MAX_DELAY=20.0
export GS_MAX_RETRIES=5
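
To make the interplay of the three settings concrete, here is a minimal sketch of a randomized delay combined with retries. It assumes the wait is drawn uniformly between the minimum and maximum and that every exception counts as a retryable failure; the server's real strategy may differ, and `next_delay` and `call_with_retries` are hypothetical names.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def next_delay(min_delay: float = 5.0, max_delay: float = 15.0) -> float:
    """Pick a random wait between the two bounds (uniform is an assumption)."""
    return random.uniform(min_delay, max_delay)


def call_with_retries(
    request: Callable[[], T],
    max_retries: int = 3,
    min_delay: float = 5.0,
    max_delay: float = 15.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run request, waiting a random delay before each retry."""
    attempts = max_retries + 1  # the first try plus the retries
    for attempt in range(attempts):
        try:
            return request()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of retries: surface the last error
            sleep(next_delay(min_delay, max_delay))
    raise ValueError("max_retries must be zero or greater")
```

The `sleep` parameter exists so the waiting can be replaced in tests, for example with `waits.append` to record the delays instead of pausing.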

⚠️ IP Blocking Warning

If you exceed Google Scholar's rate limits despite the rate limiter:

Your IP may be temporarily blocked (usually 24-48 hours)
All requests will fail with connection errors or 429 responses
Proxies in the same IP range as the blocked address will also fail
Repeated violations may trigger permanent blocks or require CAPTCHA solving

Recommended Practices:

Never decrease delays below 5 seconds — the defaults are tuned for reliability
Use the bulk_search tool instead of rapid sequential searches — it includes built-in delays
Add extra buffer during bulk operations — consider setting GS_MIN_DELAY=10.0 for large jobs
Use a proxy service (free proxy or ScraperAPI) to distribute requests across multiple IPs
Monitor for 429 errors — if you see them, increase delays immediately and wait before retrying
Spread requests over time — don't run 100 queries in 5 minutes, even with delays
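
A quick back-of-the-envelope calculation shows what these delays mean for a batch job. The sketch below assumes requests run one after another, with one wait between each pair, and that waits are drawn uniformly between the two bounds; the time spent on the requests themselves is ignored. The function name is hypothetical.

```python
def estimate_job_seconds(num_requests, min_delay=5.0, max_delay=15.0):
    """Return (shortest, expected) total waiting time for a sequential job."""
    gaps = max(num_requests - 1, 0)  # waits happen between requests
    shortest = gaps * min_delay
    expected = gaps * (min_delay + max_delay) / 2  # mean of a uniform draw
    return shortest, expected


shortest, expected = estimate_job_seconds(100)
print(f"at least {shortest / 60:.2f} min, about {expected / 60:.1f} min expected")
# prints: at least 8.25 min, about 16.5 min expected
```

With the default settings, 100 queries therefore cannot finish in 5 minutes; a job that does is bypassing the rate limiter.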

Recovery from IP Blocks

If your IP gets blocked:

Wait 24-48 hours for the temporary block to expire
Use a proxy — enable GS_PROXY_TYPE=free or scraperapi to route through different IPs
Change your network — use a different WiFi/ISP temporarily if possible
Contact support — for persistent blocks, escalate to Google Scholar support

Choosing Appropriate Delays

Scenario	GS_MIN_DELAY	GS_MAX_DELAY	Notes
Single searches	5.0	15.0	Default; safe for occasional queries
Bulk operations	10.0	20.0	Use for batch jobs; prevents rapid-fire requests
Heavy load	15.0	30.0	Use with proxy for large-scale research
Aggressive ⚠️	<5.0	<10.0	Not recommended; high risk of IP blocking
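
The thresholds in the table can be turned into a simple pre-flight check. This is a sketch under the assumption that the "Aggressive" row marks the boundary for a warning; `delay_warnings` is a hypothetical helper, not part of the server.

```python
from typing import List


def delay_warnings(min_delay: float, max_delay: float) -> List[str]:
    """Return warnings for delay settings that fall in the 'Aggressive' row."""
    warnings = []
    if min_delay > max_delay:
        warnings.append("GS_MIN_DELAY is greater than GS_MAX_DELAY")
    if min_delay < 5.0:
        warnings.append("GS_MIN_DELAY below 5.0: high risk of IP blocking")
    if max_delay < 10.0:
        warnings.append("GS_MAX_DELAY below 10.0: high risk of IP blocking")
    return warnings


for message in delay_warnings(3.0, 8.0):
    print("warning:", message)
```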

Troubleshooting

"Error: 429 Too Many Requests"

You've hit Google Scholar's rate limit. Solutions:

Increase delays: Set higher GS_MIN_DELAY and GS_MAX_DELAY
Use a proxy: Set GS_PROXY_TYPE=free, or set GS_PROXY_TYPE=scraperapi together with GS_SCRAPERAPI_KEY
Wait and retry: Google Scholar may be temporarily blocking; try again later

"No results found"

Check your query syntax (Google Scholar supports advanced search operators)
Ensure the author/paper name is spelled correctly
Try a simpler query with fewer keywords

"Connection timeout"

Increase GS_TIMEOUT if your network is slow
Check your internet connection
Verify proxy settings if using a proxy

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch (git checkout -b feature/your-feature)
Commit your changes with clear messages
Push to your fork
Open a pull request

Support

For issues, questions, or feature requests, please open an issue on GitHub.

License

See the LICENSE file in the repository.

Release history

Version	Date
1.0.0	May 4, 2026
0.1.2	May 3, 2026
0.1.1	Apr 28, 2026
0.1.0 (this version)	Apr 28, 2026

Download files

Source Distribution

google_scholar_search_mcp-0.1.0.tar.gz (9.1 kB)

Uploaded Apr 28, 2026 Source

Built Distribution

google_scholar_search_mcp-0.1.0-py3-none-any.whl (11.8 kB)

Uploaded Apr 28, 2026 Python 3

File details

Details for the file google_scholar_search_mcp-0.1.0.tar.gz.

File metadata

Download URL: google_scholar_search_mcp-0.1.0.tar.gz
Upload date: Apr 28, 2026
Size: 9.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for google_scholar_search_mcp-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`809f71f715a26ac9cf805bd29053a8365d8963e03d77641afc55714fdac86d31`
MD5	`f1f47e476b7e37798e111ceee5a9af49`
BLAKE2b-256	`07e98d62f4cab0d8210325f652f374e44985fdefb21602313abdb57cf33173a5`
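
A downloaded file can be checked against the SHA256 digest in the table with Python's standard `hashlib` module. The sketch below is one way to do it; the helper names are hypothetical, and the local file path is an assumption about where the download was saved.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: Path, expected_hex: str) -> bool:
    """Compare a file's digest with an expected value, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()


# SHA256 of the source distribution, copied from the table above.
EXPECTED = "809f71f715a26ac9cf805bd29053a8365d8963e03d77641afc55714fdac86d31"
# Assumed download location; adjust to wherever the file was saved:
# print(matches(Path("google_scholar_search_mcp-0.1.0.tar.gz"), EXPECTED))
```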

Provenance

The following attestation bundles were made for google_scholar_search_mcp-0.1.0.tar.gz:

Publisher: pypi-release.yml on LWaetzig/google-scholar-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: google_scholar_search_mcp-0.1.0.tar.gz
- Subject digest: 809f71f715a26ac9cf805bd29053a8365d8963e03d77641afc55714fdac86d31
- Sigstore transparency entry: 1397848862
- Sigstore integration time: Apr 28, 2026
Source repository:
- Permalink: LWaetzig/google-scholar-mcp@454b1ff0a3cc721891cab435c5bd30f8f72dfd22
- Branch / Tag: refs/heads/master
- Owner: https://github.com/LWaetzig
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-release.yml@454b1ff0a3cc721891cab435c5bd30f8f72dfd22
- Trigger Event: workflow_dispatch

File details

Details for the file google_scholar_search_mcp-0.1.0-py3-none-any.whl.

File metadata

Download URL: google_scholar_search_mcp-0.1.0-py3-none-any.whl
Upload date: Apr 28, 2026
Size: 11.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for google_scholar_search_mcp-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2a399783e2802645c034437369c8fe58231cbe4fb847df3910dd49513711120a`
MD5	`54d4fa107b25a7fffb432be6de98708a`
BLAKE2b-256	`4cbeb44df726b62c55db3c84735c0b7d0466a3c07230a7889159fa8fc7ad492c`

Provenance

The following attestation bundles were made for google_scholar_search_mcp-0.1.0-py3-none-any.whl:

Publisher: pypi-release.yml on LWaetzig/google-scholar-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: google_scholar_search_mcp-0.1.0-py3-none-any.whl
- Subject digest: 2a399783e2802645c034437369c8fe58231cbe4fb847df3910dd49513711120a
- Sigstore transparency entry: 1397848882
- Sigstore integration time: Apr 28, 2026
Source repository:
- Permalink: LWaetzig/google-scholar-mcp@454b1ff0a3cc721891cab435c5bd30f8f72dfd22
- Branch / Tag: refs/heads/master
- Owner: https://github.com/LWaetzig
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi-release.yml@454b1ff0a3cc721891cab435c5bd30f8f72dfd22
- Trigger Event: workflow_dispatch

google-scholar-search-mcp 0.1.0
