MCP Server for FineData web scraping API - enables AI agents to scrape any website
FineData MCP Server
MCP (Model Context Protocol) server for FineData web scraping API.
Enables AI agents like Claude, Cursor, and GPT to scrape any website with:
- Antibot Bypass - Cloudflare, DataDome, PerimeterX, and more
- JavaScript Rendering - Full browser rendering with Playwright
- Captcha Solving - reCAPTCHA, hCaptcha, Cloudflare Turnstile, Yandex
- Proxy Rotation - 87K+ datacenter, residential, and mobile proxies
- Smart Retry - Automatic retries with block detection
Installation
Using uvx (Recommended)
# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh
# Run directly with uvx
FINEDATA_API_KEY=fd_xxx uvx finedata-mcp
Using pip
pip install finedata-mcp
# Run
FINEDATA_API_KEY=fd_xxx finedata-mcp
Using npx
FINEDATA_API_KEY=fd_xxx npx -y @finedata/mcp-server
Configuration
Claude Desktop
Add to your claude_desktop_config.json:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "finedata": {
      "command": "uvx",
      "args": ["finedata-mcp"],
      "env": {
        "FINEDATA_API_KEY": "fd_your_api_key_here"
      }
    }
  }
}
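If claude_desktop_config.json already lists other MCP servers, the finedata entry has to be merged in rather than pasted over the file. The following is a minimal sketch of that merge on an in-memory dictionary; the function name with_finedata_server and the "other" server in the demo are hypothetical, and reading or writing the actual file is left to you.

```python
import copy
import json


def with_finedata_server(config: dict, api_key: str) -> dict:
    """Return a copy of an MCP client config with a "finedata" server entry added."""
    merged = copy.deepcopy(config)  # leave the caller's dictionary untouched
    servers = merged.setdefault("mcpServers", {})
    # Same command, args, and env as the configuration shown above.
    servers["finedata"] = {
        "command": "uvx",
        "args": ["finedata-mcp"],
        "env": {"FINEDATA_API_KEY": api_key},
    }
    return merged


# Hypothetical existing config that already contains another server.
existing = {"mcpServers": {"other": {"command": "node", "args": ["server.js"]}}}
print(json.dumps(with_finedata_server(existing, "fd_your_api_key_here"), indent=2))
```

Any servers already present under mcpServers are preserved; only the finedata key is added or overwritten.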
Cursor IDE
Add to your MCP settings in Cursor:
{
  "mcpServers": {
    "finedata": {
      "command": "uvx",
      "args": ["finedata-mcp"],
      "env": {
        "FINEDATA_API_KEY": "fd_your_api_key_here"
      }
    }
  }
}
Alternative: Using npx
{
  "mcpServers": {
    "finedata": {
      "command": "npx",
      "args": ["-y", "@finedata/mcp-server"],
      "env": {
        "FINEDATA_API_KEY": "fd_your_api_key_here"
      }
    }
  }
}
Environment Variables
| Variable | Required | Description |
|---|---|---|
| FINEDATA_API_KEY | Yes | Your FineData API key |
| FINEDATA_API_URL | No | API URL (default: https://api.finedata.ai) |
| FINEDATA_TIMEOUT | No | Default timeout in seconds (default: 60) |
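To make the required/optional split and the defaults concrete, here is a small sketch of how these three variables could be resolved. It is not taken from the server's source: the FineDataSettings class and load_settings function are hypothetical, and parsing the timeout as an integer is an assumption.

```python
import os
from dataclasses import dataclass
from typing import Mapping

DEFAULT_API_URL = "https://api.finedata.ai"  # default from the table above
DEFAULT_TIMEOUT = 60                         # seconds, default from the table above


@dataclass(frozen=True)
class FineDataSettings:
    api_key: str
    api_url: str
    timeout: int


def load_settings(env: Mapping[str, str] = os.environ) -> FineDataSettings:
    """Resolve settings from an environment mapping (hypothetical helper)."""
    api_key = env.get("FINEDATA_API_KEY", "").strip()
    if not api_key:
        # FINEDATA_API_KEY is the only required variable.
        raise ValueError("FINEDATA_API_KEY is required")
    return FineDataSettings(
        api_key=api_key,
        api_url=env.get("FINEDATA_API_URL", DEFAULT_API_URL),
        # Assumption: the timeout is a whole number of seconds.
        timeout=int(env.get("FINEDATA_TIMEOUT", str(DEFAULT_TIMEOUT))),
    )


print(load_settings({"FINEDATA_API_KEY": "fd_xxx"}))
```

Passing a plain dictionary instead of os.environ keeps the helper easy to check without touching the real environment.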
Available Tools
scrape_url
Scrape content from any web page with antibot bypass.
scrape_url(
  url: "https://example.com",
  use_js_render: false,    # Enable Playwright for SPAs
  use_residential: false,  # Use residential proxy
  use_undetected: false,   # Use Undetected Chrome
  solve_captcha: false,    # Auto-solve captchas
  timeout: 60              # Timeout in seconds
)
Token costs:
- Base request: 1 token
- Antibot bypass: +2 tokens
- JS rendering: +5 tokens
- Undetected Chrome: +5 tokens
- Residential proxy: +3 tokens
- Captcha solving: +10 tokens (reCAPTCHA / hCaptcha; see Pricing for other captcha types)
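The scrape_url call above is written in shorthand. Under the Model Context Protocol, a client invokes a tool by sending a JSON-RPC 2.0 request with the method tools/call, and MCP clients such as Claude Desktop or Cursor normally build this message for you. The sketch below only shows what such a request could look like for scrape_url; the helper name build_tool_call is hypothetical.

```python
import json
from typing import Any


def build_tool_call(request_id: int, name: str, arguments: dict[str, Any]) -> dict[str, Any]:
    """Build an MCP tools/call request as a JSON-RPC 2.0 message (hypothetical helper)."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


# Argument names are the scrape_url parameters documented above.
request = build_tool_call(
    1,
    "scrape_url",
    {"url": "https://example.com", "use_js_render": True, "timeout": 60},
)
print(json.dumps(request, indent=2))
```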
scrape_async
Submit an async scraping job for long-running requests.
scrape_async(
  url: "https://heavy-site.com",
  use_js_render: true,
  timeout: 120,
  callback_url: "https://your-webhook.com/callback"
)
Returns a job_id that you can pass to get_job_status to poll for completion.
get_job_status
Get the status of an async scraping job.
get_job_status(job_id: "550e8400-e29b-41d4-a716-446655440000")
Statuses: pending, processing, completed, failed, cancelled
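Polling with get_job_status amounts to repeating the call until the job leaves the pending and processing states. The loop below is a sketch of that pattern, not part of the server: wait_for_job and fetch_status are hypothetical names, fetch_status stands in for whatever performs the get_job_status call and returns the status string, and treating completed, failed, and cancelled as final is an assumption based on the status list above.

```python
import time
from typing import Callable

# Assumption: these three statuses are final; pending and processing are not.
TERMINAL_STATUSES = {"completed", "failed", "cancelled"}


def wait_for_job(
    fetch_status: Callable[[str], str],
    job_id: str,
    interval: float = 2.0,
    max_polls: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll fetch_status(job_id) until a terminal status is seen or polls run out."""
    for _ in range(max_polls):
        status = fetch_status(job_id)
        if status in TERMINAL_STATUSES:
            return status
        sleep(interval)  # wait before asking again
    raise TimeoutError(f"job {job_id} still running after {max_polls} polls")


# Demo with a fake status source instead of a real get_job_status call.
fake_statuses = iter(["pending", "processing", "completed"])
print(wait_for_job(lambda _job_id: next(fake_statuses), "demo-job", sleep=lambda _s: None))
```

If a callback_url was supplied to scrape_async, polling may be unnecessary; the loop is mainly useful when no webhook is available.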
batch_scrape
Scrape multiple URLs in a single batch (up to 100 URLs).
batch_scrape(
  urls: ["https://example.com/1", "https://example.com/2"],
  use_js_render: false,
  callback_url: "https://your-webhook.com/batch-done"
)
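Because a batch accepts at most 100 URLs, a longer list has to be split into several batch_scrape calls. A minimal sketch of that split follows; chunk_urls is a hypothetical helper and the example URLs are placeholders.

```python
from typing import Iterator, Sequence

MAX_BATCH_SIZE = 100  # batch limit stated above


def chunk_urls(urls: Sequence[str], size: int = MAX_BATCH_SIZE) -> Iterator[list[str]]:
    """Yield consecutive slices of urls, each holding at most `size` entries."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(urls), size):
        yield list(urls[start:start + size])


# 250 placeholder URLs split into batches of 100, 100, and 50.
urls = [f"https://example.com/{i}" for i in range(250)]
print([len(batch) for batch in chunk_urls(urls)])
```

Each yielded list can then be passed as the urls argument of one batch_scrape call.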
get_usage
Get current API token usage.
get_usage()
Examples
Basic Scraping
Ask Claude or your AI agent:
"Scrape https://example.com and show me the content"
JavaScript Rendered Page
"Scrape https://spa-website.com with JavaScript rendering enabled"
Protected Site with Captcha
"Scrape https://protected-site.com using residential proxy and captcha solving"
Batch Scraping
"Scrape these URLs: https://example.com/1, https://example.com/2, https://example.com/3"
Pricing
FineData uses token-based pricing. Each feature adds tokens:
| Feature | Tokens |
|---|---|
| Base request | 1 |
| Antibot (TLS fingerprinting) | +2 |
| JS Rendering (Playwright) | +5 |
| Undetected Chrome | +5 |
| Residential Proxy | +3 |
| Mobile Proxy | +4 |
| reCAPTCHA / hCaptcha | +10 |
| Cloudflare Turnstile | +12 |
| Yandex SmartCaptcha | +15 |
Get your API key and free trial tokens at finedata.ai.
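As a worked example of the pricing table, the sketch below adds up the per-request cost for a chosen set of features. The feature keys and the estimate_tokens function are hypothetical labels for the table rows, and the sketch assumes costs are purely additive, as the sentence "Each feature adds tokens" suggests; actual billing may differ.

```python
from typing import Iterable

BASE_TOKENS = 1

# Values copied from the pricing table; the keys are hypothetical labels.
FEATURE_TOKENS = {
    "antibot": 2,
    "js_render": 5,
    "undetected_chrome": 5,
    "residential_proxy": 3,
    "mobile_proxy": 4,
    "recaptcha_hcaptcha": 10,
    "cloudflare_turnstile": 12,
    "yandex_smartcaptcha": 15,
}


def estimate_tokens(features: Iterable[str]) -> int:
    """Estimate the token cost of one request, assuming costs simply add up."""
    selected = set(features)  # count each feature once
    unknown = selected - FEATURE_TOKENS.keys()
    if unknown:
        raise KeyError(f"unknown features: {sorted(unknown)}")
    return BASE_TOKENS + sum(FEATURE_TOKENS[name] for name in selected)


# Base request plus JS rendering plus residential proxy: 1 + 5 + 3 = 9
print(estimate_tokens(["js_render", "residential_proxy"]))
```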
Support
- Documentation: https://docs.finedata.ai
- Email: support@finedata.ai
- Issues: https://github.com/quality-network/finedata-mcp/issues
License
MIT