MCP Server for FineData web scraping API - enables AI agents to scrape any website
Project description
FineData MCP Server
MCP (Model Context Protocol) server for FineData web scraping API.
Enables AI agents like Claude, Cursor, and GPT to scrape any website with:
- Antibot Bypass - Cloudflare, DataDome, PerimeterX, and more
- JavaScript Rendering - Full browser rendering with Playwright
- Captcha Solving - reCAPTCHA, hCaptcha, Cloudflare Turnstile, Yandex
- Proxy Rotation - 87K+ datacenter, residential, and mobile proxies
- Smart Retry - Automatic retries with block detection
- Markdown Output - Returns clean Markdown by default (optimized for AI agents)
- AI Extraction - Extract structured data using natural language prompts (Qwen3-32B)
Installation
Using uvx (Recommended)
# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh
# Run directly with uvx
FINEDATA_API_KEY=fd_xxx uvx finedata-mcp
Using pip
pip install finedata-mcp
# Run
FINEDATA_API_KEY=fd_xxx finedata-mcp
Using npx
npx -y @finedata/mcp-server
Configuration
Cursor IDE
Step 1: Open Cursor Settings → MCP
Or create/edit ~/.cursor/mcp.json:
macOS/Linux:
mkdir -p ~/.cursor && nano ~/.cursor/mcp.json
Windows:
%USERPROFILE%\.cursor\mcp.json
Step 2: Add FineData MCP server:
{
"mcpServers": {
"finedata": {
"command": "uvx",
"args": ["finedata-mcp"],
"env": {
"FINEDATA_API_KEY": "fd_your_api_key_here"
}
}
}
}
Step 3: Restart Cursor
Step 4: Test by asking the agent:
"Scrape https://example.com and show me the title"
Alternative: Using npx (if uv not installed)
{
"mcpServers": {
"finedata": {
"command": "npx",
"args": ["-y", "@finedata/mcp-server"],
"env": {
"FINEDATA_API_KEY": "fd_your_api_key_here"
}
}
}
}
Note: npx requires Python 3.10+ and uv/pipx installed. uvx is recommended.
Claude Desktop
Step 1: Open config file:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
Step 2: Add MCP server:
{
"mcpServers": {
"finedata": {
"command": "uvx",
"args": ["finedata-mcp"],
"env": {
"FINEDATA_API_KEY": "fd_your_api_key_here"
}
}
}
}
Step 3: Restart Claude Desktop
Environment Variables
| Variable | Required | Description |
|---|---|---|
FINEDATA_API_KEY |
Yes | Your FineData API key |
FINEDATA_API_URL |
No | API URL (default: https://api.finedata.ai) |
FINEDATA_TIMEOUT |
No | Default timeout in seconds (default: 60) |
Available Tools
scrape_url
Scrape content from any web page with antibot bypass.
scrape_url(
url: "https://example.com",
formats: ["markdown"], # Output: markdown, rawHtml, text, links (default: ["markdown"])
only_main_content: true, # Extract main content only (default: true)
extract_prompt: null, # AI extraction instruction (e.g. "Extract all product prices")
use_js_render: false, # Enable Playwright for SPAs
stealth_antibot: false, # Stealth mode for Cloudflare, DataDome (+7 tokens)
stealth_antibot_headful: false, # Maximum bypass with real browser (+25 tokens)
stealth_new: false, # Experimental engine (+15 tokens)
use_residential: false, # Use residential proxy
solve_captcha: false, # Auto-solve captchas
timeout: 60 # Timeout in seconds
)
Note: The MCP server returns Markdown by default (not raw HTML), which is optimized for AI agent consumption.
Token costs:
- Base request: 1 token
- Antibot bypass: +2 tokens
- JS rendering: +5 tokens
- Stealth Antibot: +7 tokens
- Stealth Headful: +25 tokens
- Stealth New: +15 tokens
- Residential proxy: +3 tokens
- Captcha solving: +10 tokens
scrape_async
Submit an async scraping job for long-running requests.
scrape_async(
url: "https://heavy-site.com",
use_js_render: true,
timeout: 120,
callback_url: "https://your-webhook.com/callback"
)
Returns a job_id for status polling.
get_job_status
Get the status of an async scraping job.
get_job_status(job_id: "550e8400-e29b-41d4-a716-446655440000")
Statuses: pending, processing, completed, failed, cancelled
batch_scrape
Scrape multiple URLs in a single batch (up to 100 URLs).
batch_scrape(
urls: ["https://example.com/1", "https://example.com/2"],
use_js_render: false,
callback_url: "https://your-webhook.com/batch-done"
)
get_usage
Get current API token usage.
get_usage()
Examples
Basic Scraping
Ask Claude or your AI agent:
"Scrape https://example.com and show me the content"
JavaScript Rendered Page
"Scrape https://spa-website.com with JavaScript rendering enabled"
Protected Site with Captcha
"Scrape https://protected-site.com using residential proxy and captcha solving"
AI Data Extraction
"Scrape https://shop.com/products and extract all product names and prices"
Batch Scraping
"Scrape these URLs: https://example.com/1, https://example.com/2, https://example.com/3"
Pricing
FineData uses token-based pricing. Each feature adds tokens:
| Feature | Tokens |
|---|---|
| Base request | 1 |
| Antibot (TLS fingerprinting) | +2 |
| JS Rendering (Playwright) | +5 |
| Stealth Antibot | +7 |
| Stealth Headful | +25 |
| Stealth New | +15 |
| Residential Proxy | +3 |
| Mobile Proxy | +4 |
| reCAPTCHA / hCaptcha | +10 |
| Cloudflare Turnstile | +12 |
| Yandex SmartCaptcha | +15 |
| AI Extraction (Qwen3-32B) | +5 |
Get your API key and free trial tokens at finedata.ai.
Troubleshooting
"No module named finedata_mcp"
Install uv and use uvx:
curl -LsSf https://astral.sh/uv/install.sh | sh
"externally-managed-environment" on macOS
This happens with Homebrew Python. Use uvx instead of pip:
{
"command": "uvx",
"args": ["finedata-mcp"]
}
MCP server not appearing in Cursor
- Check
~/.cursor/mcp.jsonsyntax (valid JSON) - Ensure
FINEDATA_API_KEYis set - Restart Cursor completely
- Check Cursor Output → MCP for errors
Support
- Documentation: https://docs.finedata.ai
- Email: support@finedata.ai
- Issues: https://github.com/quality-network/finedata-mcp/issues
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file finedata_mcp-0.1.8.tar.gz.
File metadata
- Download URL: finedata_mcp-0.1.8.tar.gz
- Upload date:
- Size: 11.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9129cf04839d10cd54e5e38f2cf0a9cc37d9349b40013d13cdb28d6234f857fe
|
|
| MD5 |
b6a3d88cb4fee0c5750bec9fef5093cf
|
|
| BLAKE2b-256 |
2d03e401b28e5dd6eb3bb7e32d9c0828740ce965a8065b44043b166b71bfdb43
|
File details
Details for the file finedata_mcp-0.1.8-py3-none-any.whl.
File metadata
- Download URL: finedata_mcp-0.1.8-py3-none-any.whl
- Upload date:
- Size: 14.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3b2bfcd0d47849b3b9d9f4f68fdd65df0b8bb036ead745fcc8291cb2b8fe7cc1
|
|
| MD5 |
400ac687ab2fa925308e070d4f05d83b
|
|
| BLAKE2b-256 |
bc4a23eca92a98e17d4e82ead774d4046b9cba419edabf37109a3e2ed8c2e75d
|