Python SDK for the MrScraper web-scraping API

These details have not been verified by PyPI

Project links

Project description

MrScraper Python SDK

A Simple Python SDK for the MrScraper web-scraping API. Supports async / await usage.

Installation

pip install mrscraper-sdk

Requires Python 3.9+.

Authentication

Every client is initialised with your MrScraper API token. Get yours at https://app.mrscraper.com.

from mrscraper import MrScraper

client = MrScraper(token="MRSCRAPER_API_TOKEN")

Quick Start

Fetch raw HTML (stealth browser)

Fetch the rendered HTML of a page using the MrScraper stealth browser. Handles JavaScript rendering, bot detection evasion, and optional geolocation proxying.

result = await client.fetch_html(
    "https://stockx.com/air-jordan-1-retro-low-og-chicago-2025",
    geo_code="US",
    timeout=120,
    block_resources=False,
)

Parameter	Type	Default	Description
`url`	`str`	(required)	Target URL to scrape
`timeout`	`int`	`120`	Maximum seconds to wait for the page to load
`geo_code`	`str`	`"US"`	ISO country code for proxy-based geolocation (e.g. `"US"`, `"GB"`, `"ID"`, `"SG"`)
`block_resources`	`bool`	`False`	When `True`, blocks images, CSS, and fonts to speed up the request

Returns: dict with keys status_code, data (raw HTML string), and headers.

Create an AI scraper

Create an AI-powered scraper using natural-language instructions and run it immediately. No CSS selectors required.

result = await client.create_scraper(
    url="https://example.com/products",
    message="Extract all product names, prices, and ratings",
    agent="listing",          # "general" | "listing" | "map"
    proxy_country="US",
)
scraper_id = result["data"]["data"]["scraperId"]

Parameter	Type	Default	Description
`url`	`str`	(required)	Target URL to scrape
`message`	`str`	(required)	Natural-language description of what to extract (e.g. `"Extract all product names and prices"`)
`agent`	`"general"` \| `"listing"` \| `"map"`	`"general"`	AI agent type (see table below)
`proxy_country`	`str \| None`	`None`	ISO country code for proxy selection (e.g. `"US"`, `"GB"`, `"SG"`)
`max_depth`	`int`	`2`	(map agent only) Crawl depth from the start URL. `0` = start URL only
`max_pages`	`int`	`50`	(map agent only) Maximum number of pages to process
`limit`	`int`	`1000`	(map agent only) Maximum number of records to extract
`include_patterns`	`str`	`""`	(map agent only) `\|\|`-separated URL regex patterns to include when following links
`exclude_patterns`	`str`	`""`	(map agent only) `\|\|`-separated URL regex patterns to skip when following links

Agent	Best used for
`"general"`	Default; handles almost any page
`"listing"`	Product listings, job boards, search results
`"map"`	Crawling all sub-pages / sitemaps of a site

Returns: dict with keys status_code, data (scraper info including the scraper ID), and headers.

Rerun a scraper on a new URL

Reuse the extraction logic from a previously created AI scraper on any compatible URL.

result = await client.rerun_scraper(
    scraper_id=scraper_id,
    url="https://example.com/products?page=2",
)

Parameter	Type	Default	Description
`scraper_id`	`str`	(required)	ID of the scraper to rerun (from `create_scraper`)
`url`	`str`	(required)	Target URL — can be the original URL or a different page
`max_depth`	`int`	`2`	(map agent only) Crawl depth from the start URL
`max_pages`	`int`	`50`	(map agent only) Maximum number of pages to process
`limit`	`int`	`1000`	(map agent only) Maximum number of records to extract
`include_patterns`	`str`	`""`	(map agent only) `\|\|`-separated URL regex patterns to include
`exclude_patterns`	`str`	`""`	(map agent only) `\|\|`-separated URL regex patterns to skip

Returns: dict with keys status_code, data, and headers.

Bulk rerun on multiple URLs (AI scraper)

Rerun an existing AI scraper on multiple URLs in a single batch request. More efficient than calling rerun_scraper in a loop — all URLs are dispatched in parallel server-side.

result = await client.bulk_rerun_ai_scraper(
    scraper_id=scraper_id,
    urls=[
        "https://example.com/products/item1",
        "https://example.com/products/item2",
        "https://example.com/products/item3",
    ],
)

Parameter	Type	Default	Description
`scraper_id`	`str`	(required)	ID of the scraper to rerun (from `create_scraper`)
`urls`	`list[str]`	(required)	List of target URLs. Must contain at least one URL

Returns: dict with keys status_code, data, and headers.

Rerun a manually configured scraper

Rerun a scraper that was created manually through the MrScraper dashboard (with custom CSS selectors or XPath rules) on a new URL. Must not be an AI scraper. Find your scraper ID at https://app.mrscraper.com.

result = await client.rerun_manual_scraper(
    scraper_id="SCRAPER_ID",
    url="https://example.com/products/new-item",
)

Parameter	Type	Default	Description
`scraper_id`	`str`	(required)	ID of the manual scraper (found in the MrScraper dashboard)
`url`	`str`	(required)	Target URL — page structure should match the original scraper's target for selectors to work

Returns: dict with keys status_code, data, and headers.

Bulk rerun manual scraper on multiple URLs

Rerun a manually configured scraper on multiple URLs in a single batch. All URLs are processed in parallel server-side. Find your scraper ID at https://app.mrscraper.com.

result = await client.bulk_rerun_manual_scraper(
    scraper_id="SCRAPER_ID",
    urls=[
        "https://www.example.com/products/item1",
        "https://www.example.com/products/item2",
        "https://www.example.com/products/item3",
    ],
)

Parameter	Type	Default	Description
`scraper_id`	`str`	(required)	ID of the manual scraper (from the MrScraper dashboard). Must be a scraper created manually via the web interface, not an AI scraper
`urls`	`list[str]`	(required)	List of target URLs. Each URL is processed independently using the scraper's logic

Returns: dict with keys status_code, data (bulk job info including job ID and status), and headers.

Fetch Google SERP

Fetch Google search engine results pages (SERP) synchronously.

result = await client.fetch_google_serp(
    "https://www.google.com/search?q=iphone+17",
    raw=True,
)

Parameter	Type	Default	Description
`url`	`str`	(required)	Full Google search URL to scrape (e.g. `"https://www.google.com/search?q=iphone+17"`)
`raw`	`bool`	`True`	When `True`, return the raw SERP payload
`timeout`	`float`	`600.0`	Maximum seconds to wait for the request

Returns: dict with keys status_code, data, and headers.

Retrieve results

Fetch previously stored scraping results with pagination, sorting, and filtering.

# All results (paginated)
results = await client.get_all_results(
    sort_field="updatedAt",
    sort_order="DESC",
    page_size=20,
    page=1,
    search="product",
    date_range_column="updatedAt",
    start_at="2024-01-01",
    end_at="2024-01-31",
)

# A specific result by ID
result = await client.get_result_by_id("RESULT_ID")

`get_all_results` parameters

Parameter	Type	Default	Description
`sort_field`	`str`	`"updatedAt"`	Field to sort by. Options: `"createdAt"`, `"updatedAt"`, `"id"`, `"type"`, `"url"`, `"status"`, `"error"`, `"tokenUsage"`, `"runtime"`
`sort_order`	`"ASC"` \| `"DESC"`	`"DESC"`	Sort direction
`page_size`	`int`	`10`	Number of results per page
`page`	`int`	`1`	Page number (1-indexed)
`search`	`str \| None`	`None`	Free-text search query across result fields
`date_range_column`	`str \| None`	`None`	Column name to filter by date range (e.g. `"updatedAt"`, `"createdAt"`)
`start_at`	`str \| None`	`None`	ISO-8601 start date for the date range filter (e.g. `"2024-01-01"`)
`end_at`	`str \| None`	`None`	ISO-8601 end date for the date range filter (e.g. `"2024-01-31"`)

Returns: dict with keys status_code, data (paginated results and pagination metadata), and headers.

`get_result_by_id` parameters

Parameter	Type	Default	Description
`result_id`	`str`	(required)	Unique identifier of the result (returned by scraper execution methods and `get_all_results`)

Returns: dict with keys status_code, data (complete result object), and headers.

Exceptions

Exception	Raised when
`MrScraperError`	Base class for all SDK errors
`AuthenticationError`	API token is invalid or missing (HTTP 401)
`APIError`	API returned a non-2xx error; has `.status_code` attribute
`NetworkError`	Connection timeout or network-level failure

from mrscraper.exceptions import AuthenticationError, APIError, NetworkError

try:
    result = await client.fetch_html("https://example.com")
except AuthenticationError:
    print("Check your API token at https://app.mrscraper.com")
except APIError as e:
    print(f"API error {e.status_code}: {e}")
except NetworkError as e:
    print(f"Network problem: {e}")

Compliance & Legal Risk

WARNING Scraping login-protected pages carries serious legal and compliance risks. Many websites explicitly prohibit automated access in their Terms of Service, and bypassing authentication to scrape content may expose you to legal action including lawsuits, account termination, and financial penalties. By proceeding on scraping login-protected pages, you confirm that you have read and understood the target website's Terms of Service, and you fully accept all legal, financial, and ethical responsibility for your actions.

License

MIT © MrScraper

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.1

Jun 2, 2026

0.2.0

May 26, 2026

0.1.2

Mar 9, 2026

0.1.1

Mar 9, 2026

0.1.0

Mar 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mrscraper_sdk-0.2.1.tar.gz (11.0 kB view details)

Uploaded Jun 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mrscraper_sdk-0.2.1-py3-none-any.whl (11.5 kB view details)

Uploaded Jun 2, 2026 Python 3

File details

Details for the file mrscraper_sdk-0.2.1.tar.gz.

File metadata

Download URL: mrscraper_sdk-0.2.1.tar.gz
Upload date: Jun 2, 2026
Size: 11.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for mrscraper_sdk-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`34ae2e5eba684536773132484daea65d889a878967650d82ee43ae96f67595e6`
MD5	`40266177a557e126758512f4c3549906`
BLAKE2b-256	`66b28082b588e11e8769b69dbe92d674996d8a3415d25c0634a8806deaa4a322`

See more details on using hashes here.

File details

Details for the file mrscraper_sdk-0.2.1-py3-none-any.whl.

File metadata

Download URL: mrscraper_sdk-0.2.1-py3-none-any.whl
Upload date: Jun 2, 2026
Size: 11.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for mrscraper_sdk-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9d740d6516f4ca7efaad0a5540061a8c00b5fbe2d44566f486084cfc622be8c3`
MD5	`77a86ff258fb9cbec707e291581ad841`
BLAKE2b-256	`8a2f6a67ad17bcd220b9e7ed2666f5f2cc5ffe1517be5cdd491ebb519b01395c`

See more details on using hashes here.

mrscraper-sdk 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

MrScraper Python SDK

Installation

Authentication

Quick Start

Fetch raw HTML (stealth browser)

Create an AI scraper

Rerun a scraper on a new URL

Bulk rerun on multiple URLs (AI scraper)

Rerun a manually configured scraper

Bulk rerun manual scraper on multiple URLs

Fetch Google SERP

Retrieve results

get_all_results parameters

get_result_by_id parameters

Exceptions

Compliance & Legal Risk

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`get_all_results` parameters

`get_result_by_id` parameters