Python SDK for the MrScraper web-scraping API

These details have not been verified by PyPI

Project links

Project description

MrScraper Python SDK

A Simple Python SDK for the MrScraper web-scraping API. Supports async / await usage.

Installation

pip install mrscraper-sdk

Requires Python 3.9+.

Authentication

Every client is initialised with your MrScraper API token. Get yours at https://app.mrscraper.com.

from mrscraper import MrScraper

client = MrScraper(token="MRSCRAPER_API_TOKEN")

Quick Start

Fetch raw HTML (stealth browser)

import asyncio
from mrscraper import MrScraper

async def main():
    client = MrScraper(token="MRSCRAPER_API_TOKEN")

    result = await client.fetch_html(
        "https://stockx.com/air-jordan-1-retro-low-og-chicago-2025",
        geo_code="US",
        timeout=120,
        block_resources=False,
    )
    print(result["data"])   # raw HTML string

asyncio.run(main())

Create an AI scraper

result = await client.create_scraper(
    url="https://example.com/products",
    message="Extract all product names, prices, and ratings",
    agent="listing",          # "general" | "listing" | "map"
    proxy_country="US",
)
scraper_id = result["data"]["data"]["id"]
print("Scraper ID:", scraper_id)

Rerun a scraper on a new URL

result = await client.rerun_scraper(
    scraper_id=scraper_id,
    url="https://example.com/products?page=2",
)

Bulk rerun on multiple URLs (AI scraper)

result = await client.bulk_rerun_ai_scraper(
    scraper_id=scraper_id,
    urls=[
        "https://example.com/products/item1",
        "https://example.com/products/item2",
        "https://example.com/products/item3",
    ],
)

Rerun a manually configured scraper

result = await client.rerun_manual_scraper(
    scraper_id="manual_scraper_67890",
    url="https://example.com/products/new-item",
)

Bulk rerun manual scraper on multiple URLs

result = await client.bulk_rerun_manual_scraper(
    scraper_id="scraper_12345",
    urls=[
        "https://www.example.com/products/item1",
        "https://www.example.com/products/item2",
        "https://www.example.com/products/item3",
    ],
)

Fetch Google SERP

result = await client.fetch_google_serp(
    "https://www.google.com/search?q=iphone+17",
    raw=True,
)
print(result["data"])

Retrieve results

# All results (paginated)
page = await client.get_all_results(
    sort_field="updatedAt",
    sort_order="DESC",
    page_size=20,
    page=1,
    search="product",
    date_range_column="updatedAt",
    start_at="2024-01-01",
    end_at="2024-01-31",
)
print(page["data"])

# A specific result by ID
result = await client.get_result_by_id("result_12345")
print(result["data"])

API Reference

`MrScraper`

All methods are coroutines and must be awaited.

Method	Description
`fetch_html(url, *, timeout, geo_code, block_resources)`	Fetch rendered HTML via the MrScraper stealth browser
`fetch_google_serp(url, *, raw, timeout)`	Fetch Google search results (SERP) synchronously
`create_scraper(url, message, *, agent, proxy_country, ...)`	Create & run an AI-powered scraper
`rerun_scraper(scraper_id, url, *, max_depth, max_pages, limit, ...)`	Rerun an AI scraper on a new URL
`bulk_rerun_ai_scraper(scraper_id, urls)`	Rerun an AI scraper on multiple URLs in one batch
`rerun_manual_scraper(scraper_id, url)`	Rerun a manually configured scraper on a single URL
`bulk_rerun_manual_scraper(scraper_id, urls)`	Rerun a manual scraper on multiple URLs in one batch
`get_all_results(*, sort_field, sort_order, page_size, page, search, ...)`	List all results with filtering & pagination
`get_result_by_id(result_id)`	Fetch a single result by its ID

All methods return a dict with the following keys:

Key	Type	Description
`status_code`	`int`	HTTP status code
`data`	`Any`	Parsed JSON body or raw text
`headers`	`dict`	Response headers

`bulk_rerun_manual_scraper`

Reruns a manually configured scraper on multiple URLs simultaneously in a single batch operation. This is more efficient than calling rerun_manual_scraper multiple times, as it processes all URLs in parallel and returns consolidated results. Ideal for scraping multiple pages, products, or articles with the same extraction logic.

Argument	Description
`scraper_id`	The ID of the manual scraper to rerun (obtained from the MrScraper dashboard). Must be a scraper created manually through the web interface, not an AI scraper. Find it at https://app.mrscraper.com
`urls`	A list of target URLs to scrape (required, must contain at least one URL). Each URL will be processed independently using the scraper's extraction logic. Example: `["https://example.com/page1", "https://example.com/page2"]`

Returns: A dict with status_code, data (bulk job info including job ID, status, metadata; use get_all_results or get_result_by_id to fetch per-URL results), and headers.

Example:

result = await client.bulk_rerun_manual_scraper(
    scraper_id="scraper_12345",
    urls=[
        "https://www.example.com/products/item1",
        "https://www.example.com/products/item2",
        "https://www.example.com/products/item3",
    ],
)

`create_scraper` — agent types

Agent	Best used for
`"general"`	Default; handles almost any page
`"listing"`	Product listings, job boards, search results
`"map"`	Crawling all sub-pages / sitemaps of a site

The max_depth, max_pages, limit, include_patterns, and exclude_patterns parameters are only meaningful when agent="map".

Exceptions

Exception	Raised when
`MrScraperError`	Base class for all SDK errors
`AuthenticationError`	API token is invalid or missing (HTTP 401)
`APIError`	API returned a non-2xx error; has `.status_code` attribute
`NetworkError`	Connection timeout or network-level failure

from mrscraper.exceptions import AuthenticationError, APIError, NetworkError

try:
    result = await client.fetch_html("https://example.com")
except AuthenticationError:
    print("Check your API token at https://app.mrscraper.com")
except APIError as e:
    print(f"API error {e.status_code}: {e}")
except NetworkError as e:
    print(f"Network problem: {e}")

Compliance & Legal Risk

WARNING Scraping login-protected pages carries serious legal and compliance risks. Many websites explicitly prohibit automated access in their Terms of Service, and bypassing authentication to scrape content may expose you to legal action including lawsuits, account termination, and financial penalties. By proceeding on scraping login-protected pages, you confirm that you have read and understood the target website's Terms of Service, and you fully accept all legal, financial, and ethical responsibility for your actions.

License

MIT © MrScraper

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0

May 26, 2026

0.1.2

Mar 9, 2026

0.1.1

Mar 9, 2026

0.1.0

Mar 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mrscraper_sdk-0.2.0.tar.gz (66.4 kB view details)

Uploaded May 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mrscraper_sdk-0.2.0-py3-none-any.whl (69.5 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file mrscraper_sdk-0.2.0.tar.gz.

File metadata

Download URL: mrscraper_sdk-0.2.0.tar.gz
Upload date: May 26, 2026
Size: 66.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for mrscraper_sdk-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`d1fd0c56816d56f9301bca5840f9a6365dd39329018d270c04308998bd63833a`
MD5	`7f5f8c66999a95aa23639c11ca541cee`
BLAKE2b-256	`43abf26e1cefa9de72401a9e05e14442dab7ccc2ddb70995a4be2eb5390cf4f8`

See more details on using hashes here.

File details

Details for the file mrscraper_sdk-0.2.0-py3-none-any.whl.

File metadata

Download URL: mrscraper_sdk-0.2.0-py3-none-any.whl
Upload date: May 26, 2026
Size: 69.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for mrscraper_sdk-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e6eb617a8ba191315a16a9580f4aa334fde0dc827cfcfcac52c70433cb6d95d2`
MD5	`030fa3a7ed22a4966216f474363ee335`
BLAKE2b-256	`8959966dc7eb16b878437c0d6db21d47627c8949a85a8b53b4dbaf017655739f`

See more details on using hashes here.

mrscraper-sdk 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

MrScraper Python SDK

Installation

Authentication

Quick Start

Fetch raw HTML (stealth browser)

Create an AI scraper

Rerun a scraper on a new URL

Bulk rerun on multiple URLs (AI scraper)

Rerun a manually configured scraper

Bulk rerun manual scraper on multiple URLs

Fetch Google SERP

Retrieve results

API Reference

MrScraper

bulk_rerun_manual_scraper

create_scraper — agent types

Exceptions

Compliance & Legal Risk

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`MrScraper`

`bulk_rerun_manual_scraper`

`create_scraper` — agent types