Official Python SDK for ScrapeBadger - Async web scraping APIs for Twitter and more

ScrapeBadger Python SDK

The official Python SDK for ScrapeBadger - async web scraping APIs for Twitter, Google, Vinted, Reddit, and more.

Features

Async-first - Built with asyncio for high-performance concurrent scraping
Type-safe - Full type hints and Pydantic models for all responses
Automatic pagination - Iterator methods (*_all) that slow down as the rate limit window runs low
Resilient retries - Exponential backoff on 502, 503, and 504 responses
37+ Twitter endpoints - Tweets, users, lists, communities, trends, geo, real-time streams
19 Google product APIs - Search (with optional deferred AI Overview follow-up), Maps, News, Hotels, Trends (incl. topic autocomplete), Jobs, Shopping (+ merchant URL enrichment), Patents, Scholar (search + profiles + author + author citation + cite formats), Images, Videos, Finance, AI Mode, Lens, Autocomplete, Local Pack, Shorts, Flights, Products
Vinted scraping - Search items, item details, user profiles, brands, colors, markets
Reddit scraping - Search posts/subreddits/users/domains, subreddit posts, post comments, user profiles, trophies, wiki pages, moderators
Web scraping - Anti-bot bypass, JS rendering, and AI data extraction

Installation

pip install scrapebadger

Or with uv:

uv add scrapebadger

Quick Start

import asyncio
from scrapebadger import ScrapeBadger

async def main():
    async with ScrapeBadger(api_key="your-api-key") as client:
        # Get a user profile
        user = await client.twitter.users.get_by_username("elonmusk")
        print(f"{user.name} has {user.followers_count:,} followers")

        # Scrape a website
        result = await client.web.scrape("https://scrapebadger.com", format="markdown")
        print(result.content)

        # Search tweets
        tweets = await client.twitter.tweets.search("python programming")
        for tweet in tweets.data:
            print(f"@{tweet.username}: {tweet.text[:100]}...")

asyncio.run(main())
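
Because the client is async-first, several lookups can run at the same time. The sketch below shows one way to cap how many run concurrently using asyncio.Semaphore. Note that gather_limited and fake_lookup are hypothetical names used for illustration only; they are not part of the SDK, and fake_lookup merely stands in for a real client call.

```python
import asyncio
from typing import Awaitable, Callable, Iterable, TypeVar

T = TypeVar("T")
R = TypeVar("R")


async def gather_limited(
    items: Iterable[T],
    worker: Callable[[T], Awaitable[R]],
    limit: int = 5,
) -> list[R]:
    """Run worker(item) for every item, at most `limit` at a time, keeping input order."""
    semaphore = asyncio.Semaphore(limit)

    async def run_one(item: T) -> R:
        # Only `limit` coroutines can hold the semaphore at once.
        async with semaphore:
            return await worker(item)

    return await asyncio.gather(*(run_one(item) for item in items))


async def fake_lookup(username: str) -> str:
    # Hypothetical stand-in for an SDK call; swap in a real client method.
    await asyncio.sleep(0)
    return username.upper()


results = asyncio.run(gather_limited(["alice", "bob"], fake_lookup, limit=2))
print(results)
```

Inside the async with block from the Quick Start, the worker could be client.twitter.users.get_by_username, so that a list of usernames is fetched with bounded concurrency.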

Authentication

Get your API key from scrapebadger.com and pass it to the client:

from scrapebadger import ScrapeBadger

client = ScrapeBadger(api_key="sb_live_xxxxxxxxxxxxx")

You can also set the SCRAPEBADGER_API_KEY environment variable:

export SCRAPEBADGER_API_KEY="sb_live_xxxxxxxxxxxxx"
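
If you prefer to fail fast with a clear message before constructing the client, the key can be resolved explicitly. This is a minimal sketch using only the standard library; resolve_api_key is a hypothetical helper, not an SDK function, and the env parameter exists only so the example can run without touching your real environment.

```python
import os
from typing import Mapping, Optional


def resolve_api_key(
    explicit: Optional[str] = None,
    env: Mapping[str, str] = os.environ,
) -> str:
    """Return the explicit key if given, else SCRAPEBADGER_API_KEY, else raise."""
    key = (explicit or env.get("SCRAPEBADGER_API_KEY", "")).strip()
    if not key:
        raise RuntimeError("Set SCRAPEBADGER_API_KEY or pass api_key explicitly")
    return key


# Demo with an injected environment so the snippet has no side effects.
demo_key = resolve_api_key(env={"SCRAPEBADGER_API_KEY": "sb_live_xxxxxxxxxxxxx"})
print(demo_key)
```

The result can then be passed as ScrapeBadger(api_key=resolve_api_key()).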

Available APIs

API	Description	Documentation
Web Scraping	Scrape any website with JS rendering, anti-bot bypass, and AI extraction	Web Scraping Guide
Twitter	37+ endpoints for tweets, users, lists, communities, trends, and real-time streams	Twitter Guide
Google	19 products — Search, Maps, News, Hotels, Trends, Jobs, Shopping, Patents, Scholar, Images, Videos, Finance, AI Mode, Lens, Autocomplete, Local, Shorts, Flights, Products	Google Guide
Vinted	Search items, item details, user profiles, brands, colors, statuses, and markets	Vinted Guide
Reddit	Search posts, subreddits, users, and domains; fetch post comments, subreddit rules, moderators, wiki pages, user trophies	Reddit Guide
Amazon	14 endpoints — search, autocomplete, product detail, offers, reviews, bestsellers, new releases, deals, category browse, seller profile/products/feedback, markets, categories	Amazon Guide
TikTok	25 endpoints — user profile/videos/followers/following/liked/reposts, video detail/comments/replies/related/transcript/oEmbed, search (general/videos/hashtags/users), music detail/videos, hashtag detail/videos, trending videos/hashtags/songs, ad library, regions	TikTok Guide

Error Handling

import asyncio

from scrapebadger import (
    ScrapeBadger,
    ScrapeBadgerError,
    AuthenticationError,
    RateLimitError,
    InsufficientCreditsError,
    NotFoundError,
    ValidationError,
    ServerError,
)

async def main():
    async with ScrapeBadger(api_key="your-key") as client:
        try:
            user = await client.twitter.users.get_by_username("elonmusk")
        except AuthenticationError:
            print("Invalid API key")
        except RateLimitError as e:
            print(f"Rate limited. Retry after {e.retry_after} seconds")
            print(f"Limit: {e.limit}, Remaining: {e.remaining}")
        except InsufficientCreditsError:
            print("Out of credits! Purchase more at scrapebadger.com")
        except NotFoundError:
            print("User not found")
        except ValidationError as e:
            print(f"Invalid parameters: {e}")
        except ServerError:
            print("Server error, try again later")
        except ScrapeBadgerError as e:
            # Catch-all for any other SDK error; keep this handler last
            print(f"API error: {e}")

asyncio.run(main())
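
Rather than only printing the retry_after value, a caller may want to wait and try again. The sketch below is one possible wrapper, written against the standard library so it runs on its own. retry_on_rate_limit and FakeRateLimit are hypothetical names; FakeRateLimit stands in for RateLimitError in the demo, and the 1-second fallback when retry_after is missing is an assumption, not documented SDK behavior.

```python
import asyncio
from typing import Awaitable, Callable, Type, TypeVar

T = TypeVar("T")


async def retry_on_rate_limit(
    call: Callable[[], Awaitable[T]],
    rate_limit_error: Type[Exception],
    max_attempts: int = 3,
    sleep: Callable[[float], Awaitable[None]] = asyncio.sleep,
) -> T:
    """Await call(); on a rate-limit error, wait retry_after seconds and try again."""
    for attempt in range(1, max_attempts + 1):
        try:
            return await call()
        except rate_limit_error as exc:
            if attempt == max_attempts:
                raise
            # Assumed fallback of 1 second if the exception has no retry_after.
            await sleep(float(getattr(exc, "retry_after", None) or 1.0))
    raise RuntimeError("max_attempts must be at least 1")


class FakeRateLimit(Exception):
    """Stand-in for the SDK's RateLimitError, used only in this demo."""
    retry_after = 2


async def demo() -> tuple[str, list[float]]:
    waits: list[float] = []
    calls = {"n": 0}

    async def flaky() -> str:
        calls["n"] += 1
        if calls["n"] < 3:
            raise FakeRateLimit()
        return "ok"

    async def record_sleep(seconds: float) -> None:
        waits.append(seconds)  # Record the wait instead of actually sleeping.

    result = await retry_on_rate_limit(flaky, FakeRateLimit, sleep=record_sleep)
    return result, waits


demo_result, demo_waits = asyncio.run(demo())
print(demo_result, demo_waits)
```

With the SDK, call would be something like lambda: client.twitter.users.get_by_username("elonmusk") and rate_limit_error would be RateLimitError.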

Configuration

Custom Timeout and Retries

from scrapebadger import ScrapeBadger

client = ScrapeBadger(
    api_key="your-key",
    timeout=120.0,      # Request timeout in seconds (default: 300)
    max_retries=5,      # Retry attempts (default: 10)
)

Advanced Configuration

from scrapebadger import ScrapeBadger
from scrapebadger._internal import ClientConfig

config = ClientConfig(
    api_key="your-key",
    base_url="https://scrapebadger.com",
    timeout=300.0,
    connect_timeout=10.0,
    max_retries=10,
    retry_on_status=(502, 503, 504),
    headers={"X-Custom-Header": "value"},
)

client = ScrapeBadger(config=config)

Retry Behavior

The SDK automatically retries requests that fail with 502, 503, or 504 status codes using exponential backoff (1s, 2s, 4s, 8s, ...). Each retry logs a warning:

⚠ 503 Service Unavailable — retrying in 4s (attempt 3/10)

To see these warnings, configure Python logging:

import logging
logging.basicConfig(level=logging.WARNING)
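
To get a feel for how long a request can spend retrying, the schedule above can be reproduced in a few lines. This is a sketch of plain doubling from a 1-second base, which matches the documented sequence and the sample log line (attempt 3 waits 4s). Whether the SDK also caps the delay or adds jitter is not stated here, so the total below is an assumption-based upper estimate.

```python
def backoff_delays(max_retries: int, base: float = 1.0, factor: float = 2.0) -> list[float]:
    """Delay before each retry: base * factor**(attempt - 1) for attempts 1..max_retries."""
    return [base * factor ** attempt for attempt in range(max_retries)]


delays = backoff_delays(10)
print(delays[:4])   # first four waits: 1, 2, 4, 8 seconds
print(sum(delays))  # total wait if all 10 retries are used and no cap applies
```

Under these assumptions, the default of 10 retries adds up to 1023 seconds of waiting, which is worth keeping in mind when choosing max_retries for latency-sensitive jobs.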

Rate Limit Aware Pagination

When using *_all pagination methods, the SDK reads X-RateLimit-Remaining and X-RateLimit-Reset headers from each response. When remaining requests drop below 20% of your tier's limit, pagination automatically slows down to spread requests across the remaining window — preventing 429 errors. A warning is logged when throttling activates:

⚠ Rate limit: 25/300 remaining (resets in 42s), throttling pagination to ~0.6 req/s

This works transparently with all tier levels (Free: 60/min, Basic: 300/min, Pro: 1000/min, Enterprise: 5000/min).
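
The throttling rule described above can be sketched as a small function. This is an illustration, not the SDK's internal code: pagination_delay is a hypothetical name, and spreading the remaining requests evenly over the reset window is an assumption that happens to reproduce the numbers in the sample log line (25 remaining over 42s is roughly 0.6 req/s).

```python
def pagination_delay(
    remaining: int,
    limit: int,
    reset_in: float,
    threshold: float = 0.2,
) -> float:
    """Seconds to wait before the next page request."""
    if remaining >= threshold * limit:
        return 0.0  # Plenty of budget left: no throttling.
    if remaining <= 0:
        return reset_in  # Budget exhausted: wait for the window to reset.
    # Spread the remaining requests evenly across the rest of the window.
    return reset_in / remaining


delay = pagination_delay(remaining=25, limit=300, reset_in=42)
rate = 1 / delay
print(f"wait {delay:.2f}s between requests (~{rate:.1f} req/s)")
```

With a limit of 300, throttling starts once fewer than 60 requests remain; above that threshold the function returns 0 and pagination runs at full speed.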

Development

Setup

# Clone the repository
git clone https://github.com/scrape-badger/scrapebadger-python.git
cd scrapebadger-python

# Install dependencies with uv
uv sync --dev

# Install pre-commit hooks
uv run pre-commit install

Running Tests

# Run all tests
uv run pytest

# Run with coverage
uv run pytest --cov=src/scrapebadger --cov-report=html

# Run specific tests
uv run pytest tests/test_client.py -v

Code Quality

# Lint
uv run ruff check src/ tests/

# Format
uv run ruff format src/ tests/

# Type check
uv run mypy src/

# All checks
uv run ruff check src/ tests/ && uv run ruff format --check src/ tests/ && uv run mypy src/

Contributing

Contributions are welcome! Please read our Contributing Guide for details.

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Make your changes
Run tests and linting (uv run pytest && uv run ruff check)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

Documentation: docs.scrapebadger.com
Issues: GitHub Issues
Email: support@scrapebadger.com
Discord: Join our community

Made with ❤️ by ScrapeBadger

Release history

Version	Release date
0.15.5	Jul 2, 2026
0.15.3	Jun 29, 2026
0.15.2	Jun 23, 2026
0.15.1	Jun 21, 2026
0.15.0	Jun 21, 2026
0.14.0	Jun 13, 2026 (this version)
0.13.1	Jun 13, 2026
0.12.0	Jun 10, 2026
0.11.0	Jun 3, 2026
0.10.0	May 30, 2026
0.9.0	May 29, 2026
0.8.3	May 28, 2026
0.8.2	May 28, 2026
0.8.1	May 28, 2026
0.8.0	May 28, 2026
0.7.0	Apr 20, 2026
0.5.2	Mar 31, 2026
0.5.1	Mar 31, 2026
0.5.0	Mar 30, 2026
0.4.0	Mar 15, 2026
0.3.1	Mar 10, 2026
0.3.0	Mar 7, 2026
0.2.0	Mar 4, 2026
0.1.10	Feb 20, 2026
0.1.9	Feb 20, 2026
0.1.8	Jan 31, 2026
0.1.7	Dec 28, 2025
0.1.6	Dec 28, 2025
0.1.5	Dec 28, 2025
0.1.4	Dec 28, 2025
0.1.3	Dec 28, 2025
0.1.2	Dec 28, 2025
0.1.1	Dec 27, 2025
0.1.0	Dec 27, 2025

Download files

Source Distribution

scrapebadger-0.14.0.tar.gz (93.7 kB)

Uploaded Jun 13, 2026 Source

Built Distribution

scrapebadger-0.14.0-py3-none-any.whl (140.8 kB)

Uploaded Jun 13, 2026 Python 3

File details

Details for the file scrapebadger-0.14.0.tar.gz.

File metadata

Download URL: scrapebadger-0.14.0.tar.gz
Upload date: Jun 13, 2026
Size: 93.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for scrapebadger-0.14.0.tar.gz
Algorithm	Hash digest
SHA256	`da2885d75267374234ef775af7667f17f6eeee6eba4fd52760a7d3b4bb979032`
MD5	`ae5147ebc826340f7a21518216cd5c17`
BLAKE2b-256	`0bd9a14f4fafb3b7d45c5176e4f95fc5f4a9ec09df13ca62c49a3faf8115848d`
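
A downloaded file can be checked against the published SHA256 digest with the standard library. The sketch below is one way to do it; sha256_of and matches_published are hypothetical helper names, and the demo hashes a small temporary file rather than the real distribution so that it runs anywhere.

```python
import hashlib
import hmac
import tempfile
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hex SHA256 of a file, read in chunks so large files are not loaded at once."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path: Path, expected_hex: str) -> bool:
    """Compare the file's digest with the published one (case-insensitive)."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())


# Demo on a temporary file; for a real check, pass the downloaded file's path
# and the SHA256 value from the table above.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "sample.bin"
    sample.write_bytes(b"hello")
    demo_ok = matches_published(sample, hashlib.sha256(b"hello").hexdigest())
print(demo_ok)
```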

Provenance

The following attestation bundles were made for scrapebadger-0.14.0.tar.gz:

Publisher: publish.yml on scrape-badger/scrapebadger-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: scrapebadger-0.14.0.tar.gz
- Subject digest: da2885d75267374234ef775af7667f17f6eeee6eba4fd52760a7d3b4bb979032
- Sigstore transparency entry: 1809432443
- Sigstore integration time: Jun 13, 2026
Source repository:
- Permalink: scrape-badger/scrapebadger-python@667cb384f856cbef51767a119da1c61594de85fb
- Branch / Tag: refs/tags/v0.14.0
- Owner: https://github.com/scrape-badger
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@667cb384f856cbef51767a119da1c61594de85fb
- Trigger Event: release

File details

Details for the file scrapebadger-0.14.0-py3-none-any.whl.

File metadata

Download URL: scrapebadger-0.14.0-py3-none-any.whl
Upload date: Jun 13, 2026
Size: 140.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for scrapebadger-0.14.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5fbf6686f32d55e73513f7bbad8de35ec7865b46bd5843801ffc07c504310710`
MD5	`cb595f64b46a8f7ca45ee20d17dc98a9`
BLAKE2b-256	`0ea7fc7bdd3404e690f16fe217ab066a22b4d26ac80b268c0bad47ce99950b57`

Provenance

The following attestation bundles were made for scrapebadger-0.14.0-py3-none-any.whl:

Publisher: publish.yml on scrape-badger/scrapebadger-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: scrapebadger-0.14.0-py3-none-any.whl
- Subject digest: 5fbf6686f32d55e73513f7bbad8de35ec7865b46bd5843801ffc07c504310710
- Sigstore transparency entry: 1809432483
- Sigstore integration time: Jun 13, 2026
Source repository:
- Permalink: scrape-badger/scrapebadger-python@667cb384f856cbef51767a119da1c61594de85fb
- Branch / Tag: refs/tags/v0.14.0
- Owner: https://github.com/scrape-badger
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@667cb384f856cbef51767a119da1c61594de85fb
- Trigger Event: release
