Lightweight utilities for web scraping with requests and Selenium.

These details have not been verified by PyPI

Project description

ScraperETC

ScraperETC is a lightweight Python package that streamlines browser automation and HTTP scraping. It wraps Selenium and requests in clean, Pythonic interfaces that remove the usual boilerplate, especially for waits, drivers, and headers. It is designed with bot detection in mind and uses defaults intended to reduce the chance of blocks or bans.

Why Use ScraperETC?

Selenium imports are long, clunky, and almost impossible to remember. This package wraps what you need so you don't have to memorize boilerplate.
HTTP requests are often blocked by anti-bot filters. ScraperETC provides default headers that reduce detection without extra effort.
Verifying file downloads shouldn't require writing custom content checks. This package includes built-in PDF validation tools to save you time.

ScraperETC was built to reduce the friction of browser automation and HTTP scraping, especially when using headless Chrome.
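
As a rough illustration of the default-headers idea described above, the sketch below builds a browser-like header set that a caller can extend. The header values, the BROWSER_HEADERS constant, and the build_headers() helper are assumptions made for this example and are not taken from ScraperETC's source; the package's actual defaults may differ.

```python
from typing import Optional

# Assumed header values for illustration only; ScraperETC's real defaults may differ.
BROWSER_HEADERS = {
    "User-Agent": (
        "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
        "AppleWebKit/537.36 (KHTML, like Gecko) "
        "Chrome/126.0.0.0 Safari/537.36"
    ),
    "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8",
    "Accept-Language": "en-US,en;q=0.9",
}


def build_headers(extra: Optional[dict[str, str]] = None) -> dict[str, str]:
    """Return a copy of the defaults; caller-supplied headers take precedence."""
    headers = dict(BROWSER_HEADERS)  # copy so the module-level defaults stay untouched
    if extra:
        headers.update(extra)
    return headers


# Hypothetical usage: add a Referer on top of the defaults.
headers = build_headers({"Referer": "https://example.com/"})
```

A dictionary like this can be passed to requests.get(url, headers=headers, timeout=10). Returning a copy keeps per-request overrides from leaking into later requests.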

Features

Minimal wrappers for selenium.webdriver.Chrome and undetected_chromedriver to get up and running fast
webdriver_wait() handles selector validation and WebDriverWait behind the scenes
http_GET() adds default headers that mimic a modern browser to help you evade bot detection
Built-in tools for validating PDF downloads and checking response status
Optional exception-raising on failure to let you choose between passive and strict workflows
Currently supports only the Chrome web browser, which must be installed and available on your system PATH
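
The PDF validation and optional exception-raising features listed above can be pictured with a minimal sketch like the one below. It checks that a response body starts with the %PDF- signature and, if a Content-Type is supplied, that it mentions application/pdf. The validate_pdf_bytes() function and its raise_on_fail parameter are hypothetical names chosen for this example; they are not ScraperETC's API, and the package may validate PDFs differently.

```python
from typing import Optional

PDF_MAGIC = b"%PDF-"  # PDF files begin with this signature


def validate_pdf_bytes(
    content: bytes,
    content_type: Optional[str] = None,
    raise_on_fail: bool = False,
) -> bool:
    """Return True if the bytes look like a PDF (hypothetical helper).

    With raise_on_fail=True a failed check raises ValueError instead of
    returning False, mirroring a strict workflow.
    """
    problems = []
    if not content.startswith(PDF_MAGIC):
        problems.append("body does not start with the %PDF- signature")
    if content_type is not None and "application/pdf" not in content_type.lower():
        problems.append(f"unexpected Content-Type: {content_type!r}")

    if problems and raise_on_fail:
        raise ValueError("; ".join(problems))
    return not problems


# Passive mode: an HTML block page served in place of the PDF simply returns False.
ok = validate_pdf_bytes(b"%PDF-1.7\n", "application/pdf")
blocked = validate_pdf_bytes(b"<html>Access denied</html>", "text/html")
```

The passive form suits batch jobs that log failures and move on, while the strict form suits pipelines where a bad download should stop the run.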

Installation

pip install scraper-etc

Requires Python 3.10 or later.

Example Usage

from scraper_etc import setup_chrome_driver, webdriver_wait, http_GET_valid_pdf

# start a headless Chrome driver (using undetected_chromedriver under the hood)
driver = setup_chrome_driver(headless=True)

try:
    # load a page before waiting on it (placeholder URL)
    driver.get("https://example.com")

    # wait for a div with a specific ID to appear
    elem = webdriver_wait(driver, by="XPATH", selector="//div[@id='main']")
finally:
    # close the browser even if the wait fails
    driver.quit()

# validate a remote PDF and save it
res = http_GET_valid_pdf("https://example.com/sample.pdf")
if res:
    with open("sample.pdf", "wb") as f:
        f.write(res.content)
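
The by="XPATH" argument in the example suggests that webdriver_wait() turns a strategy name into a Selenium locator before waiting. One way such selector validation could work is sketched below. The BY_STRATEGIES table and resolve_locator() function are hypothetical and not taken from ScraperETC; the right-hand strings are the values of Selenium's By constants (for instance, By.XPATH is "xpath").

```python
# Strategy names mapped to the string values of Selenium's By constants.
BY_STRATEGIES = {
    "ID": "id",
    "XPATH": "xpath",
    "NAME": "name",
    "TAG_NAME": "tag name",
    "CLASS_NAME": "class name",
    "CSS_SELECTOR": "css selector",
    "LINK_TEXT": "link text",
    "PARTIAL_LINK_TEXT": "partial link text",
}


def resolve_locator(by: str, selector: str) -> tuple[str, str]:
    """Validate a strategy name and selector; return a (by, selector) locator tuple."""
    key = by.strip().upper().replace(" ", "_")  # accept "xpath", "css selector", ...
    if key not in BY_STRATEGIES:
        allowed = ", ".join(sorted(BY_STRATEGIES))
        raise ValueError(f"unknown strategy {by!r}; expected one of: {allowed}")
    if not selector.strip():
        raise ValueError("selector must not be empty")
    return BY_STRATEGIES[key], selector


locator = resolve_locator("XPATH", "//div[@id='main']")
```

In plain Selenium, a tuple like this can be handed to expected_conditions.presence_of_element_located(locator) inside WebDriverWait(driver, timeout).until(...). Failing early on a misspelled strategy name gives a clearer error than a timeout would.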

Development

ScraperETC includes a modern CI/CD pipeline:

Ruff for linting and auto-formatting
mypy for static type checking
Bandit for security scanning
pytest with unit tests covering all core logic
Codecov integration for test coverage
GitHub Actions CI to run it all on push
Dependabot for automated dependency updates

CI workflows live in .github/workflows.

License

This project is released under CC0 (public domain). You are free to use, modify, and redistribute it without restriction.

Project details

Release history

0.1.5 (Jul 22, 2025)
0.1.4 (Jul 14, 2025), this version
0.1.3 (Jul 14, 2025)
0.1.2 (Jul 14, 2025)
0.1.1 (Jul 14, 2025)

Download files

Source Distribution

scraper_etc-0.1.4.tar.gz (10.2 kB)

Uploaded Jul 14, 2025 Source

Built Distribution

scraper_etc-0.1.4-py3-none-any.whl (8.6 kB)

Uploaded Jul 14, 2025 Python 3

File details

Details for the file scraper_etc-0.1.4.tar.gz.

File metadata

Download URL: scraper_etc-0.1.4.tar.gz
Upload date: Jul 14, 2025
Size: 10.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.13

File hashes

Hashes for scraper_etc-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`19777fbd6169b5679d8946d1afcfff0330f9ca9e313289754c1a765ce4431604`
MD5	`4c61d97212d4ac5e08cc06d3d2177330`
BLAKE2b-256	`9ce1791ba19b5fbe1a08f3d00f36e0583ae4ab6591fdbf21e246e2007c6c93e0`

File details

Details for the file scraper_etc-0.1.4-py3-none-any.whl.

File metadata

Download URL: scraper_etc-0.1.4-py3-none-any.whl
Upload date: Jul 14, 2025
Size: 8.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.13

File hashes

Hashes for scraper_etc-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4086378ac8f6ae9a06890540b11cb248a8cf9b9d19364216ded7cd0e21395c96`
MD5	`0396c7d4b70d0f5620255c0c45bba98e`
BLAKE2b-256	`b71c9b39b9f2e9ef21fcca6d3a125e85fc8e33540d456e2fbd233a124ef14563`
