Stealth-oriented utilities for web scraping with rotating user agents, proxies, and Selenium helpers.

Orb Weaver Tools

Utilities for stealth-oriented scraping workflows: Selenium driver setup, rotating headers, proxy support, and browser interaction helpers.

Features

  • OrbDriver Selenium bootstrap with optional PIA VPN support
  • Spoofed requests with rotating headers and optional proxies
  • Proxy harvesting and validation helpers
  • Human-like browser interaction utilities
  • CLI for common actions (orb version, orb user-agent, orb spoof-request)
  • Retry/backoff for network calls
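The retry/backoff feature can be pictured with a small standalone sketch. This is not the package's own implementation: `retry_with_backoff` is a hypothetical helper, and the doubling delay schedule is an assumption, since the README does not say how delays grow. The defaults mirror the documented `ORB_MAX_RETRIES` and `ORB_BACKOFF_SECONDS` values.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    func: Callable[[], T],
    max_retries: int = 3,
    backoff_seconds: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func, retrying on any exception with growing delays.

    Hypothetical helper for illustration; not part of orbweaver-tools.
    """
    attempt = 0
    while True:
        try:
            return func()
        except Exception:
            # Give up once the retry budget is spent and re-raise the error.
            if attempt >= max_retries:
                raise
            # Assumed schedule: backoff, 2 * backoff, 4 * backoff, ...
            sleep(backoff_seconds * (2 ** attempt))
            attempt += 1
```

Passing `sleep` as a parameter keeps the helper easy to test, because a test can record the requested delays instead of actually waiting.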

Installation

Base package:

pip install orbweaver-tools

With Selenium support:

pip install "orbweaver-tools[selenium]"

With table and proxy-list parsing support for scraping:

pip install "orbweaver-tools[scraping]"

Everything:

pip install "orbweaver-tools[all]"

Quick Start

from orb.config import OrbConfig
from orb.scraper.utils import spoof_request

config = OrbConfig.from_env()
response = spoof_request("https://example.com", config=config)
print(response.status_code)

CLI

orb version
orb user-agent
orb spoof-request https://example.com --no-proxy
orb proxy-test http://1.2.3.4:8080 https://1.2.3.4:8080

Environment Variables

  • ORB_REQUEST_TIMEOUT (default: 15)
  • ORB_MAX_RETRIES (default: 3)
  • ORB_BACKOFF_SECONDS (default: 0.5)
  • ORB_USE_PROXIES (true/false, default: true)
  • ORB_USE_USER_AGENT (true/false, default: true)
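To make the defaults above concrete, here is a minimal sketch of how such variables could be read into a settings object. `SketchConfig` and its field names are hypothetical and only mirror the documented variable names and defaults; the real `OrbConfig.from_env()` may parse values differently. Treating only the string `true` (case-insensitive) as true is also an assumption.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


def _parse_bool(raw: str) -> bool:
    # Assumption: anything other than "true" (case-insensitive) is false.
    return raw.strip().lower() == "true"


@dataclass(frozen=True)
class SketchConfig:
    """Hypothetical stand-in for illustration; not the package's OrbConfig."""

    request_timeout: float = 15.0
    max_retries: int = 3
    backoff_seconds: float = 0.5
    use_proxies: bool = True
    use_user_agent: bool = True

    @classmethod
    def from_env(cls, env: Optional[Mapping[str, str]] = None) -> "SketchConfig":
        # Fall back to the process environment when no mapping is supplied.
        env = os.environ if env is None else env
        return cls(
            request_timeout=float(env.get("ORB_REQUEST_TIMEOUT", "15")),
            max_retries=int(env.get("ORB_MAX_RETRIES", "3")),
            backoff_seconds=float(env.get("ORB_BACKOFF_SECONDS", "0.5")),
            use_proxies=_parse_bool(env.get("ORB_USE_PROXIES", "true")),
            use_user_agent=_parse_bool(env.get("ORB_USE_USER_AGENT", "true")),
        )
```

For example, `SketchConfig.from_env({"ORB_USE_PROXIES": "false"})` keeps every other default and only turns proxies off.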

Development

poetry install --with dev --all-extras
poetry run pytest
poetry run ruff check .
poetry run mypy
poetry run bandit -q -r orb -x orb/common/vpn,orb/common/design,orb/spinner -s B311,B404,B603,B110
poetry run pip-audit

Release

  1. Bump version in pyproject.toml.
  2. Update CHANGELOG.md.
  3. Build and publish:
poetry build
poetry publish --build

Changelog

See CHANGELOG.md.

Documentation

Full documentation is available at https://tom3man.github.io/orb-weaver/.

You can build it locally with:

poetry install --with docs
poetry run mkdocs serve

Licence

MIT. See LICENSE.

Responsible Use

Only run scraping/automation against systems where you are authorized to do so.

Download files

Source Distribution

orbweaver_tools-1.3.3.tar.gz (15.8 kB view details)

Uploaded Source

Built Distribution

orbweaver_tools-1.3.3-py3-none-any.whl (20.0 kB view details)

Uploaded Python 3

File details

Details for the file orbweaver_tools-1.3.3.tar.gz.

File metadata

  • Download URL: orbweaver_tools-1.3.3.tar.gz
  • Upload date:
  • Size: 15.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.3.tar.gz
Algorithm Hash digest
SHA256 397737ab52d9e07461d2ed249b37271c0ba8ce117105d5c919d9a4738f639d11
MD5 027d030dee8cb4a749dbe877ae174d1d
BLAKE2b-256 f5ba6c819a9a2eb65967875fa42a673b3b5697d42bc86612e140e9370252998a
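A downloaded archive can be checked against the SHA256 digest listed above with the standard library alone. The sketch below is illustrative: `sha256_of` and `matches_expected` are hypothetical helper names, and the local path passed to them is whatever location the file was saved to.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for orbweaver_tools-1.3.3.tar.gz.
EXPECTED_SHA256 = "397737ab52d9e07461d2ed249b37271c0ba8ce117105d5c919d9a4738f639d11"


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are never fully loaded.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Compare a file's digest with the expected hex string, ignoring case."""
    return sha256_of(path) == expected.strip().lower()
```

Calling `matches_expected(Path("orbweaver_tools-1.3.3.tar.gz"))` returns `True` only when the local file's digest equals the published one.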

File details

Details for the file orbweaver_tools-1.3.3-py3-none-any.whl.

File metadata

  • Download URL: orbweaver_tools-1.3.3-py3-none-any.whl
  • Upload date:
  • Size: 20.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 71d4c441e84876b20115f7c94a80b848fae17ef15a855a08232a555747724a48
MD5 38c7da3f642113e62ce257928c72d7fc
BLAKE2b-256 1ca078b8d489d33c40b35d8ccd23e85927531818c93b4e2b8279dc9851f888c5
