Skip to main content

Stealth-oriented utilities for web scraping with rotating user agents, proxies, and Selenium helpers.

Project description

Orb Weaver Tools

Utilities for stealth-oriented scraping workflows: Selenium driver setup, rotating headers, proxy support, and browser interaction helpers.

Features

  • OrbDriver Selenium bootstrap with optional PIA VPN support
  • Spoofed requests with rotating headers and optional proxies
  • Proxy harvesting and validation helpers
  • Human-like browser interaction utilities
  • CLI for common actions (orb version, orb user-agent, orb spoof-request)
  • Retry/backoff for network calls

Installation

Base package:

pip install orbweaver-tools

With Selenium support:

pip install "orbweaver-tools[selenium]"

With scraping table/proxy parsing support:

pip install "orbweaver-tools[scraping]"

Everything:

pip install "orbweaver-tools[all]"

Quick Start

from orb.config import OrbConfig
from orb.scraper.utils import spoof_request

config = OrbConfig.from_env()
response = spoof_request("https://example.com", config=config)
print(response.status_code)

CLI

orb version
orb user-agent
orb spoof-request https://example.com --no-proxy
orb proxy-test http://1.2.3.4:8080 https://1.2.3.4:8080

Environment Variables

  • ORB_REQUEST_TIMEOUT (default: 15)
  • ORB_MAX_RETRIES (default: 3)
  • ORB_BACKOFF_SECONDS (default: 0.5)
  • ORB_USE_PROXIES (true/false, default: true)
  • ORB_USE_USER_AGENT (true/false, default: true)

Development

poetry install --with dev --all-extras
poetry run pytest
poetry run ruff check .
poetry run mypy
poetry run bandit -q -r orb -x orb/common/vpn,orb/common/design,orb/spinner -s B311,B404,B603,B110
poetry run pip-audit

Release

  1. Bump version in pyproject.toml.
  2. Update CHANGELOG.md.
  3. Build and publish:
poetry build
poetry publish --build

Changelog

See CHANGELOG.md.

Documentation

Full documentation is available at https://tom3man.github.io/orb-weaver/.

You can build it locally with:

poetry install --with docs
poetry run mkdocs serve

Licence

MIT. See LICENSE.

Responsible Use

Only run scraping/automation against systems where you are authorized to do so.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orbweaver_tools-1.3.1.tar.gz (15.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

orbweaver_tools-1.3.1-py3-none-any.whl (20.0 kB view details)

Uploaded Python 3

File details

Details for the file orbweaver_tools-1.3.1.tar.gz.

File metadata

  • Download URL: orbweaver_tools-1.3.1.tar.gz
  • Upload date:
  • Size: 15.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.1.tar.gz
Algorithm Hash digest
SHA256 fb061852bd977e50203f661663f79e195c46686d47174fb494999e56d5cda534
MD5 99b97a8be4f88df964c8925be696b373
BLAKE2b-256 a6d14ef01e4f495e7f10f2762f4fc56b18f1142375f8bb45fc1b9ddc44085063

See more details on using hashes here.

File details

Details for the file orbweaver_tools-1.3.1-py3-none-any.whl.

File metadata

  • Download URL: orbweaver_tools-1.3.1-py3-none-any.whl
  • Upload date:
  • Size: 20.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7c1de50f4c6f5a73b66000c76a389c184fdffd1e90b94070b5e6f515ffde2627
MD5 0957eb97638b75b0e5b15e50f369c102
BLAKE2b-256 86ae1c1c5a48d4658a5242ac7f3ea18e08f8231decc050822ee73dc5fac301fe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page