Stealth-oriented utilities for web scraping with rotating user agents, proxies, and Selenium helpers.

Orb Weaver Tools

Utilities for stealth-oriented scraping workflows: Selenium driver setup, rotating headers, proxy support, and browser interaction helpers.

Features

  • OrbDriver Selenium bootstrap with optional PIA VPN support
  • Spoofed requests with rotating headers and optional proxies
  • Proxy harvesting and validation helpers
  • Human-like browser interaction utilities
  • CLI for common actions (orb version, orb user-agent, orb spoof-request, orb proxy-test)
  • Retry/backoff for network calls
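
To make the retry/backoff feature concrete, here is a minimal sketch of one common policy: retry a failing call and double the delay after each failure. It is not taken from the package. The function name `retry_with_backoff`, its parameters, and the exponential schedule are all assumptions for illustration; the library's actual policy may differ.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    call: Callable[[], T],
    max_retries: int = 3,           # assumed to mean retries after the first attempt
    backoff_seconds: float = 0.5,   # assumed base delay before the first retry
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run call(), retrying on any exception with exponentially growing delays."""
    for attempt in range(max_retries + 1):
        try:
            return call()
        except Exception:
            if attempt == max_retries:
                raise  # out of retries: surface the last error
            # Assumed policy: the delay doubles on each failed attempt.
            sleep(backoff_seconds * (2 ** attempt))
    # Only reached when max_retries is negative, so no attempt was made.
    raise ValueError("max_retries must be >= 0")
```

The `sleep` parameter is injectable so the schedule can be checked in tests without waiting. In real use, catching a narrower exception type than `Exception` is usually preferable.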

Installation

Base package:

pip install orbweaver-tools

With Selenium support:

pip install "orbweaver-tools[selenium]"

With table and proxy parsing support for scraping:

pip install "orbweaver-tools[scraping]"

Everything:

pip install "orbweaver-tools[all]"

Quick Start

from orb.config import OrbConfig
from orb.scraper.utils import spoof_request

config = OrbConfig.from_env()
response = spoof_request("https://example.com", config=config)
print(response.status_code)

CLI

orb version
orb user-agent
orb spoof-request https://example.com --no-proxy
orb proxy-test http://1.2.3.4:8080 https://1.2.3.4:8080

Environment Variables

  • ORB_REQUEST_TIMEOUT (default: 15)
  • ORB_MAX_RETRIES (default: 3)
  • ORB_BACKOFF_SECONDS (default: 0.5)
  • ORB_USE_PROXIES (true/false, default: true)
  • ORB_USE_USER_AGENT (true/false, default: true)

Development

poetry install --with dev --all-extras
poetry run pytest
poetry run ruff check .
poetry run mypy
poetry run bandit -q -r orb -x orb/common/vpn,orb/common/design,orb/spinner -s B311,B404,B603,B110
poetry run pip-audit

Release

  1. Bump version in pyproject.toml.
  2. Update CHANGELOG.md.
  3. Build and publish:
poetry build
poetry publish --build

Changelog

See CHANGELOG.md.

Documentation

Full documentation is available at https://tom3man.github.io/orb-weaver/.

You can build it locally with:

poetry install --with docs
poetry run mkdocs serve

Licence

MIT. See LICENSE.

Responsible Use

Only run scraping/automation against systems where you are authorized to do so.

Download files

Source Distribution

orbweaver_tools-1.3.2.tar.gz (15.8 kB)

Uploaded Source

Built Distribution

orbweaver_tools-1.3.2-py3-none-any.whl (20.0 kB)

Uploaded Python 3

File details

Details for the file orbweaver_tools-1.3.2.tar.gz.

File metadata

  • Download URL: orbweaver_tools-1.3.2.tar.gz
  • Upload date:
  • Size: 15.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.2.tar.gz
Algorithm Hash digest
SHA256 76b3b2908f748ad25240012439bd7059d3b43b06f23ba5a45a5e306f87752331
MD5 aaa8432466cfaca3f2b7d300b960709d
BLAKE2b-256 a09e8b504f3b7b14162f36436439c4614a0e50d686dd721526af271e18409f60

File details

Details for the file orbweaver_tools-1.3.2-py3-none-any.whl.

File metadata

  • Download URL: orbweaver_tools-1.3.2-py3-none-any.whl
  • Upload date:
  • Size: 20.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.2-py3-none-any.whl
Algorithm Hash digest
SHA256 1ba4d53a1a274a4369bdd701426c77e3dbbf44989a6046db4dbc571ab4514d5f
MD5 33a6c5be8f4c51e5e3b7842dc146d6f3
BLAKE2b-256 81aea2ff0071dd8f165c9791a9aed5398d6b3e1ba3234283513a8dfa2972c0de
