Skip to main content

Stealth-oriented utilities for web scraping with rotating user agents, proxies, and Selenium helpers.

Project description

Orb Weaver Tools

Utilities for stealth-oriented scraping workflows: Selenium driver setup, rotating headers, proxy support, and browser interaction helpers.

Features

  • OrbDriver Selenium bootstrap with optional PIA VPN support
  • Spoofed requests with rotating headers and optional proxies
  • Proxy harvesting and validation helpers
  • Human-like browser interaction utilities
  • CLI for common actions (orb version, orb user-agent, orb spoof-request)
  • Retry/backoff for network calls

Installation

Base package:

pip install orbweaver-tools

With Selenium support:

pip install "orbweaver-tools[selenium]"

With scraping table/proxy parsing support:

pip install "orbweaver-tools[scraping]"

Everything:

pip install "orbweaver-tools[all]"

Quick Start

from orb.config import OrbConfig
from orb.scraper.utils import spoof_request

config = OrbConfig.from_env()
response = spoof_request("https://example.com", config=config)
print(response.status_code)

CLI

orb version
orb user-agent
orb spoof-request https://example.com --no-proxy
orb proxy-test http://1.2.3.4:8080 https://1.2.3.4:8080

Environment Variables

  • ORB_REQUEST_TIMEOUT (default: 15)
  • ORB_MAX_RETRIES (default: 3)
  • ORB_BACKOFF_SECONDS (default: 0.5)
  • ORB_USE_PROXIES (true/false, default: true)
  • ORB_USE_USER_AGENT (true/false, default: true)

Development

poetry install --with dev --all-extras
poetry run pytest
poetry run ruff check .
poetry run mypy
poetry run bandit -q -r orb -x orb/common/vpn,orb/common/design,orb/spinner -s B311,B404,B603,B110
poetry run pip-audit

Release

  1. Bump version in pyproject.toml.
  2. Update CHANGELOG.md.
  3. Build and publish:
poetry build
poetry publish --build

Changelog

See CHANGELOG.md.

Documentation

Full documentation is available at https://tom3man.github.io/orb-weaver/.

You can build it locally with:

poetry install --with docs
poetry run mkdocs serve

Licence

MIT. See LICENSE.

Responsible Use

Only run scraping/automation against systems where you are authorized to do so.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orbweaver_tools-1.3.4.tar.gz (15.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

orbweaver_tools-1.3.4-py3-none-any.whl (20.0 kB view details)

Uploaded Python 3

File details

Details for the file orbweaver_tools-1.3.4.tar.gz.

File metadata

  • Download URL: orbweaver_tools-1.3.4.tar.gz
  • Upload date:
  • Size: 15.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.4.tar.gz
Algorithm Hash digest
SHA256 acd5bfad75d537a8c9d4cec03b0d1bc1d75d0e8f554a47f1d6269794b0b37c4e
MD5 9d05862fd47f38534f0590e79ec71c8a
BLAKE2b-256 b2ed6c8bb3436bc182a648d60db4c8ff237ac918dc7f21c09c9659c0acd2b99c

See more details on using hashes here.

File details

Details for the file orbweaver_tools-1.3.4-py3-none-any.whl.

File metadata

  • Download URL: orbweaver_tools-1.3.4-py3-none-any.whl
  • Upload date:
  • Size: 20.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.17.0-14-generic

File hashes

Hashes for orbweaver_tools-1.3.4-py3-none-any.whl
Algorithm Hash digest
SHA256 674fcd1dfb82ce8c55a6a1e7673bc68b96108b8d08a84301a04d7d61c5c050ec
MD5 c5b523e62fcfd5d2e6c4b054eb9c4539
BLAKE2b-256 7b9f175c48177068d55cd87c2c173a0b32e68fe791a41c331bb4092dc540cebb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page