A fast CLI static web scraper

Project description

StaticWeb

A fast, multi-threaded CLI tool to scrape static websites.

Installation

pip install staticweb

Usage

sw https://example.com --link

Features

  • Extract links (--link)
  • Extract paragraphs (--para)
  • Extract full text (--all)
  • Extract custom tags (--tag h1)
  • JSON output (--json)
  • Save output (--save file.txt)
  • Multi-threaded (--threads)
  • Rich UI with progress bar
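
To give a feel for what link and tag extraction on a static page involves, here is a minimal sketch using only Python's standard library. It is not StaticWeb's actual implementation, which this page does not show. The names `TagCollector` and `extract` are hypothetical and exist only for this illustration.

```python
import json
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Collect href values from <a> tags and the text inside one chosen tag.

    Hypothetical helper for illustration; not part of StaticWeb.
    """

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.links = []
        self.texts = []
        self._depth = 0    # greater than 0 while inside the chosen tag
        self._buffer = []  # text pieces seen inside the chosen tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)
        if tag == self.tag:
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                text = "".join(self._buffer).strip()
                if text:
                    self.texts.append(text)
                self._buffer = []

    def handle_data(self, data):
        if self._depth > 0:
            self._buffer.append(data)


def extract(html, tag="p"):
    """Return the links and the text of every `tag` element in `html`."""
    parser = TagCollector(tag)
    parser.feed(html)
    parser.close()
    return {"links": parser.links, "texts": parser.texts}


sample = '<h1>Title</h1><p>Hello <a href="/about">about</a></p>'
print(json.dumps(extract(sample, tag="p")))
```

The sketch works on an HTML string, so fetching the page and running several pages in parallel are left out.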

Example

sw https://example.com --para
sw https://example.com --tag h1
sw https://example.com --link --json

Disclaimer

Use responsibly. Respect each website's terms of service and its robots.txt.
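
One way to check robots.txt rules before scraping is Python's standard `urllib.robotparser` module. The sketch below is an illustration only and makes no claim about whether StaticWeb performs such a check. The helper name `is_allowed` and the sample rules are assumptions made up for this example.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt, url, user_agent="*"):
    """Return True if `user_agent` may fetch `url` under the given robots.txt text.

    Hypothetical helper for illustration; not part of StaticWeb.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Sample rules, invented for this example.
rules = """User-agent: *
Disallow: /private/
"""

print(is_allowed(rules, "https://example.com/index.html"))
print(is_allowed(rules, "https://example.com/private/data.html"))
```

Here the rules are passed in as text so the example runs offline. Against a live site, `RobotFileParser.set_url()` followed by `read()` downloads the file instead.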

Download files

Source Distribution

staticweb-0.7.0.tar.gz (3.5 kB)

Uploaded Source

Built Distribution

staticweb-0.7.0-py3-none-any.whl (4.0 kB)

Uploaded Python 3

File details

Details for the file staticweb-0.7.0.tar.gz.

File metadata

  • Download URL: staticweb-0.7.0.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for staticweb-0.7.0.tar.gz
Algorithm Hash digest
SHA256 1acce016629ee8e81c1f5fa4bd420b6559a83f1811bc20078a768f6e45959326
MD5 c27535b35684260f7146981c62f46673
BLAKE2b-256 03ff8a1870d6ec10414fd0b6952014e1aea8b91a79ee965d3104ffcd0b7cdcbf
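
A downloaded file can be compared against its published SHA256 digest with Python's `hashlib`. The following is a minimal sketch. The helper names `sha256_of` and `matches` are hypothetical, and the demo hashes a small temporary file instead of the real archive so that it runs anywhere.

```python
import hashlib
import os
import tempfile


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """Return True if the file's SHA256 digest equals `expected_hex`."""
    return sha256_of(path) == expected_hex.strip().lower()


# Demo on a temporary file, so no download is needed.
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"abc")
try:
    expected = hashlib.sha256(b"abc").hexdigest()
    print(matches(tmp.name, expected))
finally:
    os.remove(tmp.name)
```

For the real check, pass the path of the downloaded staticweb-0.7.0.tar.gz and the SHA256 value listed above to `matches`.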

File details

Details for the file staticweb-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: staticweb-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 4.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for staticweb-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 50636ca4ede26189d849f15815cf739290d312bfc6f6ba464c4342ea43f7c833
MD5 246ab2df2bbc25823e6af9341b1e9b94
BLAKE2b-256 3862ea3648b5ba9c45d9319eeb7e5f6510800659fbf0007711ea322f43535210
