Skip to main content

Recursively download files from a web page, with specified extensions, listed in <a> elements. The list of files can be reviewed first.

Project description

Download_files

Usage

One liner

uvx download_files download https://www.example.com .jpg .png

With review step

uvx download_files search https://www.example.com .jpg .png > files.txt

uvx download_files download https://www.example.com --files files.txt

Overview

Download_files parses the anchor elements in an html page (<a/> elements). If another folder is found in the href, the link is followed and any page found is parsed recursively. If the last part of the href matches a specified file extension, it is printed, for piping to the download list (if using search) or downloaded later.

No special efforts are made to circumvent rate limits. The code does not magically allow files to be downloaded, that you do not already have permission to download some other way.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

download_files-0.1.0.tar.gz (4.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

download_files-0.1.0-py3-none-any.whl (5.7 kB view details)

Uploaded Python 3

File details

Details for the file download_files-0.1.0.tar.gz.

File metadata

  • Download URL: download_files-0.1.0.tar.gz
  • Upload date:
  • Size: 4.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.11

File hashes

Hashes for download_files-0.1.0.tar.gz
Algorithm Hash digest
SHA256 814a742d4b556c57f7a317b850f7a1d1d8755a7051b3b428d77d93cbfb71f2f9
MD5 26ec6e7df6ed717bed67044c682c9401
BLAKE2b-256 9840451bca1df8d3b9e6b6d97d655299c249dc72f27a738c30592c1cbdc3cada

See more details on using hashes here.

File details

Details for the file download_files-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for download_files-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 33686b8f207217332154704e1930aeff030464a8807806fa56b3783daa33654f
MD5 03782696cb74b7dc3de0c135fa416aa9
BLAKE2b-256 653afa74fd6403df3ae40b7dec032860877061023d9923c21c2164d03547793a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page