An interactive tool for downloading URLs

URLoad

URLoad is an interactive CLI tool for scraping websites, designed to make extracting and managing web data simple and efficient.

Features

  • Interactive command-line interface for scraping and processing URLs
  • Modular commands for extracting links, images, and other web content
  • Easy-to-use options for saving, sorting, and filtering results
  • Configurable settings saved in urload.toml
  • Extensible architecture for adding new commands

Installation

pip install --user urload

Usage

Start the interactive CLI:

$ urload
Welcome to URLoad! Type 'help' for commands.
Current session directory: 0000
URLoad (0) >

Common Commands

  • add <url>: Add a URL to the current list
  • list: List all URLs
  • get: Fetch all URLs in the current list
  • img <url>: Extract image links from the current list
  • href <url>: Extract hyperlinks from the current list
  • save <filename>: Save the current URL list to a file
  • load <filename>: Load the URL list from a file
  • sort: Sort the URL list alphabetically
  • uniq: Remove duplicate URLs
  • help: Show help for commands
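As a rough illustration of what commands such as href, img, uniq, and sort do conceptually, the following standalone Python sketch extracts links from an HTML page, removes duplicates, and sorts the result. It is not URLoad's implementation: the names LinkExtractor, extract, and uniq, the sample page, and the choice to keep first-seen order when removing duplicates are all assumptions made for this example, and it uses only the Python standard library.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkExtractor(HTMLParser):
    """Hypothetical helper: collect <a href> and <img src> targets."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.hrefs = []
        self.images = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        # Resolve relative links against the page URL so results are absolute.
        if tag == "a" and attr_map.get("href"):
            self.hrefs.append(urljoin(self.base_url, attr_map["href"]))
        elif tag == "img" and attr_map.get("src"):
            self.images.append(urljoin(self.base_url, attr_map["src"]))


def extract(html, base_url):
    """Return (hyperlinks, image links) found in the given HTML text."""
    parser = LinkExtractor(base_url)
    parser.feed(html)
    parser.close()
    return parser.hrefs, parser.images


def uniq(urls):
    """Drop duplicate URLs, keeping the first occurrence of each."""
    return list(dict.fromkeys(urls))


# Sample page, invented for this example.
PAGE = """
<a href="/b.html">B</a>
<a href="https://example.com/a.html">A</a>
<a href="/b.html">B again</a>
<img src="logo.png">
"""

hrefs, images = extract(PAGE, "https://example.com/index.html")
print(sorted(uniq(hrefs)))
print(images)
```

In URLoad itself these steps are separate interactive commands; use help <command> to see how each one actually behaves.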

All commands can be explored interactively.

  • Use help to list available commands.
  • Use help <command> to get detailed help for a specific command.

Development

The project uses uv for dependency management.

To run the project locally, clone the repository and start it with uv:

git clone https://github.com/JohnStrunk/URLoad.git
cd URLoad
uv run urload

Testing

To run all tests and code checks:

./hack/check.sh

This runs linting, formatting checks, and all unit tests.

License

SPDX-License-Identifier: AGPL-3.0-or-later

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.
