Skip to main content

Download utility for thetrove.is

Project description

thetrove-downloader

version_pypi version_gitlab version_python

issues_gitlab

This Python package allows to easily download any resource, be it folder or file, from The Trove.is.

Installation and Requirements

The program requires Python 3.9 or above.

To install the program it is sufficient to use Python pip and get the package thetrove-downloader. The package will add a new command thetrove-downloader.

python3 -m pip install thetrove-downloader

Updates should be performed via pip.

python3 -m pip install --upgrade thetrove-downloader

Usage

thetrove-downloader [-h] [-t, --target TARGET] [--target-url URL]
                    [-j, --json JSON] [-f, --folder FOLDER]
                    [-o, --output OUTPUT] [-b, --blacklist BLACKLIST]
                    [-w, --whitelist WHITELIST] [--no-download]
                    [--no-preserve-root]
  • -h, --help show help message

  • -t, --target <TARGET> download target (folder or file)
    To download a specific resource, start the program with the --target command and supply a valid path (e.g., " Books/White Star").

  • --target-url download target (full URL to folder or file)
    Overrides --target argument with a full URL and sets the download target to the path of the URL ( e.g., "https://thetrove.is/Open%20Source/BIND%20RPG" to "Open Source/BIND RPG").

  • -j, --json <JSON> save/read instructions from a JSON file
    The --json option allows to specify a file to save the target, folder, output, whitelist, and blacklist instructions in JSON format. Launching the program with only the --json option will load the instructions previously saved in the file and perform all of them in succession. This can be used to keep a local copy of specific resources up to date.

  • -f, --folder <FOLDER> destination folder
    The --folder option allows to specify a folder in which the targeted resource will be downloaded.

  • -o, --output <OUTPUT> output name of download target
    The --output option allows to override the folder/file name of the resource. Note that if the targeted resource is a folder, the option will only apply to the top-level folder and leave all contents therein unaltered.

  • -b, --blacklist <BLACKLIST> regex blacklist for files/folders
    The --whitelist option allows to specify a regex pattern that will be used to skip files and folder that do not contain the supplied pattern. Overwrites blacklist options.

  • -w, --whitelist <WHITELIST> regex whitelist for files/folders, overrides blacklist
    The --whitelist option allows to specify a regex pattern that will be used to skip files and folder that do not contain the supplied pattern. Overwrites blacklist options.

  • --no-download list content without downloading
    The --no-download flag allows to list the content under the supplied target without performing any download. The instructions will still be saved in a JSON file if the --json options is used, but not the flag.

  • --no-preserve-root allows the download of root folders
    The --no-preserve-root flag allows downloading entire root folders such as "/Books/". This flag is not preserved with the --json option and must be passed manually every time.

NOTE: the program requires to use either --target, --json, or both --target and --json. Without one of these combinations the program won't have a valid download instruction.

Notes on Crawling

The program performs all GET requests without throttling or delay and with no disallowed paths as per the instructions at thetrove.is/robots.txt.

However, please do take care to not overuse this utility and risk overwhelming their connection. Thank you.

Root folders such as "/Books" are protected and cannot be downloaded without using the --no-preserve-root flag.

Contributing

All contributions and suggestions are welcome!

If you have suggestions for fixes or improvements, you can open an issue with your idea, see #Issues for details.

Issues

If you encounter any problem during usage of the program, an issue can be opened on the project's page on GitLab.

When opening an issue for a problem, please copy the error message – if any – and describe the operation in progress when the error occurred.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thetrove-downloader-1.1.0.tar.gz (11.0 kB view details)

Uploaded Source

Built Distribution

thetrove_downloader-1.1.0-py3-none-any.whl (11.2 kB view details)

Uploaded Python 3

File details

Details for the file thetrove-downloader-1.1.0.tar.gz.

File metadata

  • Download URL: thetrove-downloader-1.1.0.tar.gz
  • Upload date:
  • Size: 11.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.1.6 CPython/3.9.5 Darwin/20.5.0

File hashes

Hashes for thetrove-downloader-1.1.0.tar.gz
Algorithm Hash digest
SHA256 b52348b7ad33264a9c228a69724f91dc2a7a347ba3f06e360a22767c72096d19
MD5 ea40c9e9ece93ab70b0b944e1361f2c2
BLAKE2b-256 9342ae46b73a812c7e4e8139c136d58b45dae157acb5c9c09edd58879222b45c

See more details on using hashes here.

File details

Details for the file thetrove_downloader-1.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for thetrove_downloader-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5e4c3c6dc7c70ee0af2ffaca14285e87536474d1f197f7920aa38b0e9a4e3801
MD5 97c28f2afc59d9eeae8d43b5e9de600b
BLAKE2b-256 b7b531af17a7e50a6f2595fa38c55166354ed5955e38c9c30f81a6aa92abe118

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page