Download utility for thetrove.is
Project description
thetrove-downloader
This Python package allows to easily download any resource, be it folder or file, from The Trove.is.
Installation and Requirements
The program requires Python 3.9 or above.
To install the program it is sufficient to use Python pip and get the package thetrove-downloader. The package will
add a new command thetrove-downloader.
python3 -m pip install thetrove-downloader
Updates should be performed via pip.
python3 -m pip install --upgrade thetrove-downloader
Usage
thetrove-downloader [-h] [-t, --target TARGET] [--target-url URL]
[-j, --json JSON] [-f, --folder FOLDER]
[-o, --output OUTPUT] [-b, --blacklist BLACKLIST]
[-w, --whitelist WHITELIST] [--no-download]
[--no-preserve-root]
-
-h, --helpshow help message -
-t, --target <TARGET>download target (folder or file)
To download a specific resource, start the program with the--targetcommand and supply a valid path (e.g., " Books/White Star"). -
--target-urldownload target (full URL to folder or file)
Overrides--targetargument with a full URL and sets the download target to the path of the URL ( e.g., "https://thetrove.is/Open%20Source/BIND%20RPG" to "Open Source/BIND RPG"). -
-j, --json <JSON>save/read instructions from a JSON file
The--jsonoption allows to specify a file to save the target, folder, output, whitelist, and blacklist instructions in JSON format. Launching the program with only the--jsonoption will load the instructions previously saved in the file and perform all of them in succession. This can be used to keep a local copy of specific resources up to date. -
-f, --folder <FOLDER>destination folder
The--folderoption allows to specify a folder in which the targeted resource will be downloaded. -
-o, --output <OUTPUT>output name of download target
The--outputoption allows to override the folder/file name of the resource. Note that if the targeted resource is a folder, the option will only apply to the top-level folder and leave all contents therein unaltered. -
-b, --blacklist <BLACKLIST>regex blacklist for files/folders
The--whitelistoption allows to specify a regex pattern that will be used to skip files and folder that do not contain the supplied pattern. Overwrites blacklist options. -
-w, --whitelist <WHITELIST>regex whitelist for files/folders, overrides blacklist
The--whitelistoption allows to specify a regex pattern that will be used to skip files and folder that do not contain the supplied pattern. Overwrites blacklist options. -
--no-downloadlist content without downloading
The--no-downloadflag allows to list the content under the supplied target without performing any download. The instructions will still be saved in a JSON file if the--jsonoptions is used, but not the flag. -
--no-preserve-rootallows the download of root folders
The--no-preserve-rootflag allows downloading entire root folders such as "/Books/". This flag is not preserved with the--jsonoption and must be passed manually every time.
NOTE: the program requires to use either --target, --json, or both --target and --json. Without one of
these combinations the program won't have a valid download instruction.
Notes on Crawling
The program performs all GET requests without throttling or delay and with no disallowed paths as per the instructions at thetrove.is/robots.txt.
However, please do take care to not overuse this utility and risk overwhelming their connection. Thank you.
Root folders such as "/Books" are protected and cannot be downloaded without using the --no-preserve-root flag.
Contributing
All contributions and suggestions are welcome!
If you have suggestions for fixes or improvements, you can open an issue with your idea, see #Issues for details.
Issues
If you encounter any problem during usage of the program, an issue can be opened on the project's page on GitLab.
When opening an issue for a problem, please copy the error message – if any – and describe the operation in progress when the error occurred.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file thetrove-downloader-1.1.0.tar.gz.
File metadata
- Download URL: thetrove-downloader-1.1.0.tar.gz
- Upload date:
- Size: 11.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.6 CPython/3.9.5 Darwin/20.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b52348b7ad33264a9c228a69724f91dc2a7a347ba3f06e360a22767c72096d19
|
|
| MD5 |
ea40c9e9ece93ab70b0b944e1361f2c2
|
|
| BLAKE2b-256 |
9342ae46b73a812c7e4e8139c136d58b45dae157acb5c9c09edd58879222b45c
|
File details
Details for the file thetrove_downloader-1.1.0-py3-none-any.whl.
File metadata
- Download URL: thetrove_downloader-1.1.0-py3-none-any.whl
- Upload date:
- Size: 11.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.6 CPython/3.9.5 Darwin/20.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5e4c3c6dc7c70ee0af2ffaca14285e87536474d1f197f7920aa38b0e9a4e3801
|
|
| MD5 |
97c28f2afc59d9eeae8d43b5e9de600b
|
|
| BLAKE2b-256 |
b7b531af17a7e50a6f2595fa38c55166354ed5955e38c9c30f81a6aa92abe118
|