A Python implementation of VRipper
Project description
vripper
A Python implementation of VRipper. See the list of supported websites below.
Installation
# Python 3.6+
pip install vripper
Usage
from vripper import download_thread
if should_disable_logging:
logging.getLogger("vripper").setLevel(logging.CRITICAL)
thread_url = "https://..."
dest = os.path.join(".", "vripper")
download_thread(thread_url, dest, **options)
Options
Not in any particular order
Name | Type | Default Value | Description |
---|---|---|---|
mode |
vripper.enum .downloadmode .DownloadMode |
FIRST_POST_ONLY |
Determines which post(s) within the thread to download. |
output_format |
vripper.enum .outputformat .OutputFormat |
None | The desired output format. Currently supported: zip |
download_threadpool_size |
int |
5 | The number of concurrent threads for the download. The minimum allowed value is 1. |
processing_pool_size |
int |
0 | The number of concurrent processes for the pre/post-processing. Set it to 0 to disable multiprocessing. |
max_filecount |
int |
None | The max number of files (images) allowed in a post. If the number of files exceeds the given value, the module skip a subset of images. Ex. Download every 3rd image. |
min_dimension |
int (pixels) |
0 | The min dimension allowed for an image. Images whose dimensions are smaller than this value will be deleted. |
max_dimension |
int (pixels) |
None | The max dimension allowed for an image. If the downloaded image exceeds the given value, the image will be resized. |
max_filesize |
int (bytes) |
None | Any files larger than this threshold will be deleted. |
min_filesize |
int (bytes) |
None | Any files smaller than this threshold will be deleted. |
acceptable_filesize |
int (bytes) |
None | Any files smaller than this threshold will not be considered for resize/compression. Expressed in bytes. |
min_hitrate |
float |
.85 | A minimum percentage of files in a post required to be downloaded and processed successfully. |
download_connect_timeout |
float (seconds) |
1.0 | The time it takes to abort the connection with the image host. |
download_read_timeout |
float (seconds) |
10.0 | The time it takes to abort the read with the image host. |
compression_quality |
int |
65 | JPEG compression quality -- more info here. Set it to 0 to disable the compression. The value cannot be 0 if max_dimension has been specified. |
processing_priority |
vripper.enum .processingpriority .ProcessingPriority |
None | Determines the course of action, given two conflicting options. Example TBD |
Exceptions
For each image subject to download, there is a potential for the download to be considered unsuccessful. To name a few reasons:
- Temporary network issues;
- The host going offline temporarily or permanently;
- The image was deleted by the uploader;
- The downloaded file is corrupted; or
- The downloaded file size is too small/big based on the user-specified threshold.
In the end, if the total number of successfully downloaded images does not meet min_hitrae
, the module will throw one of the following exceptions:
Note: If the post has more images than the value specified in max_filecount
, not all images will be subject to download. The skipped images will not be considered for the min_hitrate
calculation.
Name | Calculation Time | Description |
---|---|---|
PermissionError |
Pre-download | Raised when the requested thread is private. |
UnsupportedHostError |
Pre-download | Raised when the number of images hosted by unsupported websites is too high. |
DeprecatedHostError |
Pre-download | Raised when the number of images hosted by deprecated websites is too high. |
ImagePermanentlyUnavailableError |
Post-download | Raised when the number of deleted images is too high. |
InsufficientFileCountError |
Final | Raised when the number of resulting files in a post is too low. |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file vripper-0.5.37.tar.gz
.
File metadata
- Download URL: vripper-0.5.37.tar.gz
- Upload date:
- Size: 17.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1da8924a346a9588e1006d774128f36a1ec4e16e043a30aa0a8e4ff7abf14f45 |
|
MD5 | 22d922c77fb6ac5a1d365d879492edaa |
|
BLAKE2b-256 | 525ff97fce3da46807bd7d9fa2ff44dd4b74ba4d90f995c84945ffc852e2fea8 |
File details
Details for the file vripper-0.5.37-py3-none-any.whl
.
File metadata
- Download URL: vripper-0.5.37-py3-none-any.whl
- Upload date:
- Size: 31.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 77ccebe797fc98ee52153da4a48cf302a081effd982320b1993bf3710be96929 |
|
MD5 | a9bb38df58ee20f2537e3a869776848c |
|
BLAKE2b-256 | 85088c3c9e1934a59bdd016e39a6d422def9f60c2f15f40de641f23b93a76aa1 |