Bulk downloader for booru imageboards with evil intentions
Project description
Bulk downloader for booru imageboards with evil intentions
Jigoku is a CLI tool for downloading content arround the imageboards, seamlessly integrates with popular website like danbooru, rule34, gelbooru, and many more. It's also modular, no matter what kind of links you have, what kind imageboard you want to download from, If still covered by the flow, this tools will adapt.
The Problem
Hell opening fuckton of tabs, and download them one by one. Piece of crap
Welp, It's not tough actually if still same website, there is also tampermonkey hacks to mock them, but what if you want to get 'em from multiple websites?
The Solution
No more tampermonkey hacks, no more opening a fuckton of tabs, no more spliting your note between danbooru, gelbooru, r34 or anything else. Just one command through single file, this tools will adapt.
Jigoku apply connection retry on failed, every your network changes or something else, It will keep waiting.
Features
- Modular bulk download
- 75% tested
- Plenty of booru support
- Connection retry on failure
- Pure scraping, does not hit the API
- Download with ease
- Interactive prompt
Site support
Jigoku supports the following imageboards:
- rule34
- danbooru
- gelbooru
- safebooru
- tbib
- xbooru
- realbooru
- yandere
- lolibooru
- konachan
- konachan.net
- hypnohub
- e621
- e926
- allthefallen
- paheal
Prerequisites
NOTE: Python 3.7 or above |
Jigoku depends on
- requests Python HTTP Client
- beautifulsoup4 HTML Parser
Installation
It's fairly simple to use jigoku
pip install jigoku
- Or manual build by cloning this repository and run
python setup.py install
Usage
Just run without any arguments and interactive prompt will get you,
$ jigoku
-
First prompt will ask you to input your file
- Input your file.txt
-
Second prompt will ask you what kind of links you have
(1)
multiple posts or galleries(2)
multiple pages
-
Third prompt will ask you for image resolution
(1)
original, means the original, big size(2)
sample, means smaller than original
Running tests
Is current state not covers enough? Feel free to add more test cases and submit a pull request.
Multiple posts test: workflows/jigoku_posts.yml
Multiple pages test: workflows/jigoku_pages.yml
Example file input: jigoku/tree/master/test
Phrases
This pattern used for validating posts or galleries
/posts/
danbooru based,&id=
gelbooru based,?id=
gelbooru based,/show
yandere based,/post/view/
paheal based
Otherwise, it will assign to pages
For example, If you confused between (1)
and (2)
, check this out:
https://danbooru.donmai.us/posts/5874589
https://rule34.xxx/index.php?page=post&s=view&id=7004047
- This is post, choose
(1)
- This is post, choose
https://danbooru.donmai.us/posts?tags=1girl&z=1
https://rule34.xxx/index.php?page=post&s=list&tags=1girl
- This is pages, choose
(2)
- This is pages, choose
https://danbooru.donmai.us/posts?tags=kiryuuin_satsuki+&z=5
https://rule34.xxx/index.php?page=post&s=list&tags=kiryuuin_satsuki
- This is pages from tags, choose
(2)
- This is pages from tags, choose
https://danbooru.donmai.us/posts?page=2&tags=kiryuuin_satsuki+
https://rule34.xxx/index.php?page=post&s=list&tags=kiryuuin_satsuki&pid=42
- This is pages from tags with page number, choose
(2)
- This is pages from tags with page number, choose
https://danbooru.donmai.us/posts?tags=hews+&z=5
https://rule34.xxx/index.php?page=post&s=list&tags=belko
- This is pages from artist, choose
(2)
- This is pages from artist, choose
Limitations
You may notice that some part were strictly to "Original" resolution and "Sample" is no use, I'm pretty sure there are some limitations, for example old posts perhaps has different structures, but I haven't found any yet, current test cases has over 1000+ posts and took 20+ minutes, and it works fine.
Legal
This tool can be freely copied, modified, altered, distributed without any attribution whatsoever. However, if you feel like this tool deserves an attribution, mention it. It won't hurt anybody.
Licence: WTF.
Pronunciation
ja_JP
• jigoku — 地獄、じごく, meaning "hell". The other mean if you opening fuckton of rule34 on google chrome tabs, then downloading one by one, Definitely hell and piece of crap
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file jigoku-2.1.8.tar.gz
.
File metadata
- Download URL: jigoku-2.1.8.tar.gz
- Upload date:
- Size: 14.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b8d65c9655c8c08b0e9186671494de1167f108b42ccf76b0f778398907355609 |
|
MD5 | e59ec46c320065bc77fc91108ecdccfc |
|
BLAKE2b-256 | 5625c0f404510037c0c7ac2382213b64179954b44c3b63b98f5605bb2c06728d |