xeHentai Downloader
Project description
E-Hentai Dōjinshi Downloader
TL;DR
Windows users can download packed binaries from here or here. The package is built using PyInstaller.
Or run directly from source code:
pip install -U requests[socks]
git clone https://github.com/fffonion/xeHentai.git
cd xeHentai
python ./setup.py install
xeH
The program is running in non-interactive mode by default. To run interactively, use xeH.py -i
.
For pros
Configuration file
If you are running from source code, please copy xeHentai/config.py
to your current directory first. Use that file as your config file.
The priority of configuration is: Interactive inputs > Command line options > User config.py > Internal config.py.
Configuration keys:
-
daemon Set to run in default mode, can only use on posix-compatible systems. Refer to Running Modes. Default to
False
. -
dir Download directory. Default to current directory.
-
download_ori Set to download original images or not. Default to
False
. -
jpn_title Set to select Japanese title or not. If set to
False
, English or Romaji title will be used. Default toTrue
. -
rename_ori Set to rename images to their orginal names. If set to
False
, image will be named in sequence numbers. Default toFalse
. -
proxy Proxy list. Refer to Proxies.
-
proxy_image Set to use proxy both on downloading images and scanning webpages. Default to
True
. -
proxy_image_only Set to use proxy only on downloading images. Default to
False
. -
rpc_interface RPC server binding IP. Refer to JSON-RPC. Default to
localhost
. -
rpc_port RPC server binding port. Default to
none
(not serving). -
rpc_secret RPC secret key. Default to
None
. -
delete_task_files Set to delete downloaded files when deleting a task. Default to
False
. -
make_archive Set to make a ZIP archive after download and delete downloaded directory. Default to
False
. -
download_range Set image download range. Refer to Download range. Default to download all images.
-
scan_thread_cnt Thread count for scanning webpages. Default to
1
. -
download_thread_cnt Thread count for downloading images. Default to
5
. -
download_timeout Timeout of download images. Default to
10
s. -
ignored_errors Set the error codes to ignore and continue downloading. Default to empty. Error codes can be obtained from const.py.
-
log_path Set log file path. Default to
eh.log
. -
log_verbose Set log level with integer from 1 to 3. Bigger value means more verbose output. Default to
2
. -
save_tasks Set to save uncompleted tasks in
h.json
. Default toFalse
.
Command line options
Usage: xeH [-u USERNAME] [-k KEY] [-c COOKIE] [-i] [--daemon] [-d DIR] [-o]
[-j BOOL] [-r BOOL] [-p PROXY] [--proxy-image | --proxy-image-only]
[--rpc-interface ADDR] [--rpc-port PORT] [--rpc-secret ...]
[--delete-task-files BOOL] [-a BOOL] [--download-range a-b,c-d,e]
[-t N] [--timeout N] [-f] [-l /path/to/eh.log] [-v] [-h]
[--version]
[url [url ...]]
xeHentai Downloader NG
positional arguments:
url gallery url(s) to download
optional arguments:
-u USERNAME, --username USERNAME
username
-k KEY, --key KEY password
-c COOKIE, --cookie COOKIE
cookie string, will be overriden if given -u and -k
-i, --interactive interactive mode, will be ignored in daemon mode
(default: False)
--daemon daemon mode, can't use with -i (default: False)
-d DIR, --dir DIR set download directory (current:
/Users/fffonion/Dev/Python/xeHentai)
-o, --download-ori download original images, needs to login (current:
True)
-j BOOL, --jpn-title BOOL
use Japanese title, use English/Romaji title if turned
off (default: True)
-r BOOL, --rename-ori BOOL
rename gallery image to original name, use sequence
name if turned off (default: False)
-p PROXY, --proxy PROXY
set download proxies, can be used multiple times,
currenlty supported: socks5/4a, http(s), glype.
Proxies are only used on webpages by default (current:
['socks5h://127.0.0.1:16963'])
--proxy-image use proxies on images and webpages (default: True)
--proxy-image-only only use proxies on images, not webpages (current:
False)
--rpc-interface ADDR bind jsonrpc server to this address (current:
localhost)
--rpc-port PORT bind jsonrpc server to this port (default: 8010)
--rpc-secret ... jsonrpc secret string (default: None)
--delete-task-files BOOL
delete downloaded files when deleting a task (default:
True)
-a BOOL, --archive BOOL
make an archive (.zip) after download and delete
directory (default: False)
--download-range a-b,c-d,e
specify ranges of images to be downloaded, in format
start-end, or single index, use comma to concat
multiple ranges, e.g.: 5-10,15,20-25, default to
download all images
-t N, --thread N download threads count (default: 5)
--timeout N set image download timeout (default: 10s)
-f, --force download regardless of quota exceeded warning
(default: False)
-l /path/to/eh.log, --logpath /path/to/eh.log
define log path (current:
/Users/fffonion/Dev/Python/xeHentai/eh.log)
-v, --verbose show more detailed log (default: 3)
-h, --help show this help message and exit
--version show program's version number and exit
If options are not defined, values from config.py
will be used.
JSON-RPC
If rpc_interface
and rpc_port
are set, xeHentai will start a RPC server. The request and response follows the JSON-RPC 2.0 standard.
$ curl localhost:8010/jsonrpc -d '{"jsonrpc": "2.0", "id": 1, "method":"xeH.addTask", "params":[[args],{kwargs}]}'
{"jsonrpc": "2.0", "id": 1, "result": "36df423e"}
rpc_secret
is a secret key to your RPC server. If it's set, client should include this value in the request. For example when rpc_secret
is set to hentai:
$ curl localhost:8010/jsonrpc -d '{"jsonrpc": "2.0", "id": 1, "method":"xeH.addTask", "params":["token:hentai",[args],{kwargs}]}'
{"jsonrpc": "2.0", "id": 1, "result": "36df423e"}
The method filed should start with xeH. and should be a public class method of xeHentai from core.py. And change the name from lower_case_with_underscores notation to lowerCamelCase notation. For example, add_task
becomes addTask
.
Refer to xeHentai class from core.py for parameters list.
If your browser has a Userscript plugin, you can use xeHentaiHelper.user.js to create tasks directly on e-hentai website. Chrome user will need to install Tampermonkey, for firefox Greasemonkey, and ViolentMonkey for Opera and Maxthon users.
Because e-hentai has enabled https, Chrome user will needs to click on the shield icon in the far right of the address bar and click "Load anyway" or "Load unsafe scripts"
Running modes
If xeHentai is ran from command line interface or interative mode, the program will exit after it finishes the tasks in h.json
(if exists) and given URL.
If there's no URL given from command line, the program will exit after it finishes the tasks in h.json
(if exists).
If program is running on daemon mode (-d
is set or daemon
is set to True
), the program will keep running in background.
Proxies
xeHentai supports three types of proxies:
- socks proxy:
socks5h://127.0.0.1:1080
. If you want to resolve DNS on client side, usesocks5://127.0.0.1:1080
. - http(s) proxy:
http://127.0.0.1:8080
. - glype proxy:
http://example.com/browse.php?u=a&b=4
. Please set value ofb
accordingly. glype is a widely used PHP proxy script. When using, uncheck Encrypt URL, Remove Scripts and check Allow Cookies and open a random URL. The paste the address into configuration.
Multiple proxies can be specified at the same time. The format can be like : ['socks5h://127.0.0.1:1080', 'http://127.0.0.1:8080']
.
By default proxies are used to download images and scan webpages. If you don't want to use proxy on downloading images, set proxy_image
to False
.
glype users are encouraged to set proxy_image
to False
。
If you only want to use proxy to download image, set proxy_image_only
to True in config.py
or use the --proxy-image-only
CLI option. If both proxy_image
and proxy_image_only
are set to True, proxy_image
will be ignored.
Download range
Download ranges are set in format start_positoin-end_positoin
. For example, 5-10
means number download first 5 to 10 images, including 5 and 10. Or use 15
to download number 15 only.
Multiple ranges can be seperated with comma. For example,5-10,15
.
If no range is given, xeHentai will download all images.
Misc
Image limit
Downloading images will be count towards image limit. This is calculated regarding the popularity of gallery, the server load and/or Hentai@Home bandwidth by e-hentai server.
License
GPLv3
Changelog
2.020
- 增加RPC帮助函数:
get_info
,get_config
,update_config
,get_image
- 增加通过RPC看图和下载压缩包功能
- 增加
delete_task_files
选项,设置是否删除任务时同时删除下载的文件 - 修复Windows文件夹不能以空格结尾以及文件不能以
.
结尾 - 修复下载时的临时文件在Windows下报错Error 32的问题
2.019
- 增加图片地址解析时的错误处理
- 增加使用流模式下载图片,优化判断丧尸线程
- 修复无法从环境变量中获得LOCALE时的问题
- 修复unichr被当成局部变量的问题
- 修复交互模式在Python3中的一个问题
2.018
- 增加
jpn_title
选项,选择是否使用日语标题 - 增加
download_range
选项,选择下载范围 - 增加
timeout
选项,设置下载图片的超时 - 增加
proxy_image_only
选项,设置仅使用代理下载图片 - 命令行增加
--force
选项,设置忽略配额继续下载; 配置增加ignored_errors
选项,设置忽略的错误码 - 检查下载的图片是否完整
- 识别
socks5h
代理字符串
2.017
- 修复匹配网址的正则表达式
- 修复表站自动转换里站逻辑
- 修复下载图片重试后重命名失败的问题
- 修复原始文件名与自动编号冲突时重命名异常的问题
2.016
- 修复超出配额的判断
- 修复可能会下到评论中的图的问题
- 修复无法匹配安装在根目录glype的代理问题
- 某些错误现在会显示详细信息
- 增加
proxy_image
选项,选择是否使用代理下载图片
2.015
- 显示重命名时的错误
- 修复扩展名中多余的
.
- 修复Windows下文件名的保留字符
<
,>
2.014
- 修复cookie中只有nw判断为已登录的bug
- 登录失败时显示网页上的错误信息
- 交互支持逗号分割多个任务,命令行模式支持同时添加多个任务
- 修复重命名的bug
2.013
- 修复页数>=1000页时抽风的bug
- 原始文件名冲突时自动+1
2.012
- 修复Windows下中文路径的问题
2.011
- 修复每页缩略图数量不是40时下载不完整的bug
- 90秒没有新下载图片则自动结束任务
- 本子包含重复图片时直接复制
2.010
- 图片404时重试
2.009
- 交互模式默认值改为配置中设置的值
2.008
- 跳过Content Warning
2.007
- 修复本子中有重复图时无法自动退出的bug
- 其他稳定性修复
2.006
- 增加
make_archive
, 下载完成后生成zip压缩包并删除下载目录 - 完善reload机制
- 检测IP是否被ban并自动更换代理IP
2.005
- 增加
rpc_secret
i18n/zh_cn
更名为i18n/zh_hans
2.004
- 支持Python3
2.003
- 读取 .ehentai.cookie
- 交互模式不保存任务
- 添加
--rename-ori
参数和配置 - 如果用户配置有问题,从内置配置读取默认值
- 其他更新
2.002
- 支持
glype
代理类型
2.001
- 初始发布
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file xeHentai-2.2.tar.gz
.
File metadata
- Download URL: xeHentai-2.2.tar.gz
- Upload date:
- Size: 45.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.14.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.35.0 CPython/2.7.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fe7699326139daebe5aa50bf0478a8bb437b96706c2f6564e1f8f1ecd5294190 |
|
MD5 | c71075618858c1b75ac2e0c17d076729 |
|
BLAKE2b-256 | d3e69602eb226a38b853264d7a0fc39a07f1a58b826d93c203fb68b0b1be3ea5 |