Novel grab: a crawler module using Python 3 and lxml
Multiprocessing with multithreading version
winxos, AISTLAB Since 2017-02-19
INSTALL:
pip3 install aistlab_novel_grab
1. USAGE:
RUN COMMAND IN CONSOLE:
novel_grab http://the_url_of_novel_chapters_page
EXAMPLE:
novel_grab http://book.zongheng.com/showchapter/654086.html
SUPPORTED SITES:
* http://book.zongheng.com
* http://www.aoyuge.com
* http://www.quanshu.net
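Before handing a URL to the command, it can help to check that its host is one of the sites listed above. The following is a minimal sketch using only the standard library; the names SUPPORTED_HOSTS and is_supported are hypothetical and are not part of the package, whose own validation may work differently.

```python
from urllib.parse import urlparse

# Hosts copied from the SUPPORTED SITES list above (assumed to be matched exactly).
SUPPORTED_HOSTS = {"book.zongheng.com", "www.aoyuge.com", "www.quanshu.net"}


def is_supported(url):
    """Return True if the URL's host is one of the listed sites."""
    host = urlparse(url).hostname  # None when the string has no network location
    return host is not None and host in SUPPORTED_HOSTS


print(is_supported("http://book.zongheng.com/showchapter/654086.html"))  # True
print(is_supported("http://example.com/novel/1.html"))  # False
```

This only compares host names; it does not confirm that the page is actually a chapters page.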
2. USAGE AS PYTHON MODULE:
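from novel_grab.novel_grab import Downloader

d = Downloader()
# Before a URL is set, get_info() reports the supported sites.
print(d.get_info())
# set_url() returns whether the URL is valid; start only if it is.
if d.set_url('http://book.zongheng.com/showchapter/221579.html'):
    d.start()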
**TIPS**
* Right after d = Downloader(), d.get_info() returns information about the supported sites.
* d.set_url(url) returns whether the url is valid.
* You can call d.get_info() at any time to check the state of d.
* When the download finishes, a <novel_name>.zip file is created in the current directory; the default compression method is zipfile.ZIP_DEFLATED.
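To illustrate what writing a zip file with zipfile.ZIP_DEFLATED looks like, here is a minimal standard-library sketch. It is not the package's own code: the function pack_chapters, the file-naming scheme, and the chapter data are all assumptions made up for the example.

```python
import os
import tempfile
import zipfile


def pack_chapters(zip_path, chapters):
    """Write (title, text) pairs into a DEFLATE-compressed zip archive."""
    with zipfile.ZipFile(zip_path, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for index, (title, text) in enumerate(chapters, start=1):
            # Hypothetical naming scheme: zero-padded index keeps chapters in order.
            zf.writestr(f"{index:04d}_{title}.txt", text)


# Demo with made-up chapter data, written to a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "novel_name.zip")
    pack_chapters(path, [("chapter_one", "text " * 100),
                         ("chapter_two", "more text " * 100)])
    with zipfile.ZipFile(path) as zf:
        names = zf.namelist()
        methods = {info.compress_type for info in zf.infolist()}

print(names)
```

ZIP_DEFLATED relies on the zlib module, which ships with standard Python builds.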
For educational purposes only; use at your own risk.
File details
Details for the file AISTLAB_novel_grab-1.2.12.tar.gz.
File metadata
- Download URL: AISTLAB_novel_grab-1.2.12.tar.gz
- Upload date:
- Size: 5.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 369b6530d185ca642ec5ec3824f104e4222c8a2738dc86419cf6529b27a456f2
MD5 | 6e22523a028db92f6bccf50208f8e2d5
BLAKE2b-256 | 6c516ea42eb72b6c75b0e4155469063b02c9d606775672a4f9ab457ef5a262f9
File details
Details for the file AISTLAB_novel_grab-1.2.12-py2.py3-none-any.whl.
File metadata
- Download URL: AISTLAB_novel_grab-1.2.12-py2.py3-none-any.whl
- Upload date:
- Size: 9.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9945cbf8801f0bef2a8004e0b830ffbaea48702e51b30dc93113cdcfdfb93b4d
MD5 | a07db9ce62d4a718ce0bb55719e1914c
BLAKE2b-256 | 76ac2cf2ee7f72ac0a5a09c25924bc2fa58ca02f37f1f1ad1bbe198270cfb717