A novel-grabbing crawler module using Python 3 and lxml

This version combines multiprocessing with multithreading.

winxos, AISTLAB Since 2017-02-19

INSTALL:

    pip3 install aistlab_novel_grab

1. USAGE:

RUN COMMAND IN CONSOLE:

    novel_grab http://the_url_of_novel_chapters_page

EXAMPLE:

    novel_grab http://book.zongheng.com/showchapter/654086.html

SUPPORTED SITES:

* http://book.zongheng.com
* http://www.aoyuge.com
* http://www.quanshu.net
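Before passing a URL to the tool, it can help to check that it points at one of the sites listed above. The following is a minimal sketch of such a check, not the package's own validation logic: `SUPPORTED_HOSTS` and `is_supported` are hypothetical names introduced here for illustration, and the host list is simply copied from the list above.

```python
from urllib.parse import urlparse

# Hosts copied from the SUPPORTED SITES list; this set is an assumption
# for illustration and is not read from the package itself.
SUPPORTED_HOSTS = {"book.zongheng.com", "www.aoyuge.com", "www.quanshu.net"}


def is_supported(url: str) -> bool:
    """Return True if the URL's host is one of the listed sites."""
    host = urlparse(url).hostname  # None when the string has no host part
    return host is not None and host in SUPPORTED_HOSTS


print(is_supported("http://book.zongheng.com/showchapter/654086.html"))
print(is_supported("http://example.com/book/1.html"))
```

The check only compares host names; whether a particular chapter page can actually be parsed is still decided by the downloader.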

2. USAGE AS PYTHON MODULE:

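    from novel_grab.novel_grab import Downloader

    d = Downloader()
    # Before a URL is set, get_info() reports the supported sites.
    print(d.get_info())
    # set_url() returns whether the URL is valid; only start if it is.
    if d.set_url('http://book.zongheng.com/showchapter/221579.html'):
        d.start()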

**TIPS**

* Right after d = Downloader(), d.get_info() returns information about the supported sites.
* d.set_url(url) returns whether the URL is valid.
* You can call d.get_info() at any time to check the state of d.
* When the download finishes, a zip file named after the novel (novel_name.zip) is created in the current directory. The default compression method is zipfile.ZIP_DEFLATED.
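To illustrate what zipfile.ZIP_DEFLATED means for the output archive, here is a small sketch using only the standard library. It does not reproduce the package's internals: `pack_chapters`, the entry naming scheme, and the sample chapters are all assumptions made for the example.

```python
import io
import zipfile


def pack_chapters(chapters, target):
    """Write (title, text) pairs into a DEFLATE-compressed zip archive.

    Hypothetical helper; the entry naming scheme is an assumption.
    """
    with zipfile.ZipFile(target, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for index, (title, text) in enumerate(chapters, start=1):
            # writestr() encodes str data as UTF-8 before compressing it.
            zf.writestr(f"{index:04d}_{title}.txt", text)


# Write to an in-memory buffer so the example leaves no files behind.
buffer = io.BytesIO()
pack_chapters([("chapter one", "Hello"), ("chapter two", "World")], buffer)

# Read the archive back and inspect the entries and their compression type.
with zipfile.ZipFile(buffer) as zf:
    names = zf.namelist()
    methods = {info.compress_type for info in zf.infolist()}

print(names)
print(methods == {zipfile.ZIP_DEFLATED})
```

An archive produced by the downloader can be inspected the same way by passing its path to zipfile.ZipFile instead of the buffer.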

For educational purposes only; use at your own risk.
