Webcomic downloader

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Framework
- Scrapy
Intended Audience
- End Users/Desktop
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Internet :: WWW/HTTP

Project description

webcomix

Description

webcomix is a webcomic downloader that can additionally create a .cbz (Comic Book ZIP) file once downloaded.

Notice

This program is for personal use only. Please be aware that by making the downloaded comics publicly available without the permission of the author, you may be infringing upon various copyrights.

Installation

Dependencies

Python (3.8 or newer)
click
scrapy (Some additional steps might be required to include this package and can be found here)
scrapy-splash
scrapy-fake-useragent
tqdm
Docker (To be able to download JavaScript-dependent websites with -j option)

Process

End user

Install Python 3
Install the command line interface tool with pip install webcomix

Developer

Install Python 3
Clone this repository and open a terminal in its directory
Install poetry with pip install poetry
Download the dependencies by running poetry install
Install pre-commit hooks with pre-commit install

Usage

webcomix [OPTIONS] COMMAND [ARGS]

Global Flags

help

Show the help message and exit.

Version

Show the version number and exit.

Commands

comics

Shows all predefined comics which can be used with the download command.

download

Downloads a predefined comic. Supports the --cbz flag, which creates a .cbz archive of the downloaded comic.

search

Searches for an XPath that can download the whole comic. Supports the --cbz flag, which creates a .cbz archive of the downloaded comic,-s, which verifies only the provided page of the comic, -y, which skips the verification prompt, and -j, which runs the javascript on pages before downloading.

custom

Downloads a user-defined comic. To download a specific comic, you'll need a link to the first page, an XPath expression giving out the link to the next page and an XPath expression giving out the link to the image. More info here. Supports the --cbz flag, which creates a .cbz archive of the downloaded comic, -s, which verifies only the provided page of the comic, and -y, which skips the verification prompt.

Examples

webcomix download xkcd
webcomix search xkcd --start-url=http://xkcd.com/1/
webcomix custom --cbz (You will be prompted about other needed arguments)
webcomix custom xkcd --start-url=http://xkcd.com/1/ --next-page-xpath="//a[@rel='next']/@href" --image-xpath="//div[@id='comic']//img/@src" --cbz (Same as before, but with all arguments declared beforehand)

Making an XPath selector

Using an HTML inspector, spot a html path to the next link's href attribute/comic image's src attribute.

e.g.: //div[@class='foo']/img/@src This will select the src attribute of the first image whose class is: foo

Note: webcomix works best on static websites, since scrapy(the framework we use to travel web pages) doesn't process Javascript.

To make sure your XPath is correct, you have to go into scrapy shell, which should be downloaded when you've installed webcomix.

scrapy shell <website> --> Use the website's url to go to it.
> response.body --> Will give you the html from the website.
> response.xpath --> Test an xpath selection. If you get [], this means your XPath expression hasn't gotten anything from the webpage.

Contribution

The procedure depends on the type of contribution:

If you simply want to request the addition of a comic to the list of supported comics, make an issue with the label "Enhancement".
If you want to request the addition of a feature to the system or a bug fix, make an issue with the appropriate label.

Running the tests

To run the tests, you have to use the pytest command in the webcomix folder.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Framework
- Scrapy
Intended Audience
- End Users/Desktop
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Internet :: WWW/HTTP

Release history Release notifications | RSS feed

This version

3.9.0

Feb 20, 2024

3.8.1

Feb 20, 2024

3.8.0

Feb 19, 2024

3.7.0

Feb 10, 2024

3.6.9

Feb 4, 2024

3.6.8

Oct 29, 2023

3.6.7

Jul 31, 2023

3.6.6

Mar 4, 2023

3.6.5

Jan 31, 2023

3.6.4

Jan 1, 2023

3.6.3

Dec 10, 2022

3.6.2

Nov 13, 2022

3.6.1

May 23, 2022

3.6.0

Oct 10, 2021

3.5.1

Oct 7, 2021

3.5.0

Jul 8, 2021

3.4.2

Jul 7, 2021

3.4.1

Dec 13, 2020

3.4

Nov 14, 2020

3.3.2

Aug 22, 2020

3.3.1

Jul 5, 2020

3.3.0

Jul 4, 2020

3.2.6

Jun 26, 2020

3.2.5

Jun 20, 2020

3.2.3

Mar 23, 2020

3.2.2

Mar 21, 2020

3.2.1

Nov 13, 2019

3.2

Sep 27, 2019

3.1.2

Sep 12, 2019

3.1.1

Aug 13, 2019

3.1

Mar 26, 2019

3.0

Jan 16, 2019

2.1

Aug 15, 2018

2.0

Aug 8, 2018

1.4

Jul 27, 2018

1.3

Apr 12, 2018

1.2

Oct 29, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

webcomix-3.9.0.tar.gz (332.5 kB view hashes)

Uploaded Feb 20, 2024 Source

Built Distribution

webcomix-3.9.0-py3-none-any.whl (343.1 kB view hashes)

Uploaded Feb 20, 2024 Python 3

Hashes for webcomix-3.9.0.tar.gz

Hashes for webcomix-3.9.0.tar.gz
Algorithm	Hash digest
SHA256	`08de548ff5477c5bc4423f677076bfedc27e4d108cd9b6e10d21588addc643b5`
MD5	`975fdd4a96408fd04686ec39f98cab50`
BLAKE2b-256	`48fa9bea485f70ce2459a00d76899fe9c49d445837b15226fb3b2c14203b17c6`

Hashes for webcomix-3.9.0-py3-none-any.whl

Hashes for webcomix-3.9.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`999e72886de7071f0996aaa4ac612b6b5cbb0c5308128d2bd1bc10f861f0a773`
MD5	`0894bc4fa1953149537a8798bc698822`
BLAKE2b-256	`4eadde1e433e4c0b508223ab4e16b627d64676f7c9527b987800ca87f9b7ad81`

webcomix 3.9.0

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

webcomix

Description

Notice

Installation

Dependencies

Process

End user

Developer

Usage

Global Flags

help

Version

Commands

comics

download

search

custom

Examples

Making an XPath selector

Contribution

Running the tests

Project details

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution