Skip to main content

Webcomic downloader

Project description

webcomix

BuildCoverage StatusPyPI version

Description

webcomix is a webcomic downloader that can additionally create a .cbz (Comic Book ZIP) file once downloaded.

Notice

This program is for personal use only. Please be aware that by making the downloaded comics publicly available without the permission of the author, you may be infringing upon various copyrights.

Installation

Dependencies

  • Python (3.9 or newer)
  • click
  • scrapy (Some additional steps might be required to include this package and can be found here)
  • scrapy-splash
  • scrapy-fake-useragent
  • tqdm
  • Docker (To be able to download JavaScript-dependent websites with -j option)

Process

End user

  1. Install Python 3
  2. Install the command line interface tool with pip install webcomix

Developer

  1. Install Python 3
  2. Clone this repository and open a terminal in its directory
  3. Install poetry with pip install poetry
  4. Download the dependencies by running poetry install
  5. Install pre-commit hooks with pre-commit install

Usage

webcomix [OPTIONS] COMMAND [ARGS]

Global Flags

help

Show the help message and exit.

Version

Show the version number and exit.

Commands

comics

Shows all predefined comics which can be used with the download command.

download

Downloads a predefined comic. Supports the --cbz flag, which creates a .cbz archive of the downloaded comic.

search

Searches for an XPath that can download the whole comic. Supports the --cbz flag, which creates a .cbz archive of the downloaded comic,-s, which verifies only the provided page of the comic, -y, which skips the verification prompt, and -j, which runs the javascript on pages before downloading.

custom

Downloads a user-defined comic. To download a specific comic, you'll need a link to the first page, an XPath expression giving out the link to the next page and an XPath expression giving out the link to the image. More info here. Supports the --cbz flag, which creates a .cbz archive of the downloaded comic, -s, which verifies only the provided page of the comic, and -y, which skips the verification prompt.

Examples

  • webcomix download xkcd
  • webcomix search xkcd --start-url=http://xkcd.com/1/
  • webcomix custom --cbz (You will be prompted about other needed arguments)
  • webcomix custom xkcd --start-url=http://xkcd.com/1/ --next-page-xpath="//a[@rel='next']/@href" --image-xpath="//div[@id='comic']//img/@src" --cbz (Same as before, but with all arguments declared beforehand)

Making an XPath selector

Using an HTML inspector, spot a html path to the next link's href attribute/comic image's src attribute.

e.g.: //div[@class='foo']/img/@src This will select the src attribute of the first image whose class is: foo

Note: webcomix works best on static websites, since scrapy(the framework we use to travel web pages) doesn't process Javascript.

To make sure your XPath is correct, you have to go into scrapy shell, which should be downloaded when you've installed webcomix.

scrapy shell <website> --> Use the website's url to go to it.
> response.body --> Will give you the html from the website.
> response.xpath --> Test an xpath selection. If you get [], this means your XPath expression hasn't gotten anything from the webpage.

Contribution

The procedure depends on the type of contribution:

  • If you simply want to request the addition of a comic to the list of supported comics, make an issue with the label "Enhancement".
  • If you want to request the addition of a feature to the system or a bug fix, make an issue with the appropriate label.

Running the tests

To run the tests, you have to use the pytest command in the webcomix folder.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

webcomix-3.12.0.tar.gz (332.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

webcomix-3.12.0-py3-none-any.whl (345.9 kB view details)

Uploaded Python 3

File details

Details for the file webcomix-3.12.0.tar.gz.

File metadata

  • Download URL: webcomix-3.12.0.tar.gz
  • Upload date:
  • Size: 332.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for webcomix-3.12.0.tar.gz
Algorithm Hash digest
SHA256 34c40e88fbad529d2e7e75114c6a7e5cbf5707b19e0151f14911f7d607458fb3
MD5 72813c85ec7af04170e7662b98556764
BLAKE2b-256 053611e04b5cd345428d4ba2850d65f0f662fd4a31f0b91a7872cdc109c64b36

See more details on using hashes here.

Provenance

The following attestation bundles were made for webcomix-3.12.0.tar.gz:

Publisher: build.yml on J-CPelletier/webcomix

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file webcomix-3.12.0-py3-none-any.whl.

File metadata

  • Download URL: webcomix-3.12.0-py3-none-any.whl
  • Upload date:
  • Size: 345.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for webcomix-3.12.0-py3-none-any.whl
Algorithm Hash digest
SHA256 75975b992fbce26ffb0b5670f81c4ac03f56cafc82dd43099f2395ff352cd8c2
MD5 7d13333bcdcabbfa0a625f1a40c84aa1
BLAKE2b-256 108a96323ef51583b7c3c109b62dd924a79195aa4dd06b6f07a0ee9319b43e21

See more details on using hashes here.

Provenance

The following attestation bundles were made for webcomix-3.12.0-py3-none-any.whl:

Publisher: build.yml on J-CPelletier/webcomix

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page