Skip to main content

Crawler for finding manga to read.

Project description

Simple crawler for finding new chapters to read.

  1. Reads mangalist file for supported list site.

  2. Finds all of the mangas that has 5 or more new chapters from supported manga sites.

  3. Outputs .csv file with all of the needed data.

Installing and usage

Installing

  1. Install python3 and pip.

  2. Install with pip or Build from source.

    • Install with pip:

      • pip install MangaCrawler

    • Build from source:

      • git clone https://github.com/SanteriHetekivi/MangaCrawler.git

      • cd MangaCrawler/

      • python setup.py install

Usage

  1. Run the program with arguments

    • For example: MangaCrawler -f new -s mangafox -v -c 5 -m manga.xml

  2. Will output result to CSV file. (HTML output is under development)

Arguments

Short

Long

Descripti on

-v

Print verbose status messages.

-h

–help

Print Help message.

-f

–find

Manga find mode to use. (new or updated) (-f upd ated)

-s

–site

Manga site to use. (-s man gafox)

-c

–min-c hapter s

Minimum amount of new chapters that manga has to have before it gets to output csv. (-c 5 )

-m

–manga -xml-f ile

MyAnimeLi st mangalist xml file. (-m man ga.xml) If NOT given and current directory has XML files, program gives a opportuni ty to choose from them.

-a

–azure -accou nt-key

API key for Bing Web Search API.

-o

–outpu t-file

File to put output on. If NOT given will make file to current directory .

Deployment information

Deploying

  1. Install python3 and pip

  2. Run command to build dependencies. pip install -r requirements.txt

  3. Run the program

    • With output python manga-crawler.py -f updated -s mangafox -v

    • Without output python manga-crawler.py -f updated -s mangafox

Support and licenses

Supported List Sites

Supported Manga Sites

Credits and license

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

MangaCrawler-1.0.0a3-py3.6.egg (51.2 kB view details)

Uploaded Egg

File details

Details for the file MangaCrawler-1.0.0a3-py3.6.egg.

File metadata

File hashes

Hashes for MangaCrawler-1.0.0a3-py3.6.egg
Algorithm Hash digest
SHA256 92257d0169acc7b8f1a7e50f7edb4cea8ec179229ec02785a07b3dc80023e473
MD5 4d4a840c51ef41c4b5e34c9e9c66e2f9
BLAKE2b-256 27870ee81fa095076fc384e43ca5f2bb118462bed064567a84cdc1d1ae62032b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page