Skip to main content

Download books from bookwalker.jp

Project description

Unpolished bookwalker scraper

It just works. -- Todd Howard

TODO List

  • Image EPUB export
  • OCR integration
    • OCR text EPUB export
    • OCR to database

Usage

Installation

Requires Chrome/Chromium to be installed. For Chromium, you need to modify "browser" in config.json to "chromium".

pip install -U poetry
cd fuckBookWalker
poetry install

Running

poetry run python bookphucker <url or uuid of books>

You should see something like this. sample

Configuration

wip...

By default, bookphucker will try to reuse previous cookies, using --no-cache to clear cookies.

Common Issues

Cannot log in

You may encounter CAPTCHA during the login process.

bookphucker will ask you to use non-headless mode to pass the captcha if your config sets headless to true.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fuckbookwalker-0.1.3.tar.gz (10.4 kB view hashes)

Uploaded Source

Built Distribution

fuckbookwalker-0.1.3-py3-none-any.whl (13.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page