Download books from bookwalker.jp
Project description
Unpolished bookwalker scraper
It just works. -- Todd Howard
TODO List
- Image
EPUB
export -
OCR
integration- OCR text
EPUB
export - OCR to database
- OCR text
Usage
Installation
Requires Chrome/Chromium to be installed. For Chromium, you need to modify "browser"
in config.json
to "chromium"
.
pip install -U poetry
cd fuckBookWalker
poetry install
Running
poetry run python bookphucker <url or uuid of books>
You should see something like this.
Configuration
wip...
By default, bookphucker
will try to reuse previous cookies
, using --no-cache
to clear cookies
.
Common Issues
Cannot log in
You may encounter CAPTCHA during the login process.
bookphucker
will ask you to use non-headless mode to pass the captcha if your config sets headless
to true
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
fuckbookwalker-0.1.3.tar.gz
(10.4 kB
view hashes)
Built Distribution
Close
Hashes for fuckbookwalker-0.1.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f605468b5e870d53012901e6df24b0d1974ac408be553c01e2ee30ec7459a70b |
|
MD5 | 214512f4e6d423a429a55b907efd1cd2 |
|
BLAKE2b-256 | 2eae0f1b8ef8b9f10a7358f7f366462bca17de678b2a548d841e8372fbe8ee08 |