Project Gutemberg ZIM creator for Offline Use
Project description
A scraper that downloads the whole repository of [Project Gutenberg] (http://www.gutenberg.org) and puts it into a locally browsable directory and then in a ZIM file (http://www.openzim.org), a clean and user friendly format for storing content for offline usage.
Dependencies
Ubuntu/debian
python-pip python-dev libxml2-dev libxslt-dev advancecomp jpegoptim pngquant p7zip-full gifsicle
macOS
brew install advancecomp jpegoptim pngquant p7zip gifsicle
Usage
gutenberg2zim
By default (no argument), it runs all the steps: download, parse, export and zim.
-h --help Display this help message -y --wipe-db Do not wipe the DB during parse stage -F --force Redo step even if target already exist -l --languages=<list> Comma-separated list of lang codes to filter export to (preferably ISO 639-1, else ISO 639-3) -f --formats=<list> Comma-separated list of formats to filter export to (epub, html, pdf, all) -m --mirror=<url> Use URL as base for all downloads. -r --rdf-folder=<folder> Don't download rdf-files.tar.bz2 and use extracted folder instead -e --static-folder=<folder> Use-as/Write-to this folder static HTML -z --zim-file=<file> Write ZIM into this file path -t --zim-title=<title> Set ZIM title -n --zim-desc=<description> Set ZIM description -d --dl-folder=<folder> Folder to use/write-to downloaded ebooks -u --rdf-url=<url> Alternative rdf-files.tar.bz2 URL -b --books=<ids> Execute the processes for specific books, separated by commas, or dashes for intervals -c --concurrency=<nb> Number of concurrent process for download and parsing tasks -x --zim-title=<title> Custom title for the ZIM file -q --zim-desc=<desc> Custom description for the ZIM file --check Check dependencies --prepare Download & extract rdf-files.tar.bz2 --parse Parse all RDF files and fill-up the DB --download Download ebooks based on filters --export Export downloaded content to zim-friendly static HTML --dev Exports *just* Home+JS+CSS files (overwritten by --zim step) --zim Create a ZIM file
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Filename, size | File type | Python version | Upload date | Hashes |
---|---|---|---|---|
Filename, size gutenberg2zim-1.1.5-py3-none-any.whl (1.8 MB) | File type Wheel | Python version py3 | Upload date | Hashes View |
Filename, size gutenberg2zim-1.1.5.tar.gz (1.7 MB) | File type Source | Python version None | Upload date | Hashes View |
Close
Hashes for gutenberg2zim-1.1.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 90ba4709a8da07aba3352282fff06d21a750d265c1cd214d00751e513062724e |
|
MD5 | 8457f339315ba592aaaa9a8192d72ade |
|
BLAKE2-256 | c487fa59cef89296dae6eae188fe89936ea473e7a0fda1ca222ba7edcc132af6 |