Skip to main content

Parse ebooks, extracts chapters and contents.

Project description

EpubParser

Parses epub files (wrapper around ebooklib package). Extract chapter titles and their corresponding texts. Can also extract the cover image.

Installation

You can install epubparser via pip:

pip install epubparser

Usage

epubparser input.epub output.txt 

You can apply some arguments:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The arguments above may not be perfect, since it depends on regex an language.

--extract-cover extracts cover to covers directory. If this argument is passed, output file must be specified as None

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epubparser-0.1.7.tar.gz (6.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epubparser-0.1.7-py3-none-any.whl (7.1 kB view details)

Uploaded Python 3

File details

Details for the file epubparser-0.1.7.tar.gz.

File metadata

  • Download URL: epubparser-0.1.7.tar.gz
  • Upload date:
  • Size: 6.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.7.tar.gz
Algorithm Hash digest
SHA256 32fc79e63a6bca938d4331071ea26502c009f33ff3e0009e7bbd74cd9775df26
MD5 6943c05c139c5cd0300a3e4df0bf3532
BLAKE2b-256 694330fb44599071bdbd28e45e52918be2dc4fb4984b3ca7dd9ea9e8894bccd5

See more details on using hashes here.

File details

Details for the file epubparser-0.1.7-py3-none-any.whl.

File metadata

  • Download URL: epubparser-0.1.7-py3-none-any.whl
  • Upload date:
  • Size: 7.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.7-py3-none-any.whl
Algorithm Hash digest
SHA256 817c1a10e4f48be61d8ec29042661e5e5f8d415fc4212944eb9d0bd1281cc81e
MD5 1c9a6bdafd14deae9a008e9b0f53566c
BLAKE2b-256 b68a1324ffd3bc5522a49d9aef93ecf76da98a090a3e44f539ba3a09325ac54b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page