Skip to main content

Parse ebooks, extracts chapters and contents.

Project description

EpubParser

Parses epub files (wrapper around ebooklib package). Extract chapter titles and their corresponding texts. Can also extract the cover image.

Installation

You can install epubparser via pip:

pip install epubparser

Usage

epubparser input.epub output.txt 

You can apply some arguments:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The arguments above may not be perfect, since it depends on regex an language.

--extract-cover extracts cover to covers directory. If this argument is passed, output file must be specified as None

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epubparser-0.1.6.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epubparser-0.1.6-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file epubparser-0.1.6.tar.gz.

File metadata

  • Download URL: epubparser-0.1.6.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.6.tar.gz
Algorithm Hash digest
SHA256 b482d8a7a8814f6be353fad5adedb166082ebda4cb1c83a6e6e494bf1a8151fe
MD5 db476b044b47e4a62d09dceb10c63cd2
BLAKE2b-256 5654ed832eb285956b14e1a3db39bf33cb3be4dd63c01f23554bdb25b1538046

See more details on using hashes here.

File details

Details for the file epubparser-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: epubparser-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 272805702a41221baf119dca6439743a71d10c9ca22492a4b6e14296c7c16485
MD5 7ced20ca14b9915ef86ccd1e0eabe627
BLAKE2b-256 443f04be09444f91a5f4a568a7241b5fa72f4944510bd6a837d2e7d28ba28c6f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page