Skip to main content

Parse ebooks, extracts chapters and contents.

Project description

EpubParser

Parses epub files (wrapper around ebooklib package). Extract chapter titles and their corresponding texts. Can also extract the cover image.

Installation

You can install epubparser via pip:

pip install epubparser

Usage

epubparser input.epub output.txt 

You can apply some arguments:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The arguments above may not be perfect, since it depends on regex an language.

--extract-cover extracts cover to covers directory. If this argument is passed, output file must be specified as None

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epubparser-0.1.5.tar.gz (6.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epubparser-0.1.5-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file epubparser-0.1.5.tar.gz.

File metadata

  • Download URL: epubparser-0.1.5.tar.gz
  • Upload date:
  • Size: 6.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.5.tar.gz
Algorithm Hash digest
SHA256 076e588fac00acf44af986cfe6910ec20bce6cbf89abc4650a406e4fdc804fc9
MD5 e7e124c12469eb054ad42f17c8f66768
BLAKE2b-256 b90f12dcf655c020c473447f9df8fe4a00348405901c40f21a6f11d2da218a83

See more details on using hashes here.

File details

Details for the file epubparser-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: epubparser-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 6.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e2edd2f275d852c4dcde169c4ce7c2e88897893345a3c2e611626ffa27b1ef9c
MD5 11bf01d01e4983e2b8b5599656505474
BLAKE2b-256 574ac9af26dd87873cfbb4fa23272b6a3c12c8caf24969e8d787459fc3c8cd37

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page