Skip to main content

Parse ebooks, extracts chapters and contents.

Project description

EpubParser

Parses epub files (wrapper around ebooklib package). Extract chapter titles and their corresponding texts. Can also extract the cover image.

Installation

You can install epubparser via pip:

pip install epubparser

Usage

epubparser input.epub output.txt 

You can apply some arguments:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The arguments above may not be perfect, since it depends on regex an language.

--extract-cover extracts cover to covers directory. If this argument is passed, output file must be specified as None

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epubparser-0.1.3.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epubparser-0.1.3-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file epubparser-0.1.3.tar.gz.

File metadata

  • Download URL: epubparser-0.1.3.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.3.tar.gz
Algorithm Hash digest
SHA256 ae14c8a243bf7be8b0847ede47e642fa12a74b66708b844ff3057dc8c2713c65
MD5 ab66b1da26ed4f11a74fab0d20b563ef
BLAKE2b-256 6e8df1d29b74e2bcfaa3523eea474804f64157c746c32de71cadead24aa7a861

See more details on using hashes here.

File details

Details for the file epubparser-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: epubparser-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 6.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 907b942d63dba04d425ee054caad944c72b8bf9e27bc04609cb19442627d67fd
MD5 b6b0b151acb1da71c952cc2dd10802ff
BLAKE2b-256 5b1137d118e2e2830a50ae4933cb23e1bfec6f3c18940477f432f9b16112bb22

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page