
Parses ebooks and extracts chapters and their contents.


EpubParser

Parses EPUB files (a wrapper around the ebooklib package). Extracts chapter titles and their corresponding text. Can also extract the cover image.

Installation

You can install epubparser via pip:

pip install epubparser

Usage

epubparser input.epub output.txt 

The following optional arguments are available:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The two arguments above may not work perfectly, since they depend on regular expressions and on the language of the book.
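To illustrate why title-based skipping depends on regex and language, here is a minimal sketch of how such filtering could work. The patterns and the function name `should_skip` are hypothetical and chosen only for illustration; the patterns epubparser actually uses may differ.

```python
import re

# Hypothetical patterns for illustration only; epubparser's real patterns may differ.
TOC_PATTERN = re.compile(r"^\s*(table\s+of\s+contents|contents|toc)\s*$", re.IGNORECASE)
LICENSE_PATTERN = re.compile(r"\b(license|licence|copyright)\b", re.IGNORECASE)


def should_skip(title, skip_toc=False, skip_license=False):
    """Return True if a chapter title matches an enabled skip pattern."""
    if skip_toc and TOC_PATTERN.search(title):
        return True
    if skip_license and LICENSE_PATTERN.search(title):
        return True
    return False


titles = ["Contents", "Chapter 1", "Inhaltsverzeichnis", "Project Gutenberg License"]
kept = [t for t in titles if not should_skip(t, skip_toc=True, skip_license=True)]
print(kept)  # ['Chapter 1', 'Inhaltsverzeichnis']
```

In this sketch the German table of contents ("Inhaltsverzeichnis") is not skipped, because the assumed patterns only cover English titles. This is the kind of miss that can happen with books in other languages.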

--extract-cover Extracts the cover image to the covers directory. If this argument is passed, the output file must be specified as None.
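The options above can be combined on one command line. The following sketch assembles such a command from Python. The helper `build_command` is hypothetical and not part of epubparser. It assumes that flags may follow the positional arguments and that the skip flags may be combined with each other; neither is confirmed by the description above.

```python
def build_command(epub_path, output="output.txt", skip_toc=False,
                  skip_license=False, extract_cover=False):
    """Assemble an epubparser command line from the documented flags."""
    # With --extract-cover, the output file must be given as the literal None.
    args = ["epubparser", epub_path, "None" if extract_cover else output]
    if skip_toc:
        args.append("--skip-toc")
    if skip_license:
        args.append("--skip-license")
    if extract_cover:
        args.append("--extract-cover")
    return args


cmd = build_command("input.epub", skip_toc=True, skip_license=True)
print(" ".join(cmd))  # epubparser input.epub output.txt --skip-toc --skip-license

cover_cmd = build_command("input.epub", extract_cover=True)
print(" ".join(cover_cmd))  # epubparser input.epub None --extract-cover
```

To actually run a command built this way, pass the list to `subprocess.run(cmd, check=True)` from the standard library; this requires epubparser to be installed.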

License

This project is licensed under the MIT License.

