Skip to main content

Parse ebooks, extracts chapters and contents.

Project description

EpubParser

Parses epub files (wrapper around ebooklib package). Extract chapter titles and their corresponding texts. Can also extract the cover image.

Installation

You can install epubparser via pip:

pip install epubparser

Usage

epubparser input.epub output.txt 

You can apply some arguments:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The arguments above may not be perfect, since it depends on regex an language.

--extract-cover extracts cover to covers directory. If this argument is passed, output file must be specified as None

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epubparser-0.1.2.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epubparser-0.1.2-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file epubparser-0.1.2.tar.gz.

File metadata

  • Download URL: epubparser-0.1.2.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.2.tar.gz
Algorithm Hash digest
SHA256 63c215a9e55745b00cd3b868be959d9afa604813ef6673f761a77b4ceff1db6a
MD5 94dd38192aa76e2e1915646ed7433935
BLAKE2b-256 e9ff2d619f9a1a7fffd4a725c1b1533bd15675970d3e9d0c5ae47231b1fbf477

See more details on using hashes here.

File details

Details for the file epubparser-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: epubparser-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 6.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 627323f99ee84fcf256a8887bf19362b461fc95ecbdb4919a95f76f229cd93e0
MD5 85c4b037c4f6e80f3a65d614ab6cd50e
BLAKE2b-256 21682b5c2a694ddea03568124b154bea9bb52e8c4d3e837efbfa42a3c1dfb809

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page