Parses ebooks and extracts chapters and their contents.

Project description

EpubParser

Parses EPUB files (a wrapper around the ebooklib package). Extracts chapter titles and their corresponding texts. Can also extract the cover image.
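To make "chapter titles and their corresponding texts" concrete, here is a minimal sketch of the idea using only the Python standard library. It is not how EpubParser works internally (the package wraps ebooklib). The function name extract_chapters and the helper class are hypothetical, and the sketch assumes that each HTML/XHTML file in the archive is one chapter whose first h1 or h2 heading is its title. It also ignores the reading order declared in the EPUB spine and simply follows the archive's file order.

```python
import zipfile
from html.parser import HTMLParser


class _ChapterHTMLParser(HTMLParser):
    """Hypothetical helper: records the first h1/h2 as the title and collects visible text."""

    _SKIPPED = ("head", "script", "style")

    def __init__(self):
        super().__init__()
        self.title = None
        self.text_parts = []
        self._heading_parts = []
        self._in_heading = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self._SKIPPED:
            self._skip_depth += 1
        elif tag in ("h1", "h2") and self.title is None:
            self._in_heading = True

    def handle_endtag(self, tag):
        if tag in self._SKIPPED and self._skip_depth:
            self._skip_depth -= 1
        elif tag in ("h1", "h2") and self._in_heading:
            self._in_heading = False
            self.title = "".join(self._heading_parts).strip()

    def handle_data(self, data):
        if self._skip_depth:
            return  # ignore metadata, scripts and styles
        if self._in_heading:
            self._heading_parts.append(data)
        if data.strip():
            # The heading text is kept as part of the chapter text as well.
            self.text_parts.append(data.strip())


def extract_chapters(epub_file):
    """Return a list of (title, text) tuples, one per HTML document in the archive."""
    chapters = []
    with zipfile.ZipFile(epub_file) as archive:  # an EPUB is a ZIP container
        for name in archive.namelist():
            if not name.lower().endswith((".xhtml", ".html", ".htm")):
                continue
            parser = _ChapterHTMLParser()
            parser.feed(archive.read(name).decode("utf-8", errors="replace"))
            parser.close()
            # Fall back to the file name when no heading was found.
            chapters.append((parser.title or name, "\n".join(parser.text_parts)))
    return chapters
```

A real parser also has to handle the spine order, nested tables of contents and other encodings, which is the kind of work a library such as ebooklib takes care of.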

Installation

You can install the package via pip (the distribution files on this page are named epubparser):

pip install epubparser

Usage

epubparse input.epub output.txt 

The following optional arguments are available:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The two options above may not catch every case, since they rely on regular expressions and depend on the language of the book.
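The sketch below illustrates why title matching of this kind is language-dependent. The patterns and the function filter_chapters are hypothetical and written for this example only; the regular expressions the package actually uses are not shown in this description.

```python
import re

# Hypothetical English-only patterns, for illustration.
TOC_PATTERN = re.compile(r"^\s*(table of contents|contents|toc)\s*$", re.IGNORECASE)
LICENSE_PATTERN = re.compile(r"\b(license|licence|copyright)\b", re.IGNORECASE)


def filter_chapters(chapters, skip_toc=False, skip_license=False):
    """Drop (title, text) pairs whose title matches the enabled patterns."""
    kept = []
    for title, text in chapters:
        if skip_toc and TOC_PATTERN.search(title):
            continue
        if skip_license and LICENSE_PATTERN.search(title):
            continue
        kept.append((title, text))
    return kept


chapters = [
    ("Table of Contents", "..."),
    ("Chapter 1", "It was a dark night."),
    ("Inhaltsverzeichnis", "..."),  # German table of contents
    ("The Full License", "..."),
]
print(filter_chapters(chapters, skip_toc=True, skip_license=True))
```

With these English-only patterns the German "Inhaltsverzeichnis" chapter is kept, which is the kind of miss the note above warns about.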

--extract-cover Extracts the cover image to the covers directory. If this argument is passed, the output file must be specified as None.
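When the tool is called from a script, the rule about the output file is easy to get wrong, so a small helper that assembles the command line can be useful. The sketch below is hypothetical: build_command is not part of the package, and it assumes that "specified as None" means passing the literal word None as the output argument and that flags may follow the positional arguments.

```python
def build_command(input_path, output_path=None, skip_toc=False,
                  skip_license=False, extract_cover=False):
    """Assemble an epubparse argument list (hypothetical helper)."""
    if extract_cover and output_path is not None:
        raise ValueError("with --extract-cover the output file must be None")
    if not extract_cover and output_path is None:
        raise ValueError("an output file is required unless --extract-cover is used")

    # Assumption: a missing output file is written as the literal word "None".
    command = ["epubparse", str(input_path),
               "None" if output_path is None else str(output_path)]
    if skip_toc:
        command.append("--skip-toc")
    if skip_license:
        command.append("--skip-license")
    if extract_cover:
        command.append("--extract-cover")
    return command


print(build_command("input.epub", "output.txt", skip_toc=True))
print(build_command("input.epub", extract_cover=True))
```

The resulting list can be passed to subprocess.run once the package is installed.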

License

This project is licensed under the MIT License.

Download files

Source Distribution

epubparser-0.1.0.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

epubparser-0.1.0-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file epubparser-0.1.0.tar.gz.

File metadata

  • Download URL: epubparser-0.1.0.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.0.tar.gz
Algorithm Hash digest
SHA256 fc1dfdd9f8445605aef96ed3ff6a164db9be2164fd5d4226cfd2bb4d5935ee40
MD5 838f989994e57c9715a921cd2e628588
BLAKE2b-256 89e9800b14116059e8a4100886c56f70855eb992fe3a96d0c01a6872accc61ec

File details

Details for the file epubparser-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: epubparser-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 6.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d0cb472afe4e8b59d99ec41e0c35cd7325157fb3f1df9600056dbbab9dab20b3
MD5 9c448a62adbee3265851a044cf598734
BLAKE2b-256 a013e7e51505ed231fee5535091ab9ff4213d9a50a0186e93b4bce456607d731
