Skip to main content

Parse ebooks, extracts chapters and contents.

Project description

EpubParser

Parses epub files (wrapper around ebooklib package). Extract chapter titles and their corresponding texts. Can also extract the cover image.

Installation

You can install epubparser via pip:

pip install epubparser

Usage

epubparser input.epub output.txt 

You can apply some arguments:

--skip-toc Skip chapters whose titles match common Table of Contents variants.

--skip-license Skip chapters whose titles match common License variants.

The arguments above may not be perfect, since it depends on regex an language.

--extract-cover extracts cover to covers directory. If this argument is passed, output file must be specified as None

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epubparser-0.1.1.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epubparser-0.1.1-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file epubparser-0.1.1.tar.gz.

File metadata

  • Download URL: epubparser-0.1.1.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.1.tar.gz
Algorithm Hash digest
SHA256 a48d5cd3e6778044b063b6f6cfb2fc9e105a43889755c1f7f7a19878323b3abf
MD5 5b3446c2d772167e0dece9f631662802
BLAKE2b-256 aba8986782b6df64fc2cba91858ed2b9c9b28f57c652663ab88a03b8fffd34d8

See more details on using hashes here.

File details

Details for the file epubparser-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: epubparser-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 6.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for epubparser-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7b522da2bf4e5186b363eadd4c9de2e5232a7f9ff083ce90f27984777bb84bdb
MD5 be5003f2ff7b52e1041bff04c80d1e70
BLAKE2b-256 897022c9971fb2f6cd58405b826b12defe42bbd5b9e41043aedd7103927c0426

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page