A PDF-to-text converter based on pdfminer2

Project description

The pdf2textbox converter aims to convert PDFs laid out in up to three columns with a header into text without losing too much information. The command-line parameter -s (--slice) indicates that only part of the PDF document is of interest. The start and end pages are then either retrieved from the document's name, using '_' or '|' as delimiters, or, if they cannot be found there, requested from the user.
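
The page-range lookup described above could work roughly as in the following sketch. It is not taken from the pdf2textbox source: the function names `pages_from_name` and `resolve_page_range`, the prompts, and the assumption that the last two all-digit parts of the file name are the start and end page are all hypothetical and only illustrate the idea.

```python
import re
from pathlib import Path
from typing import Callable, Optional, Tuple


def pages_from_name(filename: str) -> Optional[Tuple[int, int]]:
    """Return (start, end) if the file name carries two page numbers, else None.

    Assumption: the page numbers are the last two all-digit parts of the
    name once it is split on '_' or '|'.
    """
    stem = Path(filename).stem  # drop the directory and the ".pdf" suffix
    parts = re.split(r"[_|]", stem)  # split on either delimiter
    numbers = [int(p) for p in parts if p.isascii() and p.isdigit()]
    if len(numbers) >= 2 and numbers[-2] <= numbers[-1]:
        return numbers[-2], numbers[-1]
    return None


def resolve_page_range(
    filename: str, ask: Callable[[str], str] = input
) -> Tuple[int, int]:
    """Use the file name if possible, otherwise fall back to user input."""
    found = pages_from_name(filename)
    if found is not None:
        return found
    # Hypothetical prompts; the real tool may word these differently.
    start = int(ask("Start page: "))
    end = int(ask("End page: "))
    return start, end
```

With these assumptions, a file named `report_12_15.pdf` or `report|12|15.pdf` would yield pages 12 to 15 without a prompt, while `report.pdf` would trigger the two input questions. Passing `ask` as a parameter keeps the fallback testable without a terminal.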


Download files

Source Distribution

pdf2textbox-0.1.3.tar.gz (1.5 MB)

Uploaded Source

Built Distribution

pdf2textbox-0.1.3-py3-none-any.whl (2.7 kB)

Uploaded Python 3
