Skip to main content

A PDF-to-text converter based on pdfminer2

Project description

The pdf2textbox converter aims at converting PDF that comes in up to three columns and a header into text without loosing too much information. Allows command line parameter -s (–slice) to indicate that only part of the PDF document is of interest. Start and end page will then either retreived from the document’s name using either ‘_’ or ‘|’ as delimiters or - if start and end page cannot be found - user input is requested.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for pdf2textbox, version 0.1.3
Filename, size File type Python version Upload date Hashes
Filename, size pdf2textbox-0.1.3-py3-none-any.whl (2.7 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size pdf2textbox-0.1.3.tar.gz (1.5 MB) File type Source Python version None Upload date Hashes View

Supported by

Pingdom Pingdom Monitoring Google Google Object Storage and Download Analytics Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page