PDF parser and analyzer

Project description

PDFMiner is a suite of programs for extracting and analyzing text data from PDF documents. Unlike other PDF-related tools, it lets you obtain the exact location of text on a page, along with extra information such as fonts and ruled lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). Its extensible PDF parser can also be used for purposes other than text analysis.

Download files

Source Distribution

pdfminer-20100322.tar.gz (4.2 MB)

Uploaded Source

File details

Details for the file pdfminer-20100322.tar.gz.

File metadata

  • Download URL: pdfminer-20100322.tar.gz
  • Upload date:
  • Size: 4.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pdfminer-20100322.tar.gz
Algorithm Hash digest
SHA256 91898fdd9e24722f8f303907524f17432f4042358bca372b59982d629e5248a8
MD5 308e52d6e33e858f5085111008f4cd50
BLAKE2b-256 34c83f11e76e64c5d5e3a65ec824e28604bf8b971926e7cb92791d8c74c33b6a
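
The digests above can be used to check that a downloaded archive is intact. The following is a minimal sketch using Python's standard hashlib module. The helper names sha256_of_file and matches_expected are hypothetical, chosen only for illustration, and the sketch assumes the archive has already been saved locally under its original filename.

```python
import hashlib

# SHA256 digest copied from the "File hashes" table for pdfminer-20100322.tar.gz.
EXPECTED_SHA256 = "91898fdd9e24722f8f303907524f17432f4042358bca372b59982d629e5248a8"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path` (hypothetical helper).

    The file is read in chunks so that large archives need not fit in memory.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals `expected` (hypothetical helper)."""
    return sha256_of_file(path) == expected.strip().lower()


# Example usage, assuming the archive sits in the current directory:
# if not matches_expected("pdfminer-20100322.tar.gz"):
#     raise SystemExit("SHA256 mismatch: do not install this file")
```

SHA256 is used here because it is the strongest of the listed algorithms that every hashlib build supports by name; MD5 is listed on the page as well but is not suitable for detecting deliberate tampering.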
