Skip to main content

PDF parser and analyzer

Project description

PDFMiner is a suite of programs that help extracting and analyzing text data of PDF documents. Unlike other PDF-related tools, it allows to obtain the exact location of texts in a page, as well as other extra information such as font information or ruled lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes instead of text analysis.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdfminer-20100104.tar.gz (3.8 MB view details)

Uploaded Source

File details

Details for the file pdfminer-20100104.tar.gz.

File metadata

  • Download URL: pdfminer-20100104.tar.gz
  • Upload date:
  • Size: 3.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pdfminer-20100104.tar.gz
Algorithm Hash digest
SHA256 b082ab4d26bfb5e4eeae50126ea2c86db62d47b496b4c8d99d0de5ad5a4dd795
MD5 4b4a9f01d332b517ccb7d1f40fdabe6e
BLAKE2b-256 f58b7360ba30e30db6af801168874a5879398ec394077640c1111b3f899d4ce7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page