PDF parser and analyzer

Project description

PDFMiner is a suite of programs for extracting and analyzing text data from PDF documents. Unlike other PDF-related tools, it can report the exact location of text on a page, along with extra information such as fonts and ruled lines. It includes a PDF converter that transforms PDF files into other text formats (such as HTML), and an extensible PDF parser that can be used for purposes other than text analysis.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdfminer-20091024.tar.gz (1.9 MB)

Uploaded Source

File details

Details for the file pdfminer-20091024.tar.gz.

File metadata

  • Download URL: pdfminer-20091024.tar.gz
  • Upload date:
  • Size: 1.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pdfminer-20091024.tar.gz
Algorithm     Hash digest
SHA256        d30824aeece484c892320fad1224180d4d52795a164b778dcc60aa4d6aac4984
MD5           7f30bd028980545d60cb418aa685c638
BLAKE2b-256   eca84c6a6a5527e51841f2324a90081aa30ddd4908026430dab83e74b75743a9
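
The digests above can be used to check that a downloaded copy of the archive is intact. The following is a minimal sketch using only Python's standard `hashlib` and `hmac` modules. The function names `sha256_of_file` and `matches_expected` are hypothetical helpers introduced for illustration and are not part of PDFMiner or of any packaging tool.

```python
import hashlib
import hmac

# SHA256 digest listed on this page for pdfminer-20091024.tar.gz.
EXPECTED_SHA256 = "d30824aeece484c892320fad1224180d4d52795a164b778dcc60aa4d6aac4984"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`.

    The file is read in fixed-size chunks so that large archives
    do not have to fit in memory. (Hypothetical helper.)
    """
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals `expected_hex`.

    The comparison is case-insensitive on the expected value.
    (Hypothetical helper.)
    """
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())
```

Assuming the archive has been saved in the current directory under its original name, `matches_expected("pdfminer-20091024.tar.gz")` should return `True` for an unmodified download. SHA256 is used here rather than MD5 because MD5 is no longer considered collision-resistant.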
