Skip to main content

PDF parser and analyzer

Project description

Fork of PDFMiner using six for Python 2+3 compatibility

PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows to obtain the exact location of texts in a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes instead of text analysis.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdfminer.six-20191107.tar.gz (10.3 MB view details)

Uploaded Source

Built Distributions

pdfminer.six-20191107-py3-none-any.whl (5.6 MB view details)

Uploaded Python 3

pdfminer.six-20191107-py2-none-any.whl (5.6 MB view details)

Uploaded Python 2

File details

Details for the file pdfminer.six-20191107.tar.gz.

File metadata

  • Download URL: pdfminer.six-20191107.tar.gz
  • Upload date:
  • Size: 10.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.37.0 CPython/3.7.4

File hashes

Hashes for pdfminer.six-20191107.tar.gz
Algorithm Hash digest
SHA256 c9b9d4a9c5f4e1df15934b845041565398b39195424ad68ec2cf40cadaf2465b
MD5 b236a4afad49dbea74f587e06b412043
BLAKE2b-256 a088799882470a92c0f5e1cf475f9c3d75705e5133c94c38ed172f4223def9b7

See more details on using hashes here.

File details

Details for the file pdfminer.six-20191107-py3-none-any.whl.

File metadata

  • Download URL: pdfminer.six-20191107-py3-none-any.whl
  • Upload date:
  • Size: 5.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.37.0 CPython/3.7.4

File hashes

Hashes for pdfminer.six-20191107-py3-none-any.whl
Algorithm Hash digest
SHA256 ca219896e3ee4bdbc4765900634aad2262bc45d89f1a74b29cc17e0f69a07805
MD5 650c1ffe86ae8e75eae2b9bac8119fcc
BLAKE2b-256 17d3551da538656a98cc5315a34c1e354b58e47b8eff5f7f67a3817893d64e1d

See more details on using hashes here.

File details

Details for the file pdfminer.six-20191107-py2-none-any.whl.

File metadata

  • Download URL: pdfminer.six-20191107-py2-none-any.whl
  • Upload date:
  • Size: 5.6 MB
  • Tags: Python 2
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.37.0 CPython/3.7.4

File hashes

Hashes for pdfminer.six-20191107-py2-none-any.whl
Algorithm Hash digest
SHA256 d6097f078f7bfd1bd32bf57406e63abbf70b5cb7ec3d2cfb03fa0a31fb036966
MD5 5fa93781058c3c5cf31ecc89dc6e4bc4
BLAKE2b-256 45a787d53742d8ddfdde51aa3e2124c8c63313a7e2a2ca099550256b24794288

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page