Skip to main content
This is a pre-production deployment of Warehouse. Changes made here affect the production instance of PyPI (pypi.python.org).
Help us improve Python packaging - Donate today!

PDF parser and analyzer

Project Description

PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows to obtain the exact location of texts in a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes instead of text analysis.

Release History

Release History

This version
History Node

20140328

History Node

20140327

History Node

20140324

History Node

20131113

History Node

20131022

History Node

20110515

History Node

20110227

History Node

20101226

History Node

20101017

History Node

20100829

History Node

20100619p1

History Node

20100424

History Node

20100327

History Node

20100322

History Node

20100213

History Node

20100131

History Node

20100104

History Node

20091219

History Node

20091129

History Node

20091024

History Node

20091004

History Node

20090912

History Node

20090830

History Node

20090824

History Node

20090721

History Node

20090711

History Node

20090517

History Node

20090330

History Node

20090325

History Node

20090201

History Node

20090117

History Node

20090110

History Node

20080906

History Node

20080727

History Node

20080629

History Node

20080427

Download Files

Download Files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

File Name & Checksum SHA256 Checksum Help Version File Type Upload Date
pdfminer-20140328.tar.gz (4.1 MB) Copy SHA256 Checksum SHA256 Source Mar 28, 2014

Supported By

WebFaction WebFaction Technical Writing Elastic Elastic Search Pingdom Pingdom Monitoring Dyn Dyn DNS Sentry Sentry Error Logging CloudAMQP CloudAMQP RabbitMQ Heroku Heroku PaaS Kabu Creative Kabu Creative UX & Design Fastly Fastly CDN DigiCert DigiCert EV Certificate Rackspace Rackspace Cloud Servers DreamHost DreamHost Log Hosting