PDF parser and analyzer written entirely in Python.

Project Description

PDFMiner is a suite of programs for extracting and analyzing text data from PDF documents. Unlike other PDF-related tools, it can obtain the exact location of text on a page, as well as other layout information such as font size and font name, which can be useful for analyzing the document. It can also be used as the basis for a full-fledged PDF interpreter.

Release history

20140328
20140327
20140324
20131113
20131022
20110515
20110227
20101226
20101017
20100829
20100619p1
20100424
20100327
20100322
20100213
20100131
20100104
20091219
20091129
20091024
20091004
20090912
20090830
20090824
20090721
20090711
20090517
20090330
20090325
20090201
20090117
20090110
20080906
20080727 (this version)
20080629
20080427
