Skip to main content

PDF parser

Project description

Fork of PDFMiner.six using latest python support and a more trim down approach

PDFMajor is a tool for extracting information from PDF documents. It’s main focus is on obtaining as closely related results that reflect the bare-bone structure of the pdf document. It includes a PDF converter that can transform PDF files into other text formats (such as HTML, XML, JSON). It has an extensible PDF parser that can be used for other purposes instead of text analysis.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

pdfmajor-1.0.0-py3-none-any.whl (161.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page