PDF parser
Project description
A fork of PDFMiner.six that targets recent Python versions and takes a more trimmed-down approach.
PDFMajor is a tool for extracting information from PDF documents. Its main focus is on producing output that closely reflects the bare-bones structure of the PDF document. It includes a PDF converter that can transform PDF files into other text formats (such as HTML, XML, and JSON). It also has an extensible PDF parser that can be used for purposes other than text analysis.
Download files
Source Distributions
No source distribution files are available for this release.
Built Distribution
pdfmajor-1.0.0-py3-none-any.whl (161.0 kB)
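The wheel filename above follows the standard wheel naming convention (name, version, Python tag, ABI tag, platform tag), which is why this single file covers every platform. As a minimal sketch of how to read those tags, the helper below splits the filename into its parts. The function name parse_wheel_filename is hypothetical and is not part of PDFMajor; it only illustrates the naming convention.

```python
def parse_wheel_filename(filename: str) -> dict:
    """Split a wheel filename into its naming-convention components.

    Hypothetical helper for illustration only; not part of PDFMajor.
    """
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel filename: {filename!r}")

    parts = filename[: -len(suffix)].split("-")

    # Layout: name-version[-build]-python_tag-abi_tag-platform_tag
    if len(parts) == 5:
        name, version, python_tag, abi_tag, platform_tag = parts
        build = None
    elif len(parts) == 6:
        name, version, build, python_tag, abi_tag, platform_tag = parts
    else:
        raise ValueError(f"unexpected number of fields in {filename!r}")

    return {
        "name": name,
        "version": version,
        "build": build,
        "python_tag": python_tag,      # "py3": any Python 3 interpreter
        "abi_tag": abi_tag,            # "none": no compiled ABI dependency
        "platform_tag": platform_tag,  # "any": not tied to an OS or CPU
    }


info = parse_wheel_filename("pdfmajor-1.0.0-py3-none-any.whl")
for key, value in info.items():
    print(f"{key}: {value}")
```

The py3-none-any tags indicate a pure-Python wheel, so the same file should install on any platform with a compatible Python 3 interpreter. In practice an installer such as pip reads these tags itself, so parsing them by hand is rarely necessary.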