Skip to main content

pdfformulas dumps the formulas of a PDF as PNG files in the ``formulas`` subfolder.

Project description

usage: pdfformulas.py [-h] [–dxmin DXMIN] [–frompage FROMPAGE]

[–topage TOPAGE] [–page PAGE] [–formulaid FORMULAID] [–stats] pdffile

Dump the formulas of a PDF as PNG files in the formulas subfolder. The subfolder formulas is created, if not there yet. The PDF content must be accessible as text.

positional arguments:

pdffile PDF file to parse and dump formulas of

optional arguments:
-h, --help

show this help message and exit

--dxmin DXMIN

Additional left margin, which defines what is normal text. If the text before a formula is the beginning of a paragraph it might start a little indented. In this case it helps to move dxmin to the right. Units are those used in the PDF. Try 10.

--frompage FROMPAGE

PDF page number to start with.

--topage TOPAGE

PDF page number to stop at.

--page PAGE

PDF page number

--formulaid FORMULAID

The regular expression by which a formula is found. Formulas are recognized by their ID on the right. The regular expression used is:: r’^s*(d*.d*)s* ‘ e.g.:: (2.13) To find the rectangle comprising the formula the text before and after is located, which begins on the left of the page (dxmin). The formula is assumed to be indented with regard to normal text.

--stats

Only print (formula,page)-refs statistics. This tells which formulas are most often referenced in normal text and are thus likely the most important ones.

Requires: Pillow, PyMuPDF (needs compatible MuPDF installed), PdfMiner Installation: LibMuPDF and PyMuPDF will need to be installed beforehand.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pdfformulas-0.0.6-py2.py3-none-any.whl (8.3 kB view details)

Uploaded Python 2Python 3

File details

Details for the file pdfformulas-0.0.6-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for pdfformulas-0.0.6-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 01e2fd76fda4809aef2bf9d9beeae06e72fab84ce0e17b7ff7b837aed1a63b77
MD5 784a4e6d63245f25e3c805f4f0a36d67
BLAKE2b-256 376f26d3eaf18ab158e35665ff334a3e43a35ba776669d2707eb0e95868fa829

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page