pdfformulas dumps the formulas of a PDF as PNG files in the ``formulas`` subfolder.
Project description
- usage: pdfformulas.py [-h] [–dxmin DXMIN] [–frompage FROMPAGE]
[–topage TOPAGE] [–page PAGE] [–formulaid FORMULAID] [–stats] pdffile
Dump the formulas of a PDF as PNG files in the formulas subfolder. The subfolder formulas is created, if not there yet. The PDF content must be accessible as text.
- positional arguments:
pdffile PDF file to parse and dump formulas of
- optional arguments:
- -h, --help
show this help message and exit
- --dxmin DXMIN
Additional left margin, which defines what is normal text. If the text before a formula is the beginning of a paragraph it might start a little indented. In this case it helps to move dxmin to the right. Units are those used in the PDF. Try 10.
- --frompage FROMPAGE
PDF page number to start with.
- --topage TOPAGE
PDF page number to stop at.
- --page PAGE
PDF page number
- --formulaid FORMULAID
The regular expression by which a formula is found. Formulas are recognized by their ID on the right. The regular expression used is:: r’^s*(d*.d*)s* ‘ e.g.:: (2.13) To find the rectangle comprising the formula the text before and after is located, which begins on the left of the page (dxmin). The formula is assumed to be indented with regard to normal text.
- --stats
Only print (formula,page)-refs statistics. This tells which formulas are most often referenced in normal text and are thus likely the most important ones.
Requires: Pillow, PyMuPDF (needs compatible MuPDF installed), PdfMiner Installation: LibMuPDF and PyMuPDF will need to be installed beforehand.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pdfformulas-0.0.6-py2.py3-none-any.whl.
File metadata
- Download URL: pdfformulas-0.0.6-py2.py3-none-any.whl
- Upload date:
- Size: 8.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
01e2fd76fda4809aef2bf9d9beeae06e72fab84ce0e17b7ff7b837aed1a63b77
|
|
| MD5 |
784a4e6d63245f25e3c805f4f0a36d67
|
|
| BLAKE2b-256 |
376f26d3eaf18ab158e35665ff334a3e43a35ba776669d2707eb0e95868fa829
|