pdfformulas

pdfformulas dumps the formulas of a PDF as PNG files in the ``formulas`` subfolder.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Education
License
- OSI Approved :: MIT License
Operating System
- POSIX :: Linux
Programming Language
- Python
- Python :: 2.7

Project description

usage: pdfformulas.py [-h] [–dxmin DXMIN] [–frompage FROMPAGE]: [–topage TOPAGE] [–page PAGE] [–formulaid FORMULAID] [–stats] pdffile

Dump the formulas of a PDF as PNG files in the formulas subfolder. The subfolder formulas is created, if not there yet. The PDF content must be accessible as text.

positional arguments:

pdffile PDF file to parse and dump formulas of

optional arguments:

-h, --help: show this help message and exit
--dxmin DXMIN: Additional left margin, which defines what is normal text. If the text before a formula is the beginning of a paragraph it might start a little indented. In this case it helps to move dxmin to the right. Units are those used in the PDF. Try 10.
--frompage FROMPAGE: PDF page number to start with.
--topage TOPAGE: PDF page number to stop at.
--page PAGE: PDF page number
--formulaid FORMULAID: The regular expression by which a formula is found. Formulas are recognized by their ID on the right. The regular expression used is:: r’^s*(d*.d*)s* ‘ e.g.:: (2.13) To find the rectangle comprising the formula the text before and after is located, which begins on the left of the page (dxmin). The formula is assumed to be indented with regard to normal text.
--stats: Only print (formula,page)-refs statistics. This tells which formulas are most often referenced in normal text and are thus likely the most important ones.

Requires: Pillow, PyMuPDF (needs compatible MuPDF installed), PdfMiner Installation: LibMuPDF and PyMuPDF will need to be installed beforehand.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Education
License
- OSI Approved :: MIT License
Operating System
- POSIX :: Linux
Programming Language
- Python
- Python :: 2.7

Release history Release notifications | RSS feed

This version

0.0.6

Jan 28, 2017

0.0.5

May 28, 2016

0.0.4

May 26, 2016

0.0.3

May 26, 2016

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pdfformulas-0.0.6-py2.py3-none-any.whl (8.3 kB view details)

Uploaded Jan 28, 2017 Python 2Python 3

File details

Details for the file pdfformulas-0.0.6-py2.py3-none-any.whl.

File metadata

Download URL: pdfformulas-0.0.6-py2.py3-none-any.whl
Upload date: Jan 28, 2017
Size: 8.3 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No

File hashes

Hashes for pdfformulas-0.0.6-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`01e2fd76fda4809aef2bf9d9beeae06e72fab84ce0e17b7ff7b837aed1a63b77`
MD5	`784a4e6d63245f25e3c805f4f0a36d67`
BLAKE2b-256	`376f26d3eaf18ab158e35665ff334a3e43a35ba776669d2707eb0e95868fa829`

See more details on using hashes here.

pdfformulas 0.0.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes