Utility to clean up an html file generated by LaTeXML

Project description

Clean HTML files that are generated by LaTeXML

Description

This tool postprocesses HTML files generated by LaTeXML.

Its purpose is to remove all lxt tags and hyperrefs so that the HTML file can be imported into external software.
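
To make "removing hyperrefs" concrete: one plausible reading is that links are unwrapped while their text is kept. The following is a minimal regex-based sketch of that idea, not the package's actual implementation. The name unwrap_links is hypothetical, and a real cleaner would more likely use an HTML parser than a regular expression.

```python
import re

# Matches an <a ...>...</a> element. The match is non-greedy so that
# adjacent links are handled one at a time.
ANCHOR = re.compile(r"<a\b[^>]*>(.*?)</a>", flags=re.IGNORECASE | re.DOTALL)


def unwrap_links(html):
    """Replace every <a> element with its inner content (hypothetical helper)."""
    return ANCHOR.sub(r"\1", html)


if __name__ == "__main__":
    print(unwrap_links('See <a href="#S1">Section 1</a> for details.'))
```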

Installation

Install the tool by running:

pip install latexml_html_cleaner

Usage

Use this tool to clean an HTML file generated by LaTeXML, the TeX-to-HTML converter. Run it with:

htmlcleaner.exe filename.html

A new file with the suffix _clean.html will be created with the cleaned html.
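
When calling the tool from a script, it helps to know in advance where the cleaned file will land. The sketch below assumes the output name is formed by inserting _clean before the .html extension, as the sentence above suggests. The helpers expected_clean_path, build_command and clean are hypothetical names, not part of the package, and the executable is assumed to be on the PATH as htmlcleaner (htmlcleaner.exe on Windows).

```python
from pathlib import Path
import subprocess


def expected_clean_path(html_file):
    """Return the assumed default output path, e.g. a.html -> a_clean.html."""
    path = Path(html_file)
    return path.with_name(f"{path.stem}_clean{path.suffix}")


def build_command(html_file, executable="htmlcleaner"):
    """Build the argument list for one input file (executable name is an assumption)."""
    return [executable, str(html_file)]


def clean(html_file, executable="htmlcleaner"):
    """Run the cleaner and return the path where the result is expected."""
    subprocess.run(build_command(html_file, executable), check=True)
    return expected_clean_path(html_file)
```

Calling clean("filename.html") would run the command shown above and return the path filename_clean.html, provided the naming assumption holds.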

Details

For the full list of options, run:

htmlcleaner.exe --help

giving the output:

usage: htmlcleaner [-h] [--version] [--output_filename STR] [-v] [-vv] [-w]
                   [-f [PATH ...]] [--clear_find_and_replace_defaults]
                   STR [STR ...]

Cleans html files and removes hyperrefs

positional arguments:
  STR                   File name of html input

options:
  -h, --help            show this help message and exit
  --version             show program's version number and exit
  --output_filename STR
                        File name of output html file
  -v, --verbose         set loglevel to INFO
  -vv, --very-verbose, --debug
                        set loglevel to DEBUG
  -w, --overwrite       Overwrite the input html. Default = False, which means
                        a new html is created withthe suffix _clean
  -f [PATH ...], --find_and_replace [PATH ...]
                        Define a list of key=value pairs to define string
                        patterns you want to replace
  --clear_find_and_replace_defaults
                        Clear the predefined find and replace patterns
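
The -f option takes key=value pairs. How the tool interprets them internally is not documented here, so the sketch below only illustrates the general idea of literal find-and-replace driven by such pairs. The names parse_pairs and apply_replacements are hypothetical, and splitting on the first "=" is an assumption.

```python
def parse_pairs(pairs):
    """Split each 'key=value' string on the first '=' into a (find, replace) tuple."""
    result = []
    for pair in pairs:
        if "=" not in pair:
            raise ValueError(f"expected key=value, got {pair!r}")
        key, value = pair.split("=", 1)
        result.append((key, value))
    return result


def apply_replacements(text, pairs):
    """Apply each replacement in order using literal (non-regex) matching."""
    for find, replace in parse_pairs(pairs):
        text = text.replace(find, replace)
    return text
```

Because replacements are applied in order, a later pair can act on text produced by an earlier one. Whether the real tool behaves the same way is not stated in its help output.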

Note

This project has been set up using PyScaffold 4.0.2. For details and usage information on PyScaffold see https://pyscaffold.org/.

Download files

Source Distribution

latexml_html_cleaner-0.0.4.tar.gz (46.6 kB view details)

Uploaded Source

Built Distribution

latexml_html_cleaner-0.0.4-py2.py3-none-any.whl (9.9 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file latexml_html_cleaner-0.0.4.tar.gz.

File metadata

  • Download URL: latexml_html_cleaner-0.0.4.tar.gz
  • Size: 46.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.7

File hashes

Hashes for latexml_html_cleaner-0.0.4.tar.gz

Algorithm    Hash digest
SHA256       fc1c7823d9de68bee6ba542a4e963530d16bc8a36b5247e8b6dbd5da3a5a7706
MD5          c294937c951bd2af42c97d908d3510ad
BLAKE2b-256  816453e99adc24c04e95c7bd252f98f21e8bdb40bfb98a137319294ba8d73448

File details

Details for the file latexml_html_cleaner-0.0.4-py2.py3-none-any.whl.

File hashes

Hashes for latexml_html_cleaner-0.0.4-py2.py3-none-any.whl

Algorithm    Hash digest
SHA256       820a68bce4beac396cebc7b3d0c3d3e6a35f5c41ad9230e5a50eb353a053b185
MD5          9d504d42fec60532e53f61f1bd090e2f
BLAKE2b-256  f2848f67a486769276920d00f8e83f05f269cc109f8247faee9ee85aba59e860
