Skip to main content

Utility to clean up an html file generated by LaTeXML

Project description

Clear html files that are generated by latexml

Description

This tool can be used to postprocess html files generated by LaTeXml.

The purpose it to remove all lxt tags and hyperrefs such that the html file can be imported into external software.

Installation

Install the tool by running:

pip install latex_html_cleaner

Usage

This tool can be used to clean html file generated by the tex to html converter LaTeXML. You can run it by:

htmlcleaner.exe filename.html

A new file with the suffix _clean.html will be created with the cleaned html.

Details

For all the details, do:

htmlcleaner.exe --help

giving the output:

usage: htmlcleaner [-h] [--version] [--output_filename STR] [-v] [-vv] [-w]
                   [-f [PATH ...]] [--clear_find_and_replace_defaults]
                   STR [STR ...]

Cleans html files and removes hyperrefs

positional arguments:
  STR                   File name of html input

options:
  -h, --help            show this help message and exit
  --version             show program's version number and exit
  --output_filename STR
                        File name of output html file
  -v, --verbose         set loglevel to INFO
  -vv, --very-verbose, --debug
                        set loglevel to DEBUG
  -w, --overwrite       Overwrite the input html. Default = False, which means
                        a new html is created withthe suffix _clean
  -f [PATH ...], --find_and_replace [PATH ...]
                    Define a list of key=value pairs to define string
                    patterns you want to replace
  --clear_find_and_replace_defaults
                        Clear the predefined find and replace patterns

Note

This project has been set up using PyScaffold 4.0.2. For details and usage information on PyScaffold see https://pyscaffold.org/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

latexml_html_cleaner-0.0.4.tar.gz (46.6 kB view hashes)

Uploaded Source

Built Distribution

latexml_html_cleaner-0.0.4-py2.py3-none-any.whl (9.9 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page