Skip to main content

Convert HTML to markdown

Project description

html2txt

html2txt converts HTML to markdown.

Usage

Import the module.

from html2txt import converters

Call the Html2Markdown converter on HTML text to convert it to markdown.

markdown = converters.Html2Markdown().convert(html)

Run converter for unit testing

> cd html2txt/converters
> python3 -B html2markdown.py --path path-to-html-directory

Tests

Create Virtual Environment

> cd html2txt

> python3 -m venv venv

> source ./venv/bin/activate

Making Tests

The Commonmark and Breakdance tests were written for converting markdown to HTML so many tests fail with differences of whitespace and choices of markdown representation.

Breakdance is used by Dillinger, a markdown editor.

> cd html2txt/tests

> python3 -B ./config/mkhtml2txt.py --path .

> python3 -B ./config/mkcommonmark.py --path .

> python3 -B ./config/mkbreakdance.py --path .

> python3 -B ./config/mksvgweb.py --path .

> python3 -B ./config/mkmathml.py --path .

Running Tests

> pytest -vv

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

html2txt-0.6.0.tar.gz (1.1 MB view details)

Uploaded Source

File details

Details for the file html2txt-0.6.0.tar.gz.

File metadata

  • Download URL: html2txt-0.6.0.tar.gz
  • Upload date:
  • Size: 1.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.7

File hashes

Hashes for html2txt-0.6.0.tar.gz
Algorithm Hash digest
SHA256 da199d93f6020861278c40ba5ee81fda8d93bf780554902ae28cc15d1e4adb04
MD5 b15b45d5d4ef46e1a1ad79511e8fdeeb
BLAKE2b-256 0b4e78ddc093b5c55b0c9f5d6d8194aab0bdac2652b7c6189d2e55c6a73cced3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page