Convert HTML to markdown
Project description
html2txt
html2txt converts HTML to markdown.
Usage
Import the module.
from html2txt import converters
Call the Html2Markdown converter on HTML text to convert it to markdown.
markdown = converters.Html2Markdown().convert(html)
Run converter for unit testing
> cd html2txt/converters
> python3 -B html2markdown.py --path path-to-html-directory
Tests
Create Virtual Environment
> cd html2txt
> python3 -m venv venv
> source ./venv/bin/activate
Making Tests
The Commonmark and Breakdance tests were written for converting markdown to HTML so many tests fail with differences of whitespace and choices of markdown representation.
Breakdance is used by Dillinger, a markdown editor.
> cd html2txt/tests
> python3 -B ./config/mkhtml2txt.py --path .
> python3 -B ./config/mkcommonmark.py --path .
> python3 -B ./config/mkbreakdance.py --path .
> python3 -B ./config/mksvgweb.py --path .
> python3 -B ./config/mkmathml.py --path .
Running Tests
> pytest -vv
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
html2txt-0.6.0.tar.gz
(1.1 MB
view details)
File details
Details for the file html2txt-0.6.0.tar.gz
.
File metadata
- Download URL: html2txt-0.6.0.tar.gz
- Upload date:
- Size: 1.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.0.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | da199d93f6020861278c40ba5ee81fda8d93bf780554902ae28cc15d1e4adb04 |
|
MD5 | b15b45d5d4ef46e1a1ad79511e8fdeeb |
|
BLAKE2b-256 | 0b4e78ddc093b5c55b0c9f5d6d8194aab0bdac2652b7c6189d2e55c6a73cced3 |