Convert HTML to markdown.

Installation

pip install markdownify

Usage

Convert some HTML to Markdown:

from markdownify import markdownify as md
md('<b>Yay</b> <a href="http://github.com">GitHub</a>')  # > '**Yay** [GitHub](http://github.com)'

Specify tags to exclude (blacklist):

from markdownify import markdownify as md
md('<b>Yay</b> <a href="http://github.com">GitHub</a>', strip=['a'])  # > '**Yay** GitHub'

...or specify the tags you want to include (whitelist):

from markdownify import markdownify as md
md('<b>Yay</b> <a href="http://github.com">GitHub</a>', convert=['b'])  # > '**Yay** GitHub'

Options

Markdownify supports the following options:

strip

A list of tags to strip (blacklist). This option can’t be used with the convert option.

convert

A list of tags to convert (whitelist). This option can’t be used with the strip option.

autolinks

A boolean indicating whether the “automatic link” style should be used when an a tag’s contents match its href. Defaults to True.

heading_style

Defines how headings should be converted. Accepted values are ATX, ATX_CLOSED, SETEXT, and UNDERLINED (which is an alias for SETEXT). Defaults to UNDERLINED.

bullets

An iterable (string, list, or tuple) of bullet styles to be used. If the iterable only contains one item, it will be used regardless of how deeply lists are nested. Otherwise, the bullet will alternate based on nesting level. Defaults to '*+-'.

strong_em_symbol

In Markdown, both * and _ can be used to mark strong or emphasized text. Choose between them with the option values ASTERISK (default) or UNDERSCORE.

newline_style

Defines how line breaks (<br>) are marked in Markdown. The default value, SPACES, emits the usual two trailing spaces followed by a newline, while BACKSLASH converts a line break to \\n (a backslash and a newline). Although the latter convention is non-standard, it is commonly preferred and supported by many interpreters.

Options may be specified as kwargs to the markdownify function, or as a nested Options class in MarkdownConverter subclasses.

Development

To run tests:

python setup.py test

To lint:

python setup.py lint
