Skip to main content

a Simple Enough Markup

Project description

Author:

Lele Gaifax

Contact:
lele@metapensiero.it
license:

GNU General Public License version 3 or later

Implement a minimalistic markup usable in the various descriptions, with just the needed elements: headings and paragraphs containing plain, bold or italic text, hyper <link> and unordered lists.

The internal representation is a tree of nodes, and a set of functions to transpose it to/from a textual format, HTML and Quill Delta are included.

Changes

0.27 (2022-05-20)

  • Do not fail badly on whitespace only outside a text node

0.26 (2021-02-02)

  • Suppress warning on unsupported HTML tags

0.25 (2020-06-18)

  • As an interim workaround to SemText limitation, represent “bolded links” as plain links

0.24 (2020-05-21)

  • Add option to parse_text() to fallback to plain text on parsing errors

0.23 (2019-12-02)

  • Ignore spurious orphan headers

0.22 (2019-11-08)

  • Ignore non-textual insert operations, such as “image”

0.21 (2019-09-20)

  • Explain assertion errors, to avoid meaningless logging messages

  • Suppress warning on ignored <pre> tags

0.20 (2019-09-02)

  • Ignore code-block attributes

0.19 (2019-07-16)

  • Workaround a degenerated case involving empty hrefs

0.18 (2019-07-12)

  • Completely ignore <iframe> and <script> tags

  • Extract <table> content as plain paragraph, to make them at least readable

0.17 (2019-07-03)

  • Ignore text attributes that we are not going to handle

0.16 (2019-06-25)

  • Handle HTML created with non-Unix end-of-line convention

0.15 (2018-08-23)

  • Ignore <style> tags and degenerated <a> tags

0.14 (2018-08-23)

  • Ignore <img> tags, out of scope at least for now

0.13 (2018-08-23)

  • Rewritten HTML parser, slightly more robust and versatile

0.12 (2018-08-17)

  • Try harder to handle degenerated paragraphs represented with DIVs

  • Replace asserts with a custom exception to signal parsing errors

0.11 (2018-08-15)

  • Handle degenerated paragraphs represented with DIVs

  • Add an option to swallow HTML parsing exceptions and falling back to plain text

0.10 (2018-08-01)

  • Handle SPANs inside headings

0.9 (2018-07-12)

  • Ignore standalone BRs in the HTML parser

0.8 (2018-07-12)

  • Ignore BRs inside headings in the HTML parser

0.7 (2018-06-26)

  • Better handling of nested DIVs in the HTML parser

0.6 (2018-06-13)

  • Handle implicit list item indexes in SEMPrinter

0.5 (2018-04-26)

  • Properly escape also the link’s address

0.4 (2018-04-26)

0.3 (2018-04-20)

  • Support for hyperlinks

  • Support for headings

  • New function to emit a Quill Delta representation of an AST

0.2 (2018-03-10)

  • Fix HTML representation of numbered list items without a value

  • Raise a specific InvalidNestingError exception instead of generic AssertionError

0.1 (2018-02-25)

  • Renamed to metapensiero.markup.semtext

0.0 (unreleased)

  • Initial effort.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

metapensiero.markup.semtext-0.27.tar.gz (19.2 kB view details)

Uploaded Source

File details

Details for the file metapensiero.markup.semtext-0.27.tar.gz.

File metadata

File hashes

Hashes for metapensiero.markup.semtext-0.27.tar.gz
Algorithm Hash digest
SHA256 c4ffe0b77fa0663b370c0396dfecc9d06ac1363ba7169f3bf9dacb3df80f3664
MD5 fc0441c353cf46b8b713f5f3ea6e92f1
BLAKE2b-256 7e5d17031e39a4d1eeefc489cdda6f636cbf29809b4ba786cc8ee5f7aa360715

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page