
a Simple Enough Markup

Project description

Author: Lele Gaifax

Contact: lele@metapensiero.it

License: GNU General Public License version 3 or later

Implements a minimalistic markup for use in various descriptions, with just the needed elements: headings and paragraphs containing plain, bold or italic text, hyperlinks and unordered lists.

The internal representation is a tree of nodes. A set of functions is included to convert that tree to/from a textual format, HTML and Quill Delta.
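To make the idea of a node tree with pluggable output formats concrete, here is a minimal sketch. It is not this package's API: the names Span, Paragraph, to_html and to_delta are hypothetical, invented for illustration, and the sketch covers only styled text inside paragraphs.

```python
from dataclasses import dataclass, field
from html import escape
from typing import List


@dataclass
class Span:
    """Hypothetical leaf node: a run of text with optional styling."""
    text: str
    bold: bool = False
    italic: bool = False


@dataclass
class Paragraph:
    """Hypothetical container node: a paragraph made of spans."""
    spans: List[Span] = field(default_factory=list)


def span_to_html(span: Span) -> str:
    """Render one span, escaping the text and wrapping it in style tags."""
    out = escape(span.text)
    if span.italic:
        out = f"<em>{out}</em>"
    if span.bold:
        out = f"<strong>{out}</strong>"
    return out


def to_html(paragraphs: List[Paragraph]) -> str:
    """Walk the tree and emit one <p> element per paragraph."""
    return "".join(
        "<p>" + "".join(span_to_html(s) for s in p.spans) + "</p>"
        for p in paragraphs
    )


def to_delta(paragraphs: List[Paragraph]) -> dict:
    """Walk the same tree and emit a Quill Delta style dict of insert ops."""
    ops = []
    for p in paragraphs:
        for s in p.spans:
            op = {"insert": s.text}
            attrs = {}
            if s.bold:
                attrs["bold"] = True
            if s.italic:
                attrs["italic"] = True
            if attrs:
                op["attributes"] = attrs
            ops.append(op)
        # In a Delta, a newline insert terminates each paragraph.
        ops.append({"insert": "\n"})
    return {"ops": ops}


doc = [Paragraph([Span("Hello "), Span("world", bold=True), Span(" & co", italic=True)])]
html_out = to_html(doc)
delta_out = to_delta(doc)
```

Both emitters traverse the same tree, which is the point of keeping a format-neutral internal representation: adding another output format means writing one more walker, not another parser.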

Changes

0.24 (2020-05-21)

  • Add an option to parse_text() to fall back to plain text on parsing errors

0.23 (2019-12-02)

  • Ignore spurious orphan headers

0.22 (2019-11-08)

  • Ignore non-textual insert operations, such as “image”

0.21 (2019-09-20)

  • Explain assertion errors, to avoid meaningless logging messages

  • Suppress warning on ignored <pre> tags

0.20 (2019-09-02)

  • Ignore code-block attributes

0.19 (2019-07-16)

  • Work around a degenerate case involving empty hrefs

0.18 (2019-07-12)

  • Completely ignore <iframe> and <script> tags

  • Extract <table> content as plain paragraphs, to make it at least readable

0.17 (2019-07-03)

  • Ignore text attributes that we are not going to handle

0.16 (2019-06-25)

  • Handle HTML created with non-Unix end-of-line convention

0.15 (2018-08-23)

  • Ignore <style> tags and degenerate <a> tags

0.14 (2018-08-23)

  • Ignore <img> tags, out of scope at least for now

0.13 (2018-08-23)

  • Rewritten HTML parser, slightly more robust and versatile

0.12 (2018-08-17)

  • Try harder to handle degenerate paragraphs represented with DIVs

  • Replace asserts with a custom exception to signal parsing errors

0.11 (2018-08-15)

  • Handle degenerate paragraphs represented with DIVs

  • Add an option to swallow HTML parsing exceptions and fall back to plain text

0.10 (2018-08-01)

  • Handle SPANs inside headings

0.9 (2018-07-12)

  • Ignore standalone BRs in the HTML parser

0.8 (2018-07-12)

  • Ignore BRs inside headings in the HTML parser

0.7 (2018-06-26)

  • Better handling of nested DIVs in the HTML parser

0.6 (2018-06-13)

  • Handle implicit list item indexes in SEMPrinter

0.5 (2018-04-26)

  • Also properly escape the link’s address

0.4 (2018-04-26)

0.3 (2018-04-20)

  • Support for hyperlinks

  • Support for headings

  • New function to emit a Quill Delta representation of an AST

0.2 (2018-03-10)

  • Fix HTML representation of numbered list items without a value

  • Raise a specific InvalidNestingError exception instead of generic AssertionError

0.1 (2018-02-25)

  • Renamed to metapensiero.markup.semtext

0.0 (unreleased)

  • Initial effort.


Source Distribution

metapensiero.markup.semtext-0.24.tar.gz (19.2 kB)

Uploaded Source

File details

Details for the file metapensiero.markup.semtext-0.24.tar.gz.

File metadata

  • Download URL: metapensiero.markup.semtext-0.24.tar.gz
  • Size: 19.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.8.3

File hashes

Hashes for metapensiero.markup.semtext-0.24.tar.gz
Algorithm Hash digest
SHA256 df5ab87456b72743a9e018febe3b94483c4823db027fef3007a688365c2ba8f1
MD5 5a749070127214c4b5102054944bc84b
BLAKE2b-256 1203c471b2d1b0baefeda602af905ae53a9428e900322120d8997508184025e6
