A markdown extension for converting Leiden+ epigraphic text to TEI Epidoc XML
Project description
LeidenMark
$ pip install leidenmark
A Python Markdown extension for converting Leiden+ epigraphic text to TEI XML/HTML. Inspired by the Brill plain text (BPT) format that aims to incorporate Leiden+ into a Markdown-based syntax.
>>> from leidenmark import leiden_plus
>>> content = """\
<D=.r<=
1. Lorem ipsum dolor
vac.1lin
2. sit amet, con[ca.3]c
3.-etur adipiscing
=>=D>
<D=.v<=
lost.2lin
6. ut labore et dol
7.-ore magna aliqua
=>=D>"""
>>> leiden_plus(content, indent=True)
The output of the above lines is the following XML snippet:
<div n="r" type="textpart">
<ab>
<l n="1">Lorem ipsum dolor</l>
<space quantity="1" unit="line"/>
<l n="2">sit amet, con<gap precision="low" quantity="3" unit="character"/>c</l>
<l break="no" n="3">etur adipiscing</l>
</ab>
</div>
<div n="v" type="textpart">
<ab>
<gap quantity="2" unit="line"/>
<l n="6">ut labore et dol</l>
<l break="no" n="7">ore magna aliqua</l>
</ab>
</div>
leiden_plus()
is syntactic sugar for the registered Markdown extension, and equivalent to:
>>> import leidenmark
>>> from markdown import markdown
>>> markdown(content, extensions=['leiden_plus']) # Other extensions can be added to this list
Configuration
Given that this is a Markdown extension, conventions like *italics*
and **bold**
will also be recognized an converted (these in particular will additionally be transformed to the TEI element <hi>
). Though these are in principle not part of the Leiden+ syntax, in practice the use of italics and boldface is still encountered a lot. Therefore, support is maintaned by default, which can be switched off by passing strict=True
:
>>> leiden_plus(content, strict=True)
NB: The blockprocessors for paragraphs and ordered list are always switched off, because they interfer too much with Leiden+.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for leidenmark-0.1.28-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3f47e48fbe5e9c69d3f75820af3c75e34b83c20592a2934ed83aafa03b0b0837 |
|
MD5 | 933bdea235d6c86605671814d85b39f6 |
|
BLAKE2b-256 | 805a07ffa7efc1a41fcc5adf2beae824b2c81aa10f93bb8aca46cc73927f15ed |