JavaScript Object Markup Language
Project description
JSOML
JSOML (JavaScript Object Markup Language) is an XML format that represents data structures that can be serialized in JSON/YAML. JSON/YAML can be round-tripped through JSOML and vice versa.
JSOML serializes large strings in a more readable and diff-friendly manner compared to JSON. This is especially useful for strings of HTML content (or other non-XML formats) that are stored in data structures used with JSON.
Text (without the trigram ]]>
) can be included unmodified within JSOML (in a CDATA
section).
For example, a data structure serialized in JSON as:
{
"date": "2022-04-14",
"authors": [
{
"given-names": "Begoña José",
"surname": "Sagües"
}
],
"html": "<h1 id=\"title\">On Testing</h1>\n<h2 id=\"introduction\">Introduction</h2>\n<p>\nBegoña José Sagües\nis \"testing\".\n</p>\n"
}
is serialized in JSOML as:
<obj>
<str key="date" val="2022-04-14"/>
<arr key="authors">
<obj>
<str key="given-names" val="Begoña José"/>
<str key="surname" val="Sagües"/>
</obj>
</arr>
<str key="html"><notline/><![CDATA[
<h1 id="title">On Testing</h1>
<h2 id="introduction">Introduction</h2>
<p>
Begoña José Sagües
is "testing".
</p>
]]></str>
</obj>
This Python library provides methods dump
, load
, dumps
, and loads
for
reading and writing JSOML. They behave similarly to the methods of the same name in the
json
module of the Python standard library.
YAML vs JSOML
Unlike XML, YAML provides a lightweight format like JSON. Like JSOML, large strings can be serialized in YAML in a more readable and diff-friendly manner. However, YAML requires modification of strings in the form of indenting all lines or escaping double quotes and newline characters.
In contrast, JSOML only requires the trigram ]]>
to be escaped. All other lines of a string
can be included unmodified.
CLI Tool
You can convert between JSON and JSOML using jsoml
as a command line tool.
Example
$ echo 123 > test.json
$ python3 -m jsoml test.json > test.xml
$ cat test.xml
<?xml version='1.0' encoding='UTF-8'?>
<num val="123"/>
$ python3 -m jsoml test.xml
123
Usage
usage: __main__.py [-h] [--from {json,xml,jsoml}] inpath
JSOML tool
positional arguments:
inpath input file path
options:
-h, --help show this help message and exit
--from {json,xml,jsoml}
format of source
If the --from
option is omitted, jsoml will guess based on the file extension.
Specification
JSOML is an XML format with a schema of eight XML elements. Seven of these XML elements are the "JSON XML elements". They correspond directly to the seven JSON value tokens:
<obj>
for a JSON object<arr>
for a JSON array<num/>
for a JSON number<str>
for a JSON string<null/>
for JSON "null"<true/>
for JSON "true"<false/>
for JSON "false"
The eighth XML element is <notline/>
. It only appears as a child element of <str>
and is the only XML element that <str>
can contain. It modifies the semantics of the
XML character content of <str>
. See below for more details.
XML attribute key=
A JSOML XML element has a key
attribute if and only if it is a child element of an
<obj>
element. The string value is the corresponding key value in JSON.
XML attribute val=
The <num>
element must have a val
attribute. Its value corresponds exactly
to the corresponding number token value in JSON.
An empty <str/>
element may have a val
attribute. It is an alternative to having the
string value as XML element content. Its string value corresponds to a string value in JSON.
Considerations for <str>
content
In the absence of any child <notline/>
element of <str>
, the XML contents
of <str>
are processed as usual with XML. In particular, the use of <![CDATA[
can be useful for including text unmodified until the trigram ]]>
.
The semantics of <notline/>
is to remove a newline character that follows it.
Apart from this child element and a newline character that follows it,
all other character content contained by <str>
is processed as usual with XML.
The combination of <notline/>
and <![CDATA[
together can be useful for
multiline strings as seen in the example above.
This combination is also useful for handling string values with the trigram ]]>
,
as shown in the example below. Consider a string value which in Python can be
coded as:
"""Is the mathematical condition
$$
E[x[i]]>0
$$
problematic?
"""
This Python string is encoded in JSON as:
"Is the mathematical condition\n$$\nE[x[i]]>0\n$$\nproblematic?\n"
and in JSOML it can be encoded as:
<str><notline/><![CDATA[
Is the mathematical condition
$$
E[x[i]]><notline/><![CDATA[
]]><notline/><![CDATA[
0
$$
problematic?
]]></str>
This escaping for JSOML can be achieved in code by replacing all substrings:
"]]>"
with the substring:
"]]><notline/><![CDATA[\n]]><notline/><![CDATA[\n"
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file jsoml-0.3.1.tar.gz
.
File metadata
- Download URL: jsoml-0.3.1.tar.gz
- Upload date:
- Size: 8.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 357bc0e8260ccf23b79297519373c166b06731eb92034c53c2fc4b18b10308a6 |
|
MD5 | b83a787b5ac5e4df57428ad91f1b7782 |
|
BLAKE2b-256 | 8b1c0ce6e3e6455c9ddccaabdaacd87dcc0815d149baca06bc6a0d3526b75dcf |
File details
Details for the file jsoml-0.3.1-py3-none-any.whl
.
File metadata
- Download URL: jsoml-0.3.1-py3-none-any.whl
- Upload date:
- Size: 7.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7a557964f6b5f74ce22cef4310e2f80198ec89adc359223f3b558c50519b13c |
|
MD5 | dbdfe1d3c84270561309d046ee40cbd2 |
|
BLAKE2b-256 | ffc909c64ad0b4858b7e9556a4a491f878fe0625a9ae8b024da8ddcb3018805a |