A parser and serializer for the YAY data format
Project description
YAY Parser for Python
A parser for the YAY data format, implemented in Python.
Requirements
- Python 3.10+
Installation
pip install libyay
Usage
import libyay as yay
# Parse a YAY string
data = yay.loads("""
name: Alice
age: 30
tags:
- python
- data
""")
# {'name': 'Alice', 'age': 30, 'tags': ['python', 'data']}
# Parse from file
with open('config.yay') as f:
data = yay.load(f)
API
loads(source)
Parses a YAY document string and returns the corresponding Python value.
load(file)
Parses a YAY document from a file object.
dumps(value, indent=True)
Serializes a Python value to a YAY string.
dump(value, file, indent=True)
Serializes a Python value to a file in YAY format.
Type Mapping
| YAY Type | Python Type | Notes |
|---|---|---|
null |
None |
|
| big integer | int |
Arbitrary precision |
| float64 | float |
Including float('inf'), float('-inf'), float('nan') |
| boolean | bool |
|
| string | str |
|
| array | list |
|
| object | dict |
|
| bytes | bytes |
YAY Format
roses-are-red: true # There is no "yes" or "on".
violets-are-blue: false # Violets are violet.
arrays:
- "may"
- "have"
- "many"
- "values"
and-objects-too:
integers-are-distinct: 42
from-their-floating-friends: 6.283 185 307 179 586 # digit grouping
inline:
string: "is concise"
array: [infinity, -infinity, nan]
object: {bigint: 1, float64: 2.0}
bytes: <f33d face>
block:
string: `
This is a string.
There are many like it.
array:
- "But"
- "this"
- "one's"
object:
mine: null
bytes: >
b0 b5 c0 ff # Bob's Coffee
fe fa ca de # Facade.
concatenated:
"I'm not dead yet. "
"I feel happy!"
unicode-code-point: "\u{1F600}" # UTF-16 surrogates are inexpressible
"name with spaces": 'works too'
Null
The keyword null denotes a null value.
null
None
Booleans
The literals true and false denote booleans.
A true boolean value.
true
True
A false boolean value.
false
False
Big Integers
Unquoted decimal digit sequences are big integers (arbitrary precision). A leading minus sign denotes a negative big integer; the minus must not be followed by a space. Spaces may be used to group digits for readability; they do not change the value.
Python's int type is arbitrary precision, so YAY big integers map directly.
A basic positive integer.
42
42
A negative integer.
-42
-42
Spaces group digits for readability without changing the value.
867 5309
8675309
Floating-Point (Float64)
A decimal point must be present to distinguish a float from a big integer.
Decimal literals with a decimal point, or the keywords infinity, -infinity, and nan, denote 64-bit floats.
A leading minus must not be followed by a space.
Spaces may group digits.
A basic floating-point number.
6.283185307179586
6.283185307179586
A float with a leading decimal point (no integer part).
.5
0.5
A float with a trailing decimal point (no fractional part).
1.
1.0
Negative zero is distinct from positive zero.
number-float-negative-zero.yay
-0.0
-0.0
Positive infinity.
infinity
float("inf")
Negative infinity.
number-float-negative-infinity.yay
-infinity
number-float-negative-infinity.py
float("-inf")
Not-a-number (canonical NaN).
nan
float("nan")
Spaces group digits for readability in floats.
6.283 185 307 179 586
6.283185307179586
Scientific notation using e or E for the exponent (Avogadro's number).
6.022e23
602200000000000000000000.0
Block Strings
Block strings use the backtick (`) introducer.
The body continues until the next line that is indented the same or less.
The first two spaces of each content line are stripped; any further indentation is preserved.
Empty lines are interpreted as newlines.
Trailing empty lines collapse to a single trailing newline.
Block strings do not support escape sequences—a backslash is just a backslash.
Comments are also not recognized inside block strings; # is literal content.
At Root Level
At root level or as an array item, content may appear on the same line after ` (backtick + space + content).
When the backtick is alone on a line, the result has an implicit leading newline.
Content on the same line as the backtick starts the string without a leading newline.
string-block-root-same-line.yay
` I think you ought to know I'm feeling very depressed.
This will all end in tears.
string-block-root-same-line.py
"I think you ought to know I'm feeling very depressed.\nThis will all end in tears.\n"
Backtick alone on its line produces a leading newline because content starts on the following line.
string-block-root-next-line.yay
`
I've calculated your chance of survival,
but I don't think you'll like it.
string-block-root-next-line.py
"\nI've calculated your chance of survival,\nbut I don't think you'll like it.\n"
An empty line in the middle of a block string is preserved as a newline.
`
I'm getting better!
No you're not.
"\nI'm getting better!\n\nNo you're not.\n"
The # character inside a block string is literal content, not a comment.
` # this is not a comment
it is content
"# this is not a comment\nit is content\n"
A block string may be deeply nested and the indentation prefix will be absent in the value. The string will end with a single newline regardless of any subsequent newlines in the YAY text.
string-block-nested-in-object-and-array.yay
parrot:
condition: `
No, no, it's just resting!
remarks:
- ` Remarkable bird, the Norwegian Blue.
Beautiful plumage, innit?
- ` It's probably pining for the fjords.
Lovely plumage.
string-block-nested-in-object-and-array.py
{"parrot": {"condition": "No, no, it's just resting!\n", "remarks": ["Remarkable bird, the Norwegian Blue.\nBeautiful plumage, innit?\n", "It's probably pining for the fjords.\nLovely plumage.\n"]}}
As Object Property
In property context, the backtick must be alone on the line (no content after it). There is no implicit leading newline. The first content line becomes the start of the string.
message: `
By Grabthar's hammer, we live to tell the tale.
{"message": "By Grabthar's hammer, we live to tell the tale.\n"}
An empty line in the middle of a block string property is preserved.
string-block-property-empty-middle.yay
message: `
It's not pining!
It's passed on! This parrot is no more!
string-block-property-empty-middle.py
{"message": "It's not pining!\n\nIt's passed on! This parrot is no more!\n"}
A block string property followed by another property: the block ends when a line at the same or lesser indent appears. Trailing empty lines collapse to a single trailing newline.
string-block-property-trailing-empty.yay
message: `
By Grabthar's hammer... what a savings.
next: 1
string-block-property-trailing-empty.py
{"message": "By Grabthar's hammer... what a savings.\n", "next": 1}
Inline Strings
Strings may be quoted with double or single quotes.
Double-quoted strings support escape sequences: \", \\, \/, \b, \f, \n, \r, \t, and \u{XXXXXX} for Unicode code points.
Single-quoted strings are literal (no escape sequences).
A double-quoted string.
string-inline-doublequote-basic.yay
"This will all end in tears."
string-inline-doublequote-basic.py
"This will all end in tears."
A single-quoted string (literal, no escapes).
string-inline-singlequote-basic.yay
'Are you suggesting coconuts migrate?'
string-inline-singlequote-basic.py
"Are you suggesting coconuts migrate?"
A double-quoted string with escape sequences.
string-inline-doublequote-escapes.yay
"\"\\\/\b\f\n\r\t\u{263A}"
string-inline-doublequote-escapes.py
"\"\\/\b\f\n\r\t☺"
A double-quoted string with a Unicode emoji (literal UTF-8).
string-inline-doublequote-unicode-emoji.yay
"😀"
string-inline-doublequote-unicode-emoji.py
"😀"
A Unicode code point escape for a character outside the BMP (U+1F600), which requires a surrogate pair in UTF-16.
The \u{...} escape accepts 1 to 6 hexadecimal digits representing a Unicode code point (e.g. \u{41} for "A", \u{1F600} for "😀").
Surrogate code points (U+D800 through U+DFFF) are forbidden in \u{...} escapes.
Unlike JSON, the four-digit \uXXXX form is not supported; use \u{XXXX} instead.
string-inline-doublequote-unicode-surrogate-pair.yay
"\u{1F600}"
string-inline-doublequote-unicode-surrogate-pair.py
"😀"
Concatenated Strings (Quoted Lines)
Multiple quoted strings on consecutive lines are concatenated into a single string. This is useful for breaking long strings across lines without introducing newlines in the result, or including visibly escaped characters like tab that are otherwise forbidden in string blocks.
confession:
"I'm not dead yet. "
"I feel happy!"
{"confession": "I'm not dead yet. I feel happy!"}
Block Arrays
Arrays are written as a sequence of items, each introduced by - (dash and space).
The two-character - prefix is the list marker; the value follows immediately.
Items may be nested: a bullet line whose content starts with - begins an inner list.
An array may be given a name as a key followed by :.
A basic block array with three integer items.
- 5
- 3
[5, 3]
Nested arrays where each top-level item contains an inner array.
- - "a"
- "b"
- - 1
- 2
[["a", "b"], [1, 2]]
An array as the value of an object property.
complaints:
- "I didn't vote for you."
- "Help, help, I'm being repressed!"
{"complaints": ["I didn't vote for you.", "Help, help, I'm being repressed!"]}
Inline Arrays
Inline arrays use JSON-style bracket syntax with strict spacing rules: no space after [, no space before ], exactly one space after each ,.
["And there was much rejoicing.", "yay."]
["And there was much rejoicing.", "yay."]
[42, 404, 418]
[42, 404, 418]
[<b0b5>, <cafe>]
[bytes.fromhex("b0b5"), bytes.fromhex("cafe")]
[["I feel happy!", "yay."], ["And there was much rejoicing.", "yay."]]
[["I feel happy!", "yay."], ["And there was much rejoicing.", "yay."]]
Block Objects
Objects are key–value pairs.
A key is followed by : and then the value.
Object keys must be either alphanumeric or quoted.
Nested objects are indented by two spaces.
answer: 42
error: 404
{"answer": 42, "error": 404}
Nested object.
parrot:
status: "pining for the fjords"
plumage: "beautiful"
{"parrot": {"plumage": "beautiful", "status": "pining for the fjords"}}
Object keys containing spaces or special characters must be quoted.
object-multiline-doublequote-key.yay
"key name": 1
object-multiline-doublequote-key.py
{"key name": 1}
empty: {}
{"empty": {}}
Inline Objects
Inline objects use JSON-style brace syntax with strict spacing rules.
{answer: 42, error: 404}
{"answer": 42, "error": 404}
{name: 'Marvin', mood: 'depressed'}
{"mood": "depressed", "name": "Marvin"}
{luggage: {combination: 12345}, air: ["canned", "Perri-Air"]}
{"air": ["canned", "Perri-Air"], "luggage": {"combination": 12345}}
Block Byte Arrays
Block byte arrays use the > introducer.
Each line may hold hex chunks and comments.
At root level, hex may appear on the same line after > .
> b0b5
c0ff
bytes.fromhex("b0b5c0ff")
bytearray-block-comment-only.yay
> # header comment
b0b5 c0ff
bytearray-block-comment-only.py
bytes.fromhex("b0b5c0ff")
bytearray-block-hex-and-comment.yay
> b0b5 # first chunk
c0ff # second chunk
bytearray-block-hex-and-comment.py
bytes.fromhex("b0b5c0ff")
In property context, > must be followed only by a comment or newline.
data: >
b0b5 c0ff
eefa cade
{"data": bytes.fromhex("b0b5c0ffeefacade")}
bytearray-block-property-comment.yay
data: > # raw bytes
b0b5 c0ff
bytearray-block-property-comment.py
{"data": bytes.fromhex("b0b5c0ff")}
Inline Byte Arrays
Binary data is written as hexadecimal inside angle brackets. Hexadecimal must be lowercase.
<>
b''
<b0b5c0ffeefacade>
bytes.fromhex("b0b5c0ffeefacade")
data: <b0b5c0ffeefacade>
{"data": bytes.fromhex("b0b5c0ffeefacade")}
Error Handling
Errors include line and column numbers for debugging:
import libyay as yay
from libyay import YayError, YaySyntaxError
try:
data = yay.loads('invalid: [')
except YaySyntaxError as e:
print(f"Syntax error at line {e.line}, column {e.col}: {e}")
except YayError as e:
print(f"YAY error: {e}")
Whitespace Rules
YAY has strict whitespace rules that the parser enforces:
- Two spaces for indentation (tabs are illegal)
- No trailing spaces on lines
- Exactly one space after
:in key-value pairs - Exactly one space after
,in inline arrays/objects - No spaces after
[or{, or before]or} - No space before
:in keys
Running Tests
cd python
python -m pytest test_yay.py
The test runner uses fixture files from ../test/.
Files with .yay extension contain YAY input.
Files with .py extension contain expected Python output.
References
Examples in this document pay homage to:
- The Hitchhiker's Guide to the Galaxy (Douglas Adams)
- Monty Python and the Holy Grail
- Monty Python's Flying Circus ("Dead Parrot" sketch)
- Galaxy Quest
- Spaceballs
- Tommy Tutone ("867-5309/Jenny")
- The Tau Manifesto
License
Apache 2.0
Copyright (C) 2026 Kris Kowal
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file libyay-1.0.0.tar.gz.
File metadata
- Download URL: libyay-1.0.0.tar.gz
- Upload date:
- Size: 25.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
88a9fb9cc8c3ad5e3a50ad10ba93c2c65e2e1acf48a57247561301d9043f9bc9
|
|
| MD5 |
464c5810543ef02077e9321516d26554
|
|
| BLAKE2b-256 |
394c82c6f5fcea17f0904867f66f7663014c6bd4c3a880b3b2f7c3bfe73444a0
|
File details
Details for the file libyay-1.0.0-py3-none-any.whl.
File metadata
- Download URL: libyay-1.0.0-py3-none-any.whl
- Upload date:
- Size: 20.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
be7864e971c0b6a02dd21e462d31e817fc790b0ffda6bd1360897c21e61e9a63
|
|
| MD5 |
07fc79154706a76f9e3a998b799f9dad
|
|
| BLAKE2b-256 |
2fe21d93e857c42bf1bc8e27fc4f19c7576e01399d7369f4ae3396d4aa3d6124
|