Strict, typed YAML parser
StrictYAML
StrictYAML is a type-safe YAML parser that parses and validates a restricted subset of the YAML specification.
Priorities:
- Beautiful API
- Refusing to parse the ugly, hard-to-read and insecure features of YAML, such as the implicit typing behind the Norway problem.
- Strict validation of markup and straightforward type casting.
- Clear, readable exceptions with code snippets and line numbers.
- Acting as a near drop-in replacement for pyyaml, ruamel.yaml or poyo.
- Ability to read in YAML, make changes and write it out again with comments preserved.
- Not speed, currently.
Simple example, with the YAML below held in a string named yaml_snippet:
# All about the character
name: Ford Prefect
age: 42
possessions:
- Towel
from strictyaml import load, Map, Str, Int, Seq, YAMLError
Default parse result:
>>> load(yaml_snippet)
YAML({'name': 'Ford Prefect', 'age': '42', 'possessions': ['Towel']})
All data is string, list or OrderedDict:
>>> load(yaml_snippet).data
{'name': 'Ford Prefect', 'age': '42', 'possessions': ['Towel']}
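Leaving every scalar as a string until a schema says otherwise is what avoids the Norway problem mentioned in the priorities. The sketch below uses only the standard library and does not call StrictYAML. The helpers implicit_scalar and strict_scalar are hypothetical names: the first roughly emulates YAML 1.1-style implicit boolean typing (simplified by lowercasing, whereas the real rule accepts only specific capitalisations), the second the string-only default.

```python
# Plain scalars that YAML 1.1-style implicit typing treats as booleans.
# Simplified: compared case-insensitively for the sake of the sketch.
IMPLICIT_TRUE = {"y", "yes", "true", "on"}
IMPLICIT_FALSE = {"n", "no", "false", "off"}


def implicit_scalar(text):
    """Emulate implicit typing: some plain strings silently become booleans."""
    lowered = text.lower()
    if lowered in IMPLICIT_TRUE:
        return True
    if lowered in IMPLICIT_FALSE:
        return False
    return text


def strict_scalar(text):
    """Emulate the schema-less default: every scalar stays a string."""
    return text


country_codes = ["GB", "SE", "NO"]

# With implicit typing, Norway's country code turns into a boolean.
implicit = [implicit_scalar(code) for code in country_codes]

# With string-only parsing, the list survives intact.
strict = [strict_scalar(code) for code in country_codes]

print(implicit)
print(strict)
```

Under implicit typing the only fix is to quote the value in the YAML itself, which is easy to forget; with a schema the type is decided in one place instead.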
Quickstart with schema:
from strictyaml import load, Map, Str, Int, Seq, YAMLError
schema = Map({"name": Str(), "age": Int(), "possessions": Seq(Str())})
42 is now parsed as an integer:
>>> person = load(yaml_snippet, schema)
>>> person.data
{'name': 'Ford Prefect', 'age': 42, 'possessions': ['Towel']}
A YAMLError will be raised if there are syntactic problems, violations of your schema or use of disallowed YAML features:
# All about the character
name: Ford Prefect
age: 42
For example, a schema violation:
try:
    person = load(yaml_snippet, schema)
except YAMLError as error:
    print(error)
while parsing a mapping
  in "<unicode string>", line 1, column 1:
    # All about the character
     ^ (line: 1)
required key(s) 'possessions' not found
  in "<unicode string>", line 3, column 1:
    age: '42'
    ^ (line: 3)
If parsed correctly:
from strictyaml import load, Map, Str, Int, Seq, YAMLError, as_document
schema = Map({"name": Str(), "age": Int(), "possessions": Seq(Str())})
You can modify values and write out the YAML with comments preserved:
person = load(yaml_snippet, schema)
person['age'] = 43
print(person.as_yaml())
# All about the character
name: Ford Prefect
age: 43
possessions:
- Towel
As well as look up line numbers:
>>> person = load(yaml_snippet, schema)
>>> person['possessions'][0].start_line
5
And construct YAML documents from dicts or lists:
print(as_document({"x": 1}).as_yaml())
x: 1
Install
$ pip install strictyaml
Why StrictYAML?
There are a number of formats and approaches that can achieve more or less the same purpose as StrictYAML. I've tried to make it the best one. Below is a series of documented justifications:
- Why not use JSON Schema for validation?
- What is wrong with TOML?
- Why shouldn't I just use Python code for configuration?
- Why not JSON5?
- Why not JSON for simple configuration files?
- Why avoid using environment variables as configuration?
- Why not use XML for configuration or DSLs?
- Why not use INI files?
- Why not use the YAML 1.2 standard? - we don't need a new standard!
- Why not use Python's schema library (or similar) for validation?
- Why not HOCON?
- Why not use SDLang?
- Why not use kwalify with standard YAML to validate my YAML?
Using StrictYAML
How to:
- Merge YAML documents
- Build a YAML document from scratch in code
- Reading in YAML, editing it and writing it back out
- Get line numbers of YAML elements
- Either/or schema validation of different, equally valid kinds of YAML
- Labeling exceptions
- Parsing YAML without a schema
- Revalidate an already validated document
Compound validators:
- Using a YAML object of a parsed mapping
- Mapping with defined keys and a custom key validator (Map)
- Mappings with defined keys (Map)
- Updating document with a schema
- Validating optional keys in mappings (Map)
- Mappings with arbitrary key names (MapPattern)
- Optional keys with defaults (Map/Optional)
- Sequence/list validator (Seq)
- Sequences of unique items (UniqueSeq)
- Fixed length sequences (FixedSeq)
Scalar validators:
- Empty key validation
- Datetimes (Datetime)
- Floating point numbers (Float)
- Email and URL validators
- Parsing comma separated items (CommaSeparated)
- Integers (Int)
- Decimal numbers (Decimal)
- Boolean (Bool)
- Validating strings with regexes (Regex)
- Parsing strings (Str)
- Enumerated scalars (Enum)
Restrictions:
Design justifications
There are some design decisions in StrictYAML which are controversial and/or not obvious. Those are documented here:
- Why is parsing speed not a high priority for StrictYAML?
- What is syntax typing?
- What is wrong with node anchors and references?
- What is wrong with duplicate keys?
- What is wrong with explicit tags?
- The Norway Problem - why StrictYAML refuses to do implicit typing and so should you
- Why does StrictYAML not parse direct representations of Python objects?
- Why does StrictYAML only parse from strings and not files?
- Why does StrictYAML make you define a schema in Python - a Turing-complete language?
- What is wrong with flow-style YAML?
Star Contributors
- @wwoods
- @chrisburr
Contributors
- @WaltWoods
- @ChristopherGS
- @gvx
- @AlexandreDecan
- @lots0logs
- @tobbez
- @jaredsampson
- @BoboTIG
Contributing
- Before writing any code, please read the tutorial on contributing to hitchdev libraries.
- If you're proposing a new feature, please raise it on GitHub before writing any code. If it's an existing feature or bug, please comment and briefly describe how you're going to implement it.
- All code needs to be accompanied by a story that exercises it, or by a modification to an existing story. This is used both to test the code and to build the documentation.
Download files
Source Distribution
File details
Details for the file strictyaml-1.3.2.tar.gz.
File metadata
- Download URL: strictyaml-1.3.2.tar.gz
- Upload date:
- Size: 50.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.1 requests/2.25.0 setuptools/44.0.0 requests-toolbelt/0.9.1 tqdm/4.54.0 CPython/3.8.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 637399fd80dccc95f5287b2606b74098b23c08031b7ec33c5afb314ccbf10948 |
| MD5 | 5f9a7bb78f3ce2300183412346a38812 |
| BLAKE2b-256 | 871e7991c54a2f404885eb9c483e981adfdc94fd540a9b2915e705aabf74ceea |