A collection of YAML utilities
Project description
yaml-extras
Some extra scripting utilities written on top of PyYAML.
See the full docs at yaml-extras.pages.dev.
Installation
Install with pip or poetry:
pip install yaml-extras
# or
poetry add yaml-extras
Usage
import yaml
from yaml_extras import ExtrasLoader
import json
with open('example.yml') as f:
data = yaml.load(f, Loader=ExtrasLoader)
print(f"data = {json.dumps(data, indent=2)}")
Features
Modularity with "import"
!import tag: Import another whole YAML file. Supports multiple-import using the "<<" merge key, as well as aliasing the result of an import using an anchor.
Note: There is no safeguard against cyclical imports. If you import a file that imports the original file, it will result in exceeding Python's maximum recursion depth.
Syntax
!import [&anchor ]<filepath>
Examples
Simple whole-file import
# example.yml
my_children:
child1: !import child1.yml
child2: !import child2.yml
# child1.yml
name: child1
age: 10
# child2.yml
name: child2
age: 7
Result when loading in Python:
data = {
"my_children": {
"child1": {
"name": "child1",
"age": 10
},
"child2": {
"name": "child2",
"age": 7
}
}
}
Nested whole-file imports
# example.yml
my_children:
child: !import child.yml
# child.yml
name: child
age: 40
grandchild: !import grandchild.yml
# grandchild.yml
name: grandchild
age: 10
Result when loading in Python:
data = {
"my_children": {
"child": {
"name": "child",
"age": 40,
"grandchild": {
"name": "grandchild",
"age": 10
}
}
}
}
Multiple whole-file imports with merge
# example.yml
all_animals:
<<:
- !import animals/american.yml
- !import animals/eurasian.yml
# animals/american.yml
bear: wild
wolf: wild
fish: domestic
dog: domestic
# animals/eurasian.yml
tiger: wild
lion: wild
cat: domestic
chicken: domestic
Result when loading in Python:
data = {
"all_animals": {
"bear": "wild",
"wolf": "wild",
"fish": "domestic",
"dog": "domestic",
"tiger": "wild",
"lion": "wild",
"cat": "domestic",
"chicken": "domestic"
}
}
Anchored whole-file imports
# example.yml
child: !import &child-anchor child.yml
again:
child: *child-anchor
# child.yml
name: child
age: 10
Result when loading in Python:
data = {
"child": {
"name": "child",
"age": 10
},
"again": {
"child": {
"name": "child",
"age": 10
}
}
}
!import.anchor tag: Import a specific anchor from another YAML file. Supports multiple-import using the "<<" merge key, as well as aliasing the result of an import using an anchor.
Note: There is no safeguard against cyclical imports. If you import an anchor that imports the original file (or anchor you define with this import), it will result in exceeding Python's maximum recursion depth.
Syntax
!import.anchor [&internal_anchor ]<filepath> &<external_anchor>
Examples
Simple anchor import
# example.yml
my_children:
child1: !import.anchor children.yml &child1
child2: !import.anchor children.yml &child2
# children.yml
child1: &child1
name: child1
age: 10
child2: &child2
name: child2
age: 7
Result when loading in Python:
data = {
"my_children": {
"child1": {
"name": "child1",
"age": 10
},
"child2": {
"name": "child2",
"age": 7
}
}
}
Nested anchor imports
# example.yml
my_children:
child: !import.anchor child.yml &child
# child.yml
name: child
age: 40
grandchild: !import.anchor grandchild.yml &grandchild
# grandchild.yml
grandchild: &grandchild
name: grandchild
age: 10
Result when loading in Python:
data = {
"my_children": {
"child": {
"name": "child",
"age": 40,
"grandchild": {
"name": "grandchild",
"age": 10
}
}
}
}
Multiple anchor imports with merge
# example.yml
all_animals:
<<:
- !import.anchor animals.yml &american
- !import.anchor animals.yml &eurasian
# animals.yml
american: &american
bear: wild
wolf: wild
fish: domestic
dog: domestic
eurasian: &eurasian
tiger: wild
lion: wild
cat: domestic
chicken: domestic
Result when loading in Python:
data = {
"all_animals": {
"bear": "wild",
"wolf": "wild",
"fish": "domestic",
"dog": "domestic",
"tiger": "wild",
"lion": "wild",
"cat": "domestic",
"chicken": "domestic"
}
}
Anchored anchor imports
# example.yml
child: !import.anchor &my-child child.yml &child-anchor
again:
child: *my-child
# child.yml
child: &child-anchor
name: child
age: 10
Result when loading in Python:
data = {
"child": {
"name": "child",
"age": 10
},
"again": {
"child": {
"name": "child",
"age": 10
}
}
}
!import-all tag: Import a glob pattern of YAML files as a sequence. Supports merging the imports using the "<<" merge key, as well as aliasing the result of an import using an anchor.
The glob pattern system only supports two types of wildcards: * and **. * matches any character except for /, while ** matches any character including /.
Note: There is no safeguard against cyclical imports. If you import a file that imports the original file, it will result in exceeding Python's maximum recursion depth.
Syntax
!import-all [&anchor ]<glob_pattern>
Examples
Simple (*) sequence import
# example.yml
all_children: !import-all children/*.yml
# children/alice.yml
name: alice
age: 10
height: 1.2
#children/bob.yml
name: bob
age: 7
height: 1.0
Result when loading in Python:
data = {
"all_children": [
{
"name": "alice",
"age": 10,
"height": 1.2
},
{
"name": "bob",
"age": 7,
"height": 1.0
}
]
}
Nested (*) sequence imports
# example.yml
all_children: !import-all children/*.yml
# children/bob.yml
name: bob
age: 28
# children/alice.yml
name: alice
age: 40
children: !import-all children/alice/*.yml
# children/alice/fred.yml
name: fred
age: 10
# children/alice/jane.yml
name: jane
age: 7
Result when loading in Python:
data = {
"all_children": [
{
"name": "bob",
"age": 28
},
{
"name": "alice",
"age": 40,
"children": [
{
"name": "fred",
"age": 10
},
{
"name": "jane",
"age": 7
}
]
}
]
}
Multiple (*) sequence imports with merge
# example.yml
all_animals:
<<:
- !import-all animals/american/*.yml
- !import-all animals/eurasian/*.yml
# animals/american/bear.yml
bear:
type: wild
size: large
# animals/american/wolf.yml
wolf:
type: wild
size: medium
# animals/eurasian/tiger.yml
tiger:
type: wild
size: large
# animals/eurasian/lion.yml
lion:
type: wild
size: large
Result when loading in Python:
data = {
"all_animals": {
"bear": {
"type": "wild",
"size": "large"
},
"wolf": {
"type": "wild",
"size": "medium"
},
"tiger": {
"type": "wild",
"size": "large"
},
"lion": {
"type": "wild",
"size": "large"
}
}
}
Anchored (*) sequence imports
# example.yml
data: !import-all &my-children children/*.yml
again:
data: *my-children
# children/alice.yml
name: alice
age: 10
height: 1.2
# children/bob.yml
name: bob
age: 7
height: 1.0
Result when loading in Python:
data = {
"data": [
{
"name": "alice",
"age": 10,
"height": 1.2
},
{
"name": "bob",
"age": 7,
"height": 1.0
}
],
"again": {
"data": [
{
"name": "alice",
"age": 10,
"height": 1.2
},
{
"name": "bob",
"age": 7,
"height": 1.0
}
]
}
}
Simple (**) sequence import
# example.yml
overarching: !import-all subfolders/**/*.yml
# subfolders/child1.yml
name: child1
# subfolders/child2.yml
name: child2
# subfolders/subfolder1/grandchild1.yml
name: grandchild1
Result when loading in Python:
data = {
"overarching": [
{
"name": "child1"
},
{
"name": "child2"
},
{
"name": "grandchild1"
}
]
}
!import-all-parameterized tag: Import a glob pattern of YAML files as a sequence, enriching the results with one or more metadata keys extracted from globs in the filepath. Supports merging the imports using the "<<" merge key, as well as aliasing the result of an import using an anchor.
The glob pattern system only supports two types of wildcards: * and **. * matches any character except for /, while ** matches any character including /. Metadata keys can be extracted from zero or more globs in the path specification with syntax like this:
# Import all YAML files in the `path/to/*.yml` glob as a sequence, attaching to each element the
# `basename` key extracted from the filename.
my_data: !import-all-parameterized path/to/{basename:*}.yml
#
# my_data:
# - basename: file1
# key1: value1
# - basename: file2
# key2: value2
# key3: value3
#
# Import all YAML files in the `path/to/**/meta.yml` glob as a sequence, attaching to each
# element the `subdirs` key extracted from the subdirectory structure.
my_subdirs: !import-all-parameterized path/to/{subdirs:**}/meta.yml
#
# my_subdirs:
# - subdirs: subdir1
# key1: value1
# - subdirs: subdir1/subdir2/subdir3
# key2: value2
# key3: value3
#
Note (i): There is no safeguard against cyclical imports. If you import a file that imports the original file, it will result in exceeding Python's maximum recursion depth.
Note (ii): When the leaf files of an import contain mappings, then it is simple to "merge" the metadata keys from the path into the resulting imported mappings. However, when the leaf files are scalars or sequences, then the structure of the import results are slightly more contrived. The contents of the imports will be under a
contentkey in each result, with the metadata keys extracted from the path added as additional key/value pairs in the mappings.
Syntax
!import-all-parameterized [&anchor ]<glob_pattern>
Examples
Simple parameterized import (*) with metadata
# example.yml
grade_book: !import-all schools/{school_name:*}/grades/{student_name:*}.yml
# schools/elementary/grades/David.yml
math: 95
science: 90
english: 80
# schools/elementary/grades/Edward.yml
math: 100
science: 90
english: 100
# schools/highschool/grades/Frank.yml
math: 85
science: 95
english: 90
Result when loading in Python:
data = {
"grade_book": [
{
"school_name": "elementary",
"student_name": "David",
"math": 95,
"science": 90,
"english": 80
},
{
"school_name": "elementary",
"student_name": "Edward",
"math": 100,
"science": 90,
"english": 100
},
{
"school_name": "highschool",
"student_name": "Frank",
"math": 85,
"science": 95,
"english": 90
}
]
}
Simple parameterized import (**) with metadata
# example.yml
translations: !import-all-parameterized words/{langspec:**}/words.yml
# words/en/us/words.yml
- hello
- goodbye
- color
- thanks
# words/en/uk/words.yml
- good morrow
- toodle-oo
- colour
- cheers
# words/es/mx/words.yml
- hola
- adios
- color
- gracias
Result when loading in Python:
data = {
"translations": [
{
"langspec": "en/us",
"content": ["hello", "goodbye", "color", "thanks"]
},
{
"langspec": "en/uk",
"content": ["good morrow", "toodle-oo", "colour", "cheers"]
},
{
"langspec": "es/mx",
"content": ["hola", "adios", "color", "gracias"]
}
]
}
Customizing the import directory
By default, !import tags will search relative to the current working directory of the Python process. You can customize the base directory for imports by calling yaml_import.set_import_relative_dir(...) with the desired base directory.
import yaml
from yaml_extras import ExtrasLoader, yaml_import
yaml_import.set_import_relative_dir('/path/to/imports')
data = yaml.load('!import somefile.yml', Loader=ExtrasLoader)
Roadmap
P1
- Add support for
!importto import other whole documents into a YAML document (general import). - Add support for
!import.anchorto import specific anchors from other YAML documents (targeted import). - Add support for
!import-allto import a glob pattern of YAML files as a sequence. - Add support for
!import-all.anchorto import a specific anchor from a glob pattern of YAML files as a sequence. - Add support for
!import-all-parameterizedto import a glob pattern of YAML files as a sequence with some data extracted from the filepath. - Add support for
!import-all-parameterized.anchorto import a specific anchor from a glob pattern of YAML files as a sequence with some data extracted from the filepath. - Allow user to set relative import directory.
P2
- Implement type specification system to validate YAML files against a schema using a
!yamlschematag system which mimics JSON Schema semantics and are validated upon construction. - Add support for
!envtag to import environment variables.
P3
- VSCode / Intellisense plugin to navigate through imports using cmd + click
Acknowledgements
- David Sillman dsillman2000@gmail.com
- pyyaml-include author, @tanbro.
- PyYAML
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file yaml_extras-0.2.2.tar.gz.
File metadata
- Download URL: yaml_extras-0.2.2.tar.gz
- Upload date:
- Size: 14.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
61bda0a376a34f398f39818d51d3e85a1ea3ba88c5e43122e075d29971d75dd1
|
|
| MD5 |
adb9f5824d90e09bfbefb3b0b8f11851
|
|
| BLAKE2b-256 |
d025d401f4e6cea220d451b5998c6a0a82d8bc698c07af5fab330a33196d90a1
|
Provenance
The following attestation bundles were made for yaml_extras-0.2.2.tar.gz:
Publisher:
python-publish.yml on dsillman2000/yaml-extras
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
yaml_extras-0.2.2.tar.gz -
Subject digest:
61bda0a376a34f398f39818d51d3e85a1ea3ba88c5e43122e075d29971d75dd1 - Sigstore transparency entry: 160725976
- Sigstore integration time:
-
Permalink:
dsillman2000/yaml-extras@ed65538437c9af2a0abf6f28e237e5b6bfd997a5 -
Branch / Tag:
refs/tags/0.2.2 - Owner: https://github.com/dsillman2000
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@ed65538437c9af2a0abf6f28e237e5b6bfd997a5 -
Trigger Event:
release
-
Statement type:
File details
Details for the file yaml_extras-0.2.2-py3-none-any.whl.
File metadata
- Download URL: yaml_extras-0.2.2-py3-none-any.whl
- Upload date:
- Size: 12.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
59443fbed416841c07281c73882d2bdb2737004d7e892057a743f3ea18725bf9
|
|
| MD5 |
1ae5652e0d7a77e25cb441521dca0bb7
|
|
| BLAKE2b-256 |
3b34ad825f67ae4e1a5ea7c5b9a94003b275cee3de3453ce4b60c97a719f7988
|
Provenance
The following attestation bundles were made for yaml_extras-0.2.2-py3-none-any.whl:
Publisher:
python-publish.yml on dsillman2000/yaml-extras
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
yaml_extras-0.2.2-py3-none-any.whl -
Subject digest:
59443fbed416841c07281c73882d2bdb2737004d7e892057a743f3ea18725bf9 - Sigstore transparency entry: 160725978
- Sigstore integration time:
-
Permalink:
dsillman2000/yaml-extras@ed65538437c9af2a0abf6f28e237e5b6bfd997a5 -
Branch / Tag:
refs/tags/0.2.2 - Owner: https://github.com/dsillman2000
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@ed65538437c9af2a0abf6f28e237e5b6bfd997a5 -
Trigger Event:
release
-
Statement type: