
YAML document splitter, with an iterator that returns each document, its starting line number, and optionally the loaded data

Project description


This package provides a YAML document splitter that lets you iterate over a UTF-8 encoded file containing YAML documents. For each document it returns the raw document, its starting line number, and optionally the loaded data.

Compared to load_all(), this has the advantage that you can skip individual documents that you know will not load, or test and transform a document before loading it properly. For example, R Markdown files have a YAML header followed by a non-YAML body. (It would have been much better if, in R Markdown, the header had been followed by --- | instead of only a directives-end-marker (---); that way any compliant YAML parser could load both.)
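The R Markdown case can be illustrated with a small, stdlib-only sketch. It does not use this package: split_front_matter is a hypothetical helper that assumes the header starts at a first `---` line and ends just before the next `---` line. It only shows why the body must be kept away from a YAML loader.

```python
from typing import Tuple


def split_front_matter(text: str) -> Tuple[str, str]:
    """Return (yaml_header, body) for a file that opens with a `---` line.

    Hypothetical helper: the header runs from the first `---` line up to,
    but not including, the next `---` line; everything after is the body.
    """
    lines = text.splitlines(keepends=True)
    if not lines or lines[0].rstrip() != '---':
        return '', text  # no header at all
    for i in range(1, len(lines)):
        if lines[i].rstrip() == '---':
            return ''.join(lines[:i]), ''.join(lines[i + 1:])
    return text, ''  # header never closed: treat everything as YAML


rmd = "---\ntitle: Report\n---\n# Heading\nSome *text*: not YAML\n"
header, body = split_front_matter(rmd)
```

Only header would be handed to a YAML loader; body stays untouched as plain text.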

You can use the following to get to each document:

from pathlib import Path
from ruamel.yaml.split import split

for doc, line_nr in split(Path('input.yaml')):
    print(doc.decode('utf-8'), line_nr)

The line_nr indicates the starting line of the document and can be used as an offset for e.g. errors that are generated when parsing the document.
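To make the returned pairs and the offset idea concrete, here is a minimal, stdlib-only sketch. It is not the package's implementation: simple_split and absolute_line are hypothetical names, the sketch splits only after a line starting with the document-end-marker (...), and it assumes 1-based line numbers, which may differ from what split() reports.

```python
from typing import Iterator, Tuple


def simple_split(data: bytes) -> Iterator[Tuple[bytes, int]]:
    """Yield (document, start_line) pairs from a byte stream.

    Simplified stand-in: a document ends at the line-ending after a line
    that starts with `...`; line numbers are assumed to be 1-based.
    """
    buf = []   # lines of the document being collected
    start = 1  # line number on which the current document starts
    for nr, line in enumerate(data.splitlines(keepends=True), start=1):
        buf.append(line)
        # `...` must be followed by whitespace, a line-ending, or end of input
        if line.startswith(b'...') and line[3:4] in (b'', b' ', b'\t', b'\n', b'\r'):
            yield b''.join(buf), start
            buf, start = [], nr + 1
    if buf:
        yield b''.join(buf), start


def absolute_line(start_line: int, relative_line: int) -> int:
    """Map a 1-based line inside a document to a 1-based line in the file."""
    return start_line + relative_line - 1


stream = b"a: 1\n... # end of first\n---\nb: 2\n"
docs = list(simple_split(stream))
```

An error reported on line 2 of the second document then maps to absolute_line(docs[1][1], 2), the line holding `b: 2` in the whole stream.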

You can also provide a YAML() instance to get the document loaded for you:

import ruamel.yaml

for doc, data, line_nr in split(Path('input.yaml'), yaml=ruamel.yaml.YAML()):
    print(doc.decode('utf-8'), data, line_nr)

The YAML() instance you provide is used to load all documents.

By default split() splits on the line-ending after the document-end-marker (...), so that any comment on the line of the document-end-marker is part of the document before it. You can influence where the comments “between” documents are split by passing one of the constants from ruamel.yaml.split to the cmnt parameter of split(). C_PRE adds any such comments to the preceding document, C_POST adds them to the following document. C_SPLIT_ON_FIRST_BLANK splits after the first blank line, so that both documents get part of the comments.

from pathlib import Path
from ruamel.yaml.split import split, C_SPLIT_ON_FIRST_BLANK

# without a yaml= argument, split() yields (document, line number) pairs
for doc, line_nr in split(Path('input.yaml'), cmnt=C_SPLIT_ON_FIRST_BLANK):
    print(doc.decode('utf-8'), line_nr)

Comments at the end of the document stream are all attached to the last document, independent of the cmnt parameter.
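The three ways of dividing comments between documents can be sketched as follows. This is an illustration of the described behaviour, not the package's code: assign_gap and the mode names are hypothetical, and what happens when no blank line is present is an assumption made for this sketch.

```python
from typing import List, Tuple

# Hypothetical mode names for this sketch; the package exposes its own
# constants (C_PRE, C_POST, C_SPLIT_ON_FIRST_BLANK) instead.
PRE, POST, FIRST_BLANK = 'pre', 'post', 'first_blank'


def assign_gap(gap: List[str], mode: str) -> Tuple[List[str], List[str]]:
    """Divide the comment and blank lines found between two documents.

    Returns (lines for the preceding document, lines for the following one).
    """
    if mode == PRE:
        return gap, []
    if mode == POST:
        return [], gap
    if mode == FIRST_BLANK:
        for i, line in enumerate(gap):
            if not line.strip():
                # split directly after the first blank line
                return gap[:i + 1], gap[i + 1:]
        # assumption: without a blank line, keep everything with the
        # preceding document
        return gap, []
    raise ValueError(f'unknown mode: {mode!r}')


gap = ['# trailing note\n', '\n', '# intro to next\n']
before, after = assign_gap(gap, FIRST_BLANK)
```

Here before holds the trailing note and the blank line, and after holds the comment that introduces the next document.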


