One data model, many formats: read, validate, and write JSON, YAML, TOML, XML, and OML.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Omnist

Omnist ("omni-structure") is one canonical data model for JSON, YAML, TOML, XML, and its own native OML (Omnist Markup Language) — read any of them into a single tree, validate it against a schema, compare schema versions, and write it back out to any of the others.

from omnist import parse_schema, doc

s = parse_schema('''
    record Member { "name": string, "role": string }
    record Team   { "name": string, "members" [1,]: Member }
    root Team
''')

s.validate(doc({"name": "Platform",
                "members": [{"name": "Ann", "role": "dev"}]})).ok    # True

Why Omnist

Three things you can do with Omnist that are genuinely hard anywhere else:

Gate schema changes in CI. v1.compatible_with(v2) answers "does every document the old schema accepts still validate under the new one?" with a decidable yes or no — an algorithm with a proof behind it, not a heuristic or a sampled guess. Put it in a pipeline and schema evolution stops being a code-review judgment call.
One model across five formats. Read JSON, YAML, TOML, XML, or OML into the same tree, validate it against the same schema, write it back out to any of the others — one library, one mental model, and a written report of anything a lossy format had to adjust.
Cut a big schema down to what one service needs. schema.extract(...) computes the minimal subschema recognizing only the labels you keep — the operation that trimmed industrial XML schemas to a fraction of their size in the research this library grew from.

What makes those possible is a deliberately small formal model (inspired by Lee & Cheung, CIKM 2010):

A Document is a tree — an ordered list of labeled edges. Arrays are just repeated labels, so the same Document represents JSON, YAML, TOML, XML (including its interleaved repeated elements), and OML — Omnist's own format, the only one with zero loss in either direction.
A Schema is named record definitions (closed named fields, each with a cardinality), where every field's type is always exactly one fixed scalar (optionally nullable) or one Ref to a named record — referenced by name for reuse and recursion. Written as OSD (Omnist Schema Definition).
Closed by construction — records are closed, and scalar types are never composed into enums or unions. That is not a constraint bolted on top; it is what makes compatible_with, equivalent, normalize, extract, and infer well-defined, decidable operations instead of best-effort heuristics. See why Omnist for the case in full — including a verified capability matrix and the honest non-goals.

Boringly correct

The claims above are checked, not asserted: 100% line coverage, a mypy --strict CI gate, property-based fuzzing, formal ABNF grammars verified against the parsers, every documentation example executed in CI, and the schema algebra verified three independent ways — bidirectional inclusion, the paper's minimize-and-isomorphism theorem, and brute-force enumeration against set-theoretic ground truth. Performance is measured and published, not implied: a 100k-edge document validates in about a third of a second (the numbers). See testing for how the suite is built.

The model is defined formally in docs/design/model.md; see the quickstart for the shortest possible example, or the user guide for the practical tour.

A 60-second tour

from omnist import Doc, parse_schema, infer, doc, read_json

# OML is omnist's own format -- see docs/formats/oml.md
Doc.from_oml('id: 1\ntags: "a"\ntags: "b"\n').to_oml()

# converting from other formats is just read one, write another
Doc.from_json('{"id": 1, "tags": ["a", "b"]}').to_yaml()

# describe a shape and check data against it; errors carry exact paths
s = parse_schema('record R { "id": integer, "tags" [0,]: string }\nroot R')
print(s.validate(doc({"id": "x", "tags": ["a"]})))
#   invalid:
#     at $.id: expected integer, got string ('x')

# learn a schema from examples
print(infer([doc({"id": 1, "tags": ["a"]})]).to_osd())
#   record Root {
#       "id": integer,
#       "tags": string,
#   }
#   root Root

# is a schema change backward-compatible? (operations are Schema methods)
v1 = parse_schema('record R { "host": string }\nroot R')
v2 = parse_schema('record R { "host": string, "port" [0,1]: integer }\nroot R')
v1.compatible_with(v2)        # True -- adding an optional field is safe

# schema-directed deserialization: upgrade leaves to match the schema
s2 = parse_schema('record R { "d": date }\nroot R')
read_json('{"d": "2024-01-01"}', schema=s2)   # [('d', datetime.date(2024, 1, 1))]

Run the full demo: python3 examples/canonical_model.py.

Installation

Requires Python 3.11+ (it uses the standard-library tomllib). The core and JSON support have no dependencies.

pip install omnist                      # core + JSON
pip install omnist[all]                 # + pyyaml, tomli_w, defusedxml

Installing also provides an omnist CLI command; see the CLI docs.

Or from a checkout:

git clone https://github.com/omnist-dev/omnist.git
cd omnist
python3 -m venv .venv && source .venv/bin/activate
pip install .                    # core + JSON
pip install pyyaml tomli_w defusedxml   # YAML / writing TOML / hardened XML

Documentation

Full index: docs/, also browsable as a site at omnist.dev.

Quickstart — the shortest possible example: one OML snippet, one schema, validate(), infer().
Why Omnist — the differentiation case: a falsifiable thesis, a verified capability matrix against JSON/YAML/TOML/XML, a worked compatible_with comparison against jsonschema, and the honest non-goals (including XML attribute/namespace dropping).
User guide — the practical tour: documents, OML (the native format), OSD, the Python builder, validation, operations, other codecs, inference.
OML — Omnist's own format, designed alongside the model so every Document round-trips with zero adjustments, and how it maps onto the Python Document.
The Schema model & OSD — Omnist's other central feature: record definitions, cardinality, the Python builder, and the comparison/inference operations.
API reference — every public name, with signatures.
CLI — the omnist command-line tool.
Schema-directed deserialization — what changes (and what doesn't) about a Document's Python types when a schema is, vs. isn't, passed to a reader.
A real-life example — one order schema validated against documents in JSON, YAML, TOML, and XML, plus a compatibility check.
Formats — how each format maps to the model and its caveats (OML · JSON · YAML · TOML · XML).
Model spec — the formal Document and Schema models, self-contained and plain (no paper required).
Glossary — one definition per term used across the docs and code, grouped by concept area.
Testing — the test suite layout, coverage tooling and target, the fuzzing approach, and what CI runs.
Repo layout — how the repo itself is organized: omnist/*.py module responsibilities, the docs page map, and the test file map.

Status

Omnist is alpha, built around a small, self-contained formalism; the public API may still change before a stable release. See the PyPI badge above (or CHANGELOG.md) for the current version.

Feedback and bug reports welcome: https://github.com/omnist-dev/omnist/issues. See SECURITY.md for the trust model if you parse untrusted input.

License

Apache-2.0 — see LICENSE and NOTICE.

Background

The model is inspired by Lee & Cheung, "XML Schema Computations: Schema Compatibility Testing and Subschema Extraction" (CIKM 2010), simplified for the JSON family of formats. You don't need the paper to use Omnist — the model spec is self-contained.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

tom.lee

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.0

Jul 3, 2026

0.2.28

Jul 3, 2026

0.2.27

Jul 3, 2026

0.2.26

Jul 3, 2026

0.2.25

Jul 3, 2026

0.2.24

Jul 2, 2026

0.2.23

Jul 2, 2026

0.2.22

Jul 2, 2026

0.2.21

Jul 2, 2026

0.2.20

Jul 2, 2026

0.2.19

Jul 2, 2026

0.2.18

Jul 2, 2026

0.2.17

Jul 2, 2026

0.2.16

Jul 2, 2026

0.2.15

Jul 1, 2026

0.2.14

Jul 1, 2026

0.2.13

Jul 1, 2026

0.2.12

Jun 30, 2026

0.2.11

Jun 26, 2026

0.2.9

Jun 26, 2026

0.2.2

Jun 25, 2026

0.2.1

Jun 24, 2026

0.2.0

Jun 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omnist-0.3.0.tar.gz (112.8 kB view details)

Uploaded Jul 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

omnist-0.3.0-py3-none-any.whl (64.7 kB view details)

Uploaded Jul 3, 2026 Python 3

File details

Details for the file omnist-0.3.0.tar.gz.

File metadata

Download URL: omnist-0.3.0.tar.gz
Upload date: Jul 3, 2026
Size: 112.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omnist-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`18624c8639a8849102b13ae0a3ddb320a38dc8a94692dc4b605e28f98919849b`
MD5	`e8ae040506ca54173d1f3c771d2aca2d`
BLAKE2b-256	`8a960b798367e4eb8e9e357e97bf3246f9ac8642b182bae85db6909a4348706e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for omnist-0.3.0.tar.gz:

Publisher: publish.yml on omnist-dev/omnist

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: omnist-0.3.0.tar.gz
- Subject digest: 18624c8639a8849102b13ae0a3ddb320a38dc8a94692dc4b605e28f98919849b
- Sigstore transparency entry: 2063831902
- Sigstore integration time: Jul 3, 2026
Source repository:
- Permalink: omnist-dev/omnist@1eceab2fcf99e948d376937bd1c6294ab938ac03
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/omnist-dev
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@1eceab2fcf99e948d376937bd1c6294ab938ac03
- Trigger Event: push

File details

Details for the file omnist-0.3.0-py3-none-any.whl.

File metadata

Download URL: omnist-0.3.0-py3-none-any.whl
Upload date: Jul 3, 2026
Size: 64.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omnist-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3c8386640aa72f8594266cec2822d0daf4884e9effd0af72d8ad260ae71aeacb`
MD5	`85f5b9f10e7ae895dc55b327ba93302b`
BLAKE2b-256	`e99f0e4b64cebce7067ef3e6fea6d91fc6bdaf21b161d0671a668d5add88151d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for omnist-0.3.0-py3-none-any.whl:

Publisher: publish.yml on omnist-dev/omnist

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: omnist-0.3.0-py3-none-any.whl
- Subject digest: 3c8386640aa72f8594266cec2822d0daf4884e9effd0af72d8ad260ae71aeacb
- Sigstore transparency entry: 2063831985
- Sigstore integration time: Jul 3, 2026
Source repository:
- Permalink: omnist-dev/omnist@1eceab2fcf99e948d376937bd1c6294ab938ac03
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/omnist-dev
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@1eceab2fcf99e948d376937bd1c6294ab938ac03
- Trigger Event: push

omnist 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Omnist

Why Omnist

Boringly correct

A 60-second tour

Installation

Documentation

Status

License

Background

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance