Spawn MongoDB resources from JSON Schema
Project description
mongospawn
is a tool to help spawn MongoDB resources given JSON Schema
specifications.
The primary near-term use case is support for the National Microbiome Data
Collaborative (NMDC) pilot project. In particular,
given a JSON Schema with all array-typed properties and with each array item a
$ref
reference to one of the JSON Schema definitions
(see NMDC
example),
mongospawn
can generate MongoDB $jsonSchema
documents to apply as validators
for collections in a database that correspond to each of the original JSON
Schema's array-typed properties. MongoDB's implementation of JSON Schema does
not support $ref
, definitions
, etc., so mongospawn
expands references to
generate appropriate per-collection schema documents.
In addition to generating derived schema documents, mongospawn
can spawn new
databases/collections, with schema validation set, via the pymongo
driver, and
can also manage access to the spawned resources via mongogrant
.
Setup
For development:
pip install -e .[dev]
To update dependency versions:
make update
To use pinned dependencies for reproducible testing:
make
Usage
Example using NMDC's JSON Schema:
from mongospawn.schema import dbschema_from_file, collschemas_for
from pymongo import MongoClient
client = MongoClient()
db = client.nmdc_test
dbschema = dbschema_from_file("nmdc.schema.json")
collschemas = collschemas_for(dbschema)
for name in collschemas:
db.drop_collection(name)
db.create_collection(name, validator={"$jsonSchema": collschemas[name]})
print(f"created {name} collection")
# created activity_set collection
# created biosample_set collection
# created data_object_set collection
# created omics_processing_set collection
# created study_set collection
Now, e.g. if you try to insert a non-conformant JSON document, a
pymongo.errors.WriteError
will be raised:
db.biosample_set.insert_one({"not_a_real_field": 1})
# => WriteError: Document failed validation...
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for mongospawn-0.5.10-py2-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 54c7478ee8180267050ce095dd4ac7796d717028a01ce07e1300bec7a9b5ed98 |
|
MD5 | 2b29b6800a82c09b88d41cea80bab9b4 |
|
BLAKE2b-256 | 534255b275879e9aa09c5b4e008b7bf5fdd00bfe640510521f42cec7af480e9a |