JSON Tools Library
Project description
Index
Infer Schema
With function infer_schema
, you can get schema from json object. The output complys to JSON Schema Specification. Here is an example:
#!/usr/bin/env python
# -*- coding: UTF-8 -*-
from json_tools2 import infer_schema
import json
def main():
schema = infer_schema({"foo": 1})
print(json.dumps(schema, indent=4))
if __name__ == '__main__':
main()
Output
{
"type": "object",
"properties": {
"foo": {
"type": "integer"
}
}
}
When you have an array of element from different types, or object with the same property but from different type, we need to merge schema.
case 1: 2.1
and 2
are of different types and we need to merge schema
[2.1, 2]
case 2: 2.1
and 2
are of different types and we need to merge schema
[{"foo": 2.1}, {"foo": 2}]
Following rules applies when we do schema merging:
- For
null
, any type can merged to it and return anullable type
, for example,sting
merge tonull
and return anullable string
- For
string
, onlynull
can merged tostring
and returnnullable string
. - For
number
, followings are the only valid type that can merge to itinteger
can merged to it without changing itnull
can merge to it and becomenullable number
- For
integer
, followings are the only valid type that can merge to itnumber
can merged to it and upgrade it tonumber
null
can merge to it and becomenullable number
- For
boolean
, followings are the only valid type that can merge to itnull
can merge to it and becomenullable boolean
- For
object
, followings are the only valid type that can merge to itnull
can merge to it and becomenullable object
- For
array
, followings are the only valid type that can merge to itnull
can merge to it and becomenullable array
We use anyOf
to represent nullable type
, for example:
This represent a nullable integer
type.
{
"anyOf": [{"type": "integer"},{"type": "null"}]
}
For any schema inferred, you should be able to validate the payload with the schema inferred, here is an example:
#!/usr/bin/env python
# -*- coding: UTF-8 -*-
# You need to do "pip install jsonschema"
from json_tools2 import infer_schema
from jsonschema import validate
import json
def main():
payload = {"foo": 1}
schema = infer_schema(payload)
validate(instance=payload, schema=schema) # it should not throw exception
if __name__ == '__main__':
main()
Schema evolvement
Think about such case:
- You receive json object every day.
- Every day, you want to load schema inferred from the json object from the prior day, and merge it with the schema inferred from today's object, and then save the inferred schema.
Here is an example:
#!/usr/bin/env python
# -*- coding: UTF-8 -*-
from json_tools2 import get_schema, ValueSchema
import json
def main():
schema = get_schema(1)
print(json.dumps(schema.to_json(), indent=4))
saved_schema = schema.dump_json()
# You got a json object saved_schema which you can save the inferred schema
# load schema
schema = ValueSchema.load_json(saved_schema)
schema.merge_from(get_schema(None))
print(json.dumps(schema.to_json(), indent=4))
if __name__ == '__main__':
main()
Output
{
"type": "integer"
}
{
"anyOf": [
{
"type": "integer"
},
{
"type": "null"
}
]
}
JSON Transform
json_remove_field
json_remove_field(obj:Any, field_name:str) -> Any:
It removes field specified in field_name
recursively.
Example:
json_remove_field({
"foo": 1,
"bar": {"foo": 1, "t": 2}
}, "foo")
it will remove field "foo" recursively, so it will return
{
"bar": {
"t": 2
}
}
json_traverse_transform
json_traverse_transform(obj: Any, path:str, handler:Callable[[Any], None]) -> Any:
It traverse the input json object obj
, for each object that match the path pattern specified in path
, it calls the function handler
. Eventually obj
is retuened however, it might be mutated by handler
.
Path pattern:
Given object, a path represent a list of object extracted from object.
For example, given object X, here are example pathes:
"foo" represent X.foo
"foo.bar" represent X.foo.bar
"foo[]" represent [X.foo[0], X.foo[1], ...]
"foo[].x" represent [X.foo[0].x, X.foo[1].x, ...]
"foo[].x[]" represent [X.foo[0].x[0], X.foo[0].x[1], ..., X.foo[1].x[0], X.foo[1].x[1], ...]
Example:
def handler(o):
if type(o) != dict:
return o
x = o.get("x", 0)
y = o.get("y", 0)
if type(x) in (int, float) and type(y) in (int, float):
o["sum"] = x+y
json_traverse_transform(
{
"cords": [
{"x": 1, "y": 2},
{"x": 2, "y": 3},
]
}, "cords[]", handler
)
returns
{
"cords": [
{"x": 1, "y": 2, "sum": 3},
{"x": 2, "y": 3, "sum": 5},
]
}
json_convert_to_array
json_convert_to_array(obj:Any, field_name:str, new_field_name:str, path="")->Any:
For each matching child object based on pattern path, it convert it's field field_name
from dictionary into list
, result is stored in field new_field_name
, the original field is either deleted or overritten.
Example:
json_convert_to_array([
{
"q1": {
"score": {"math": 1, "physics": 2}
}
},
{
"q1": {
"score": {"math": 2, "physics": 3}
}
}
], "score", "score2", "[].q1")
returns
[
{
"q1": {
"score2": [
{"key": "math", "value": 1},
{"key": "physics", "value": 2}
]
}
},
{
"q1": {
"score2": [
{"key": "math", "value": 2},
{"key": "physics", "value": 3}
]
}
},
]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file json-tools2-0.0.11.tar.gz
.
File metadata
- Download URL: json-tools2-0.0.11.tar.gz
- Upload date:
- Size: 12.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fdb1e98cc2d39323fab6f032d99480c848c201abe387653b8dbcdefd761e4e76 |
|
MD5 | 679f914b30844527ba90a1d3eba64570 |
|
BLAKE2b-256 | b80e6cf86fe02a23264fe0bef3a222684f424860c0a5c8cb4904708cd94727a6 |
File details
Details for the file json_tools2-0.0.11-py2.py3-none-any.whl
.
File metadata
- Download URL: json_tools2-0.0.11-py2.py3-none-any.whl
- Upload date:
- Size: 11.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c858991923a637416296a880b293503ee8150c0e064cff0a507cde1512725cae |
|
MD5 | 042d48508da431509d8a77d9060238c1 |
|
BLAKE2b-256 | 0ebd0989e3dc185737ce918e55b807ef39df46334f62d643dbb3abc70e61ef28 |