Tools to diff json format data.
Project description
diffjson
Utilities to diff json data. https://nfwprod.github.io/diffjson/
Features
- Enable search like XPATH.
- Enable diff for multi json data.
Branch Class for Search
Branch Class
Convert dict/list json data to Branch class format by 'generate_branch'. Branch class is hierachical class tree.
- RootBranch
- Root for all child branches.
- Provide search and other methods for users.
- DictBranch
- Child branch for dict format child.
- ListBranch
- Child branch for list format child.
- Leaf
- Edge branch for str, bool, int, and float.
- Branch
- Branch common class for inheritate.
Search
Branch class accept search with xpath like strings. (Strongly inspired by josonpath-ng!!)
# Example Data, sample.yaml
branch01:
b01-01: string
b01-02: 1
b01-03: 2.0
b01-04: True
branch02:
b02-01:
- name: n02-01-i01
value: v02-01-i01
- name: n02-01-i02
value:
b02-01-i02-01:
- name: n02-01-i02-01-i01
value: v02-01-i02-01-i01
- name: n02-01-i02-01-i02
value: v02-01-i02-01-i02
- name: n02-01-i02-01-i03
value: v02-01-i02-01-i03
b02-01-i02-02:
name: n02-01-i02-02
value: v02-01-i02-02
- name: n02-01-i03
value:
b02-01-i03-01: v02-01-i03-01
branch03:
b03-01: null
import diffjson
import yaml
with open('sample.yaml'), 'r') as f:
sampledata = yaml.safe_load(f)
# Get dict format data under b01-01
b = diffjson.generate_branch(sampledata)
result = b.search('/branch01/b01-01')
print(result)
> ['string']
Search returns all matched data as List style.
DiffBranch Class for Diff JSON Data
DiffBranch Class
Diff multi json data as Branch instance by 'diff_branch'.
- DiffRootBranch
- Root for all child diff branches.
- DiffCommonBranch
- Child branch for all data format.
- Diff for dict, list and leaf are contained in DiffCommonBranch.Branch.
- DiffBranch
- Branch common class for inheritate.
Diff
diffbranch = diffjson.diff_branch([data01, data02, data03])
# Export diff in csv format
diffbranch.export_csv('./diff.csv')
NodenameMasks Options
Sometimes data are contained in list format and orders are changes randomly. For example,,
Data Before.
- name: id01
value: data01
- name: id02
value: data02
- name: id03
value: data03
Data After.
- name: id01
value: data01
- name: id03
value: data03
- name: id02
value: changed
We want to diff "name: id02" and "name: id02". Don't want to diff second data "name: id02" and "name: id03".
For such case, use mask function.
masks = {'/': lambda x: x['name'], '/branch01': lambda x: x['id']}
diffbranch = diffjson.diff_branch(
[data01, data02, data03],
nodenamemasks=masks)
Nodename masks convert list part to dict part with lambda generated key like follows.
Data Before
id01:
name: id01
value: data01
id02:
name: id02
value: data02
id03:
name: id03
value: data03
Data After
id01:
name: id01
value: data01
id02:
name: id02
value: changed
id03:
name: id03
value: data03
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file diffjson-0.1.1.tar.gz
.
File metadata
- Download URL: diffjson-0.1.1.tar.gz
- Upload date:
- Size: 13.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/45.2.0 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0be0205beaff0c653876c071b466e553dde2c476f7e4a4f4635002fef65f09a |
|
MD5 | 32570e9ef0c2f90e7653e18c22712112 |
|
BLAKE2b-256 | 5e4b2cb70484d71d632cc024505f8431f458ddc975dee8c5e00800108fabe504 |
File details
Details for the file diffjson-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: diffjson-0.1.1-py3-none-any.whl
- Upload date:
- Size: 14.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/45.2.0 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0f4a4799787a1ba1f8394a691cf70d6b7e8a5e703341a34d60b9dbcb41a0ad87 |
|
MD5 | 3062d53d2d56a2b41647a9f4212d6086 |
|
BLAKE2b-256 | d543eccc3e668743ff99b6381fd971854bb1df16e2d649c3b011f6f8f550f5cd |