Generates an MD5 unique hash of an Json object
Project description
MerkleJson
Generates a unique hash for a json or dict object. In this way you can guarantee the consistency of your json object quickly and easily by storing a unique hash which would act as a certificate signature.
MerkleJson guarantees the unique hash, regardless of the order of the json objects in a list. Also it detects if an merkle hash already exists in the object which avoids re-calculations. Furthermore, you can keep the consistency of the object only partially, by ignoring other irrelevant keys.
Installation
pip install merkle-json
Usage
from merkle_json import MerkleJson
#init the object
mj = MerkleJson()
#... your code here
Serialize JSON object :
Unlike JSON, MerkleJson serializes objects canonically.
obj = {
'keyD': 1,
'keyA': 2,
'keyC': [4,3],
'keyB': 4,
}
print(obj) # '{'keyD': 1, 'keyA': 2, 'keyC': [4, 3], 'keyB': 4}'
mj = MerkleJson()
mj.stringify(obj) # '{"keyA":2,"keyB":4,"keyC":3,"keyD":1}'
Generate unique hash of JSON object :
mj = MerkleJson()
mjHash = mj.hash(obj)
print(mjHash) # '7001bd2b415e6a624a23d7bc7c249b21'
# obj_unordered is the same object than obj however it contains different order of the keys
obj_unordered = {
'keyC': [3,4],
'keyA': 2,
'keyB': 4,
'keyD': 1,
}
mj.hash(obj_unordered) # '7001bd2b415e6a624a23d7bc7c249b21'
As you can see they both return the same merkle hash value.
In python when comparing both objects you will get a False
to obj == obj_unordered
due to the array which has a different order [4,3] != [3,4]
.
If merkleHash
(default key) is present then the calculation will not be performed (by default cached = True
) :
new_obj = {
'key1': 'something',
'key2': 'otherthing'
}
mj = MerkleJson()
mjHash = mj.hash(new_obj, cached = True) # cached = True by default
print(mjHash) # 'ff55be553d7bcfbf6233891554378ddd'
new_obj = {
'key1': 'something',
'key2': 'otherthing',
'merkleHash': 'my_hash',
'merkleHash2' : 'my_valid_hash'
}
mj = MerkleJson(ignoreKeys=['merkleHash2']) #ignore merkleHash2 to be ignored
mjHash = mj.hash(new_obj, cached = False) # cached = True by default
print(mjHash) # 'ff55be553d7bcfbf6233891554378ddd'
mjHash = mj.hash(new_obj) # cached = True by default
print(mjHash) # 'my_hash'
You can get the hash which is stored in a different key by using hashTag
parameter :
mj = MerkleJson(hashTag= 'merkleHash2')
mjHash = mj.hash(new_obj) # cached = True by default but now the value of merkleHash2 will be returned as defined in hashTag parameter
print(mjHash) # 'my_valid_hash'
Ignore specific key in the object with the parameter ignoreKeys
:
This way you can keep the consistency of the object partially.
#let say you want to ignore 'key2'
new_obj = {
'key1': 'something',
'key2': 'otherthing' #ignored
}
obj_without_key2 = {
'key1': 'something'
}
mj = MerkleJson(ignoreKeys = ['key2'])
mjHash = mj.hash(new_obj) #
print(mjHash) # '95f6d042c35c1cedc7e3dea0a255c332'
mjHash2 = mj.hash(obj_without_key2) #
print(mjHash2) # '95f6d042c35c1cedc7e3dea0a255c332'
print(mjHash == mjHash2) # True
Ignore keys with None
values by setting ignoreNulls = True
when calling the hash()
function :
k = [{
'test' : None,
'test2': 1
}]
l = [{
'test2': 1
}]
print(mj.hash(k)) # prints 0cf3a78d7746983efea449bf54fa8664
print(mj.hash(k, ignoreNulls=True)) # prints c9153ef54418f96e7937f25be9a8c27f
print(mj.hash(l)) # prints c9153ef54418f96e7937f25be9a8c27f
License
Credit
The project was inspired by :
- Merkle Tree
- Similar package/project in javascript : merkle-json
Merkle Tree
Merkle Trees, also known as hash trees, are a data structure used to efficiently verify the integrity of large datasets. They were named after Ralph Merkle, who first proposed the idea in a 1979 paper.
A Merkle Tree works by recursively hashing pairs of data until a single hash value, known as the root hash or Merkle root, is produced. Each level of the tree represents a different layer of hashes, with the root hash at the top. The leaf nodes of the tree contain the original data or hashed values.
One of the main advantages of Merkle Trees is their ability to efficiently verify the integrity of large datasets without having to hash the entire dataset. Instead, by hashing only the nodes along the path from the data to the root, a Merkle proof can be generated which proves that the data is part of the larger dataset without revealing any other information about the dataset.
Merkle Trees are commonly used in peer-to-peer networks, such as blockchain technology, to ensure the integrity of the data stored in the network. They are also used in other applications such as file sharing and digital signatures.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file merkle-json-1.0.0.tar.gz
.
File metadata
- Download URL: merkle-json-1.0.0.tar.gz
- Upload date:
- Size: 4.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2deb0e78beeb5cd9b515853bdc82bad341fa92538efe460f0142318dc4c6919d |
|
MD5 | da993a93fb0e58db4306aeda8310df95 |
|
BLAKE2b-256 | f6b00b9f87bc84cf208078764a82d4323937b5b861a5aad6e60319d6771398b3 |
File details
Details for the file merkle_json-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: merkle_json-1.0.0-py3-none-any.whl
- Upload date:
- Size: 5.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 832a626f0049e8de22b3096d6183100887e83bfabf238a64e1107ac1efa8499e |
|
MD5 | 1a1c0826399724b96cc26f086cfb75c7 |
|
BLAKE2b-256 | 09ca6b62db7bb61fef5e84bf8a7d7e6237b3fda24e4dae4839aeb76edcb2f1d3 |