Map flat data to structured JSON via a mapping.
Project description
# jsonmapping [![Build Status](https://travis-ci.org/pudo/jsonmapping.svg?branch=master)](https://travis-ci.org/pudo/jsonmapping)
To transform flat data structures into nested object graphs matching JSON
schema definitions, this package defines a mapping language. It defines how
the columns of a source data set (e.g. a CSV file, database table) are to be
converted to the fields of a JSON schema.
The format allows mapping nested structures, including arrays. It also supports
the application of very basic data transformation steps, such as generating a
URL slug or hashing a column value.
## Example mapping
The mapping format is independent of any particular JSON schema, such that
multiple mappings could be defined for any one particular schema.
```json
{
"schema": {"$ref": "http://www.popoloproject.com/schemas/person.json"},
"mapping": {
"id": {"column": "person_id"},
"name": {"column": "person_name"},
"memberships": [{
"mapping": {
"role": {"default": "Member of Organization"},
"organization": {
"mapping": {
"id": {
"columns": ["org_id"],
"constant": "default-org"
},
"name": {
"column": "org_name",
"constant": "Default Organization",
"transforms": ["strip"]
}
}
}
}
}]
}
}
```
This mapping would apply to a four-column CSV file and map it to a set of
nested JSON objects (a [Popolo](http://www.popoloproject.com/) person, with a
membership in an organization).
## Data Transforms
While ``jsonmapping`` is not a data cleaning tool, it supports some very basic
data transformation operations that can be applied on a particular column or
set of columns. These include:
* ``coalesce``: Select the first non-null value from the list of items.
* ``slugify``: Transform each string into a URL slug form.
* ``join``: Merge together the string values of all selected columns.
* ``upper``: Transform the text to upper case.
* ``lower``: Transform the text to lower case.
* ``strip``: Remove leading and trailing whitespace.
* ``hash``: Generate a SHA1 hash of the given value.
## Usage
``jsonmapping`` is available on the Python Package Index:
```bash
$ pip install jsonmapping
```
The library can then be used as follows:
```python
from jsonschema import RefResolver
from jsonmapping import Mapper
# ... load the mapping ...
mapping = load_mapping()
resolver = RefResolver.from_schema(mapping)
# ... grab some data ...
rows = read_csv()
objs = []
# This will transform flat data rows into nested JSON objects:
for obj, err in Mapper.apply_iter(rows, mapping, resolver):
if err is None:
objs.append(obj)
# And you can reverse the process, even though that is lossy:
for row in Mapper.flatten_iter(objs, mapping, resolver):
print row
```
## Tests
The test suite will usually be executed in it's own ``virtualenv`` and perform a
coverage check as well as the tests. To execute on a system with ``virtualenv``
and ``make`` installed, type:
```bash
$ make test
```
To transform flat data structures into nested object graphs matching JSON
schema definitions, this package defines a mapping language. It defines how
the columns of a source data set (e.g. a CSV file, database table) are to be
converted to the fields of a JSON schema.
The format allows mapping nested structures, including arrays. It also supports
the application of very basic data transformation steps, such as generating a
URL slug or hashing a column value.
## Example mapping
The mapping format is independent of any particular JSON schema, such that
multiple mappings could be defined for any one particular schema.
```json
{
"schema": {"$ref": "http://www.popoloproject.com/schemas/person.json"},
"mapping": {
"id": {"column": "person_id"},
"name": {"column": "person_name"},
"memberships": [{
"mapping": {
"role": {"default": "Member of Organization"},
"organization": {
"mapping": {
"id": {
"columns": ["org_id"],
"constant": "default-org"
},
"name": {
"column": "org_name",
"constant": "Default Organization",
"transforms": ["strip"]
}
}
}
}
}]
}
}
```
This mapping would apply to a four-column CSV file and map it to a set of
nested JSON objects (a [Popolo](http://www.popoloproject.com/) person, with a
membership in an organization).
## Data Transforms
While ``jsonmapping`` is not a data cleaning tool, it supports some very basic
data transformation operations that can be applied on a particular column or
set of columns. These include:
* ``coalesce``: Select the first non-null value from the list of items.
* ``slugify``: Transform each string into a URL slug form.
* ``join``: Merge together the string values of all selected columns.
* ``upper``: Transform the text to upper case.
* ``lower``: Transform the text to lower case.
* ``strip``: Remove leading and trailing whitespace.
* ``hash``: Generate a SHA1 hash of the given value.
## Usage
``jsonmapping`` is available on the Python Package Index:
```bash
$ pip install jsonmapping
```
The library can then be used as follows:
```python
from jsonschema import RefResolver
from jsonmapping import Mapper
# ... load the mapping ...
mapping = load_mapping()
resolver = RefResolver.from_schema(mapping)
# ... grab some data ...
rows = read_csv()
objs = []
# This will transform flat data rows into nested JSON objects:
for obj, err in Mapper.apply_iter(rows, mapping, resolver):
if err is None:
objs.append(obj)
# And you can reverse the process, even though that is lossy:
for row in Mapper.flatten_iter(objs, mapping, resolver):
print row
```
## Tests
The test suite will usually be executed in it's own ``virtualenv`` and perform a
coverage check as well as the tests. To execute on a system with ``virtualenv``
and ``make`` installed, type:
```bash
$ make test
```
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
jsonmapping-0.4.1.tar.gz
(8.6 kB
view details)
Built Distribution
File details
Details for the file jsonmapping-0.4.1.tar.gz
.
File metadata
- Download URL: jsonmapping-0.4.1.tar.gz
- Upload date:
- Size: 8.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0e2aaed686661812f3b2a643a95e4bb140e44bf6e9be01e308c54998ccb53007 |
|
MD5 | 2a9141b9183cd253c30b510951e5bee1 |
|
BLAKE2b-256 | d00b775bdef9e2ab8b2d41abbf44ba894ded50a3d0443c4eca5c52c895853d18 |
File details
Details for the file jsonmapping-0.4.1-py2-none-any.whl
.
File metadata
- Download URL: jsonmapping-0.4.1-py2-none-any.whl
- Upload date:
- Size: 11.2 kB
- Tags: Python 2
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 06bceab9ef9c1d0ed34ecc6dca9ca89f2e90a4863186a9d67f9e4828da8a037c |
|
MD5 | 54b9360815fa033225a2d7a345513c36 |
|
BLAKE2b-256 | 5945b13fbbb730341d548137d19c4cd92d06f7cff385c6213cb053eff0529692 |