Library for reading JSON-stat format data.
Project description
jsonstat.py is a library for reading the JSON-stat [1] data format maintained and promoted by Xavier Badosa. The JSON-stat format is a JSON format for publishing dataset. JSON-stat is used by several institutions to publish statistical data. For example it is used by Eurostat that provide statistical information about the European Union (EU) and Istat, the Italian National Institute of Statistics)
jsonstat.py library tries to mimic as much is possible in pythonn the json-stat Javascript Toolkit One of the objectives is to be helpful in exploring dataset using ipython notebook.
For a fast overview of the feature you can start from this example notebook oecd-canada-jsonstat_v1.html
You can also check out some of the jupyter example notebook from the example directory on github or into the documentation
As bonus jsonstat.py contains an useful classes to explore dataset publiched by Istat.
You can find useful another python library pyjstat by Miguel Expósito Martín concerning json-stat format. Its main focus is to translate json-stat format into pandas dataframe.
This library is in beta status. I am actively working on it and hope to improve this project. For every comment feel free to contact me gf@26fe.com
Installation
Pip will install all required dependencies. For installation:
pip install jsonstat
Usage
Simple Usage
There is a simple command line interface, so you can experiment to parse jsonstat file without write code:
jsonstat --cache_dir /tmp http://json-stat.org/samples/oecd-canada.json downloaded file(s) are stored into '/tmp' download 'http://json-stat.org/samples/oecd-canada.json' JsonstatCollection contains the following JsonStatDataSet: 0: dataset 'oecd' 1: dataset 'canada'
code example:
url = 'http://json-stat.org/samples/oecd-canada.json' collection = jsonstat.from_url(json_string) # print list of dataset contained into the collection collection.info() # select the first dataset of the collection and print a short description oecd = collection.dataset(0) oecd.info() # print description about each dimension of the dataset for d in oecd.dimensions(): d.info() # print a datapoint contained into the dataset print(oecd.value(area='IT', year='2012')) # convert a dataset in pandas dataframe df = oecd.to_data_frame('year')
For more python script examples see examples directory
For jupyter (ipython) notebooks see examples-notebooks directory
License
jsonstat.py is provided under the LGPL license. See LICENSE file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.