Skip to main content

A Python- and pandas-powered client for Statistical Data and Metadata eXchange

Project description

pandaSDMX is an Apache 2.0-licensed Python package aimed at becoming the most intuitive and versatile tool to retrieve and acquire statistical data and metadata disseminated in SDMX format. It should work with all+ SDMX data providers supporting SDMX 2.1. Currently the only known SDMX 2.1-compliant agencies are the European statistics office (Eurostat, more than 5700 dataflows) and the European Central Bank (ECB, more than 70 dataflows). While pandaSDMX is extensible to cater any output format, it currently supports only pandas, the gold-standard of data analysis in Python. But from pandas you can export your data to Excel and friends.

Main features

  • intuitive API inspired by requests

  • support for many SDMX features including

    • generic datasets

    • data structure definitions, codelists and concept schemes

    • dataflow definitions

    • categorisations and category schemes

  • pythonic representation of the SDMX information model

  • find dataflows by name or description in multiple languages if available

  • read and write local files for offline use

  • writer transforming SDMX generic datasets into multi-indexed pandas DataFrames or Series of observations and attributes

  • extensible through custom readers and writers for alternative input and output formats of data and metadata

Example

>>> from pandasdmx import Request
>>> # Get annual unemployment data from Eurostat
>>> une_resp = Request('ESTAT').get(resource_type = 'data', resource_id = 'une_rt_a')
>>> # From the received dataset, select the time series on Greece, Ireland and Spain, and write them to a pandas DataFrame
>>> une_df = une_resp.write(s for s in une_resp.msg.data.series if s.key.GEO in ['EL', 'ES', 'IE'])
>>> # Explore the DataFrame
>>> une_df.columns.names
>>> une_df.columns.levels[0:2]
>>> une_df.loc[:'2006', ('TOTAL', 'T')]

Version 0.2.0 (2015-04)

This version is a quantum leap. The whole project has been redesigned and rewritten from scratch to provide robust support for many SDMX features. The new architecture is centered around a pythonic representation of the SDMX information model. It is extensible through readers and writers for alternative input and output formats. Export to pandas has been dramatically improved. Sphinx documentation has been added.

v0.1 (2014-09)

Initial release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandaSDMX-0.2.0.tar.gz (51.0 kB view hashes)

Uploaded Source

Built Distribution

pandaSDMX-0.2.0-py2.py3-none-any.whl (94.0 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page