Skip to main content

A Python- and pandas-powered client for Statistical Data and Metadata eXchange

Project description

pandaSDMX is an Apache 2.0-licensed Python package aimed at becoming the most intuitive and versatile tool to retrieve and acquire statistical data and metadata disseminated in SDMX format. It should work with all SDMX data providers supporting SDMX 2.1. Currently, this is tested for the European statistics office (Eurostat), and the European Central Bank (ECB) each providing hundreds of thousands of time series.

While pandaSDMX is extensible to cater any output format, it currently supports only pandas, the gold-standard of data analysis in Python. But from pandas you can export your data to Excel and friends.

Main features

  • intuitive API inspired by requests

  • support for many SDMX features including

    • generic datasets

    • data structure definitions, code lists and concept schemes

    • dataflow definitions and content-constraints

    • categorisations and category schemes

  • pythonic representation of the SDMX information model

  • find dataflows by name or description in multiple languages if available

  • When requesting datasets, validate column selections against code lists and content-constraints if available

  • read and write SDMX messages to and from local files

  • configurable HTTP connections

  • support for requests-cache allowing to cache SDMX messages in memory, MongoDB, Redis or SQLite

  • writer transforming SDMX generic datasets into multi-indexed pandas DataFrames or Series of observations and attributes

  • extensible through custom readers and writers for alternative input and output formats of data and metadata

For further details including extensive code examples see the documentation .

v0.3.0 (2015-09-22)

  • support for requests-cache allowing to cache SDMX messages in memory, MongoDB, Redis or SQLite

  • pythonic selection of series when requesting a dataset: Request.get allows the key keyword argument in a data request to be a dict mapping dimension names to values. In this case, the dataflow definition and datastructure definition, and content-constraint are downloaded on the fly, cached in memory and used to validate the keys. The dotted key string needed to construct the URL will be generated automatically.

  • The Response.write method takes a parse_time keyword arg. Set it to False to avoid parsing of dates, times and time periods as exotic formats may cause crashes.

  • The Request.get method takes a memcache keyward argument. If set to a string, the received Response instance will be stored in the dict Request.cache for later use. This is useful when, e.g., a DSD is needed multiple times to validate keys.

  • fixed base URL for Eurostat

  • major refactorings to enhance code maintainability

v0.2.2 (2015-05-19)

  • Make HTTP connections configurable by exposing the requests.get API through the pandasdmx.api.Request constructor. Hence, proxy servers, authentication information and other HTTP-related parameters consumed by requests.get can be set for an Request instance and used in subsequent requests. The configuration is exposed as a dict through the Request.client.config attribute.

  • Responses now have an http_headers attribute containing the headers returned by the SDMX server

v0.2.1 (2015-04-22)

  • API: add support for zip archives received from an SDMX server. This is common for large datasets from Eurostat

  • incidentally get a remote resource if the footer of a received message specifies an URL. This pattern is common for large datasets from Eurostat.

  • allow passing a file-like object to api.Request.get()

  • enhance documentation

  • make pandas writer parse more time period formats and increase its performance

v0.2.0 (2015-04-13)

This version is a quantum leap. The whole project has been redesigned and rewritten from scratch to provide robust support for many SDMX features. The new architecture is centered around a pythonic representation of the SDMX information model. It is extensible through readers and writers for alternative input and output formats. Export to pandas has been dramatically improved. Sphinx documentation has been added.

v0.1 (2014-09)

Initial release

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandaSDMX-0.3.0.tar.gz (56.1 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pandaSDMX-0.3.0-py3.4.egg (132.9 kB view details)

Uploaded Egg

pandaSDMX-0.3.0-py2.7.egg (130.5 kB view details)

Uploaded Egg

File details

Details for the file pandaSDMX-0.3.0.tar.gz.

File metadata

  • Download URL: pandaSDMX-0.3.0.tar.gz
  • Upload date:
  • Size: 56.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandaSDMX-0.3.0.tar.gz
Algorithm Hash digest
SHA256 5ed162dfb54a8c06a0f4bae1b3c225471834634e90cf579e1be88a5fe983dfba
MD5 0d44ac9800a2a90f838141db6b791a48
BLAKE2b-256 c0f05a26020c2e9ff85036e9f8e0410170eb9f0a36daa8016b0c94980b2feae1

See more details on using hashes here.

File details

Details for the file pandaSDMX-0.3.0-py3.4.egg.

File metadata

  • Download URL: pandaSDMX-0.3.0-py3.4.egg
  • Upload date:
  • Size: 132.9 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandaSDMX-0.3.0-py3.4.egg
Algorithm Hash digest
SHA256 b9e7b26e00793a57c0a227590706f92f1878b5204d8e1a7b3a755fa8021b1beb
MD5 79f49939e2da937a0a452df14b072b49
BLAKE2b-256 be734cd37e06453bf4b12e63b66bfb1eb14b756c0dfc9395aeb3e28c7943b279

See more details on using hashes here.

File details

Details for the file pandaSDMX-0.3.0-py2.7.egg.

File metadata

  • Download URL: pandaSDMX-0.3.0-py2.7.egg
  • Upload date:
  • Size: 130.5 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pandaSDMX-0.3.0-py2.7.egg
Algorithm Hash digest
SHA256 acd312a16a41abf9c395194c87b21ce87d0cc45143393f75c84d6d18cea2bb46
MD5 b03d66d1ebdca2ca7af9980e0cc0006e
BLAKE2b-256 0a47d4af3d9b0a9a062deb74e61f151553806d89f0a111a7390156bee3a69391

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page