
a Python library to interact with a collection of frictionless datapackages


This module provides a Python library to interact with a collection of frictionless datapackages. Such a datapackage consists of a CSV (data) file annotated by a JSON file. The JSON file can store additional information, such as the units of the CSV columns or metadata describing the underlying data. Example datapackages can be found here, and a JSON file could be structured as follows:

{
    "resources": [
        {
            "name": "demo_package",
            "type": "table",
            "path": "demo_package.csv",
            "scheme": "file",
            "format": "csv",
            "mediatype": "text/csv",
            "encoding": "utf-8",
            "schema": {
                "fields": [
                    {
                        "name": "t",
                        "type": "number",
                        "unit": "s"
                    },
                    {
                        "name": "j",
                        "type": "number",
                        "unit": "A / m2"
                    }
                ]
            },
            "metadata": {
                "echemdb": {
                    "description": "Sample data for the unitpackage module.",
                    "curation": {
                        "process": [
                            {
                                "role": "experimentalist",
                                "name": "John Doe",
                                "laboratory": "Institute of Good Scientific Practice",
                                "date": "2021-07-09"
                            }
                        ]
                    }
                }
            }
        }
    ]
}
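
Because the descriptor is plain JSON, the units of the columns can be inspected with the Python standard library alone. The following is a minimal sketch and not part of the unitpackage API: the helper name `field_units` is hypothetical, and the descriptor is inlined and trimmed to its schema so that the snippet runs on its own.

```python
import json

# Trimmed copy of the descriptor shown above, inlined so the sketch runs as-is.
DESCRIPTOR = """
{
    "resources": [
        {
            "name": "demo_package",
            "path": "demo_package.csv",
            "schema": {
                "fields": [
                    {"name": "t", "type": "number", "unit": "s"},
                    {"name": "j", "type": "number", "unit": "A / m2"}
                ]
            }
        }
    ]
}
"""


def field_units(descriptor, resource_index=0):
    """Map each field name of one resource to its unit (None if no unit is given)."""
    fields = descriptor["resources"][resource_index]["schema"]["fields"]
    return {field["name"]: field.get("unit") for field in fields}


units = field_units(json.loads(DESCRIPTOR))
print(units)
```

Using `field.get("unit")` rather than `field["unit"]` keeps the helper usable for fields that carry no unit annotation.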

The metadata of an entry's resource in a collection is accessible from the Python API.

>>> from unitpackage.collection import Collection
>>> db = Collection.from_local('./doc/files')
>>> entry = db['demo_package_cv']
>>> entry.description
'Sample data for the unitpackage module.'
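
The description returned above is the value stored under `metadata.echemdb.description` in the example descriptor. The sketch below only illustrates where such values live in the nested JSON. It does not show how unitpackage implements attribute access, and the helper `get_nested` is hypothetical.

```python
from functools import reduce

# The "metadata" block of the example descriptor, written as a Python dict.
METADATA = {
    "echemdb": {
        "description": "Sample data for the unitpackage module.",
        "curation": {
            "process": [
                {
                    "role": "experimentalist",
                    "name": "John Doe",
                    "laboratory": "Institute of Good Scientific Practice",
                    "date": "2021-07-09",
                }
            ]
        },
    }
}


def get_nested(node, *keys):
    """Follow a sequence of dict keys or list indices into a nested structure."""
    return reduce(lambda current, key: current[key], keys, node)


description = get_nested(METADATA, "echemdb", "description")
first_curator = get_nested(METADATA, "echemdb", "curation", "process", 0, "name")
print(description)
print(first_curator)
```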

The API can also draw a simple 2D plot.

>>> entry.plot()

Ultimately, unitpackage allows simple transformation of the data within a resource into different units.

>>> entry.get_unit('j')
'A / m2'
>>> entry.df
          t         E         j
0  0.000000 -0.196962  0.043009
1  0.011368 -0.196393  0.051408
...
>>> entry.rescale({'E' : 'mV', 'j' : 'uA / m2'}).df
          t           E             j
0  0.000000 -196.961730  43008.842162
1  0.011368 -196.393321  51408.199892
...
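
The arithmetic behind the rescaling example can be checked by hand. The sketch below is plain Python and not the unitpackage implementation: the lookup table `FACTORS` and the helper `rescale_column` are hypothetical, and the original unit of `E` is assumed to be V, which the factor of 1000 in the output above implies.

```python
# Multiplicative factors for the two conversions used in the example above.
# The source unit "V" for E is an assumption inferred from the printed values.
FACTORS = {
    ("V", "mV"): 1e3,
    ("A / m2", "uA / m2"): 1e6,
}


def rescale_column(values, from_unit, to_unit):
    """Multiply every value by the factor that converts from_unit to to_unit."""
    factor = FACTORS[(from_unit, to_unit)]
    return [value * factor for value in values]


# First two rows of the E and j columns, as displayed in the data frame above.
E_mV = rescale_column([-0.196962, -0.196393], "V", "mV")
j_uA = rescale_column([0.043009, 0.051408], "A / m2", "uA / m2")
print(E_mV)
print(j_uA)
```

The results agree with the rescaled data frame only to the displayed precision, since the inputs here are the rounded values printed by `entry.df`.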

Collections for specific measurement types can be created. These provide additional access to the metadata or simplify the representation of such data in common plot types. An example of such a collection can be found on echemdb.org, which shows cyclic voltammetry data annotated following echemdb's metadata schema. The data is retrieved from the echemdb data repository and can be stored in a CVCollection.

Detailed installation instructions, descriptions of the modules, and advanced usage examples, including local collection creation, are provided in our documentation.

Installation instructions

This package is available on PyPI and can be installed with pip:

pip install unitpackage

The package is also available on conda-forge and can be installed with conda

conda install -c conda-forge unitpackage

or mamba

mamba install -c conda-forge unitpackage

Please consult our documentation for more detailed installation instructions.

License

The contents of this repository are licensed under the GNU General Public License v3.0 or, at your option, any later version.
