Toolbox for creating/assessing EMSO-compliant NetCDF datasets and integrate them into ERDDAP services
Project description
Metadata Harmonizer Toolbox
This repository contains a set of tools that can be used to create NetCDF files, integrate them into an ERDDAP server and to ensure the compliance with the EMSO Metadata Specifications. The tools provided here are:
emh.generate_dataset(): creates EMSO-compliant NetCDF files from.csvand.yamlfilesemh.erddap_config(): integrates NetCDF files into an ERDDAP serveremh.metadata_report(): check the compliance of a dataset with the specifications.
In order to create and publish an EMSO-compliant dataset, the typical workflow is:
- Prepare CSV data and YAML metadata
- Generate EMSO-compliant NetCDF files using
generate_dataset() - Integrate datasets into your ERDDAP deployment using
erddap_config() - Validate metadata and operational compliance using
metadata_report()
Installation
To install as a PyPi package:
pip3 install emso_metadata_harmonizer
🛠 NetCDF Generator
To generate a NetCDF dataset from data (csv) and metadata (yaml) files:
import emso_metadata_harmonizer as emh
emh.generate_dataset(["data.csv"], ["meta.yaml"], output="dataset.nc")
Full example with data and metadata from the example 2
import emso_metadata_harmonizer as emh
import urllib
# Download data and metadata from the example 2 in the metadata-harmonizer repository
data_url = "https://raw.githubusercontent.com/emso-eric/metadata-harmonizer/refs/heads/develop/examples/02/SBE16.csv"
meta_url = "https://raw.githubusercontent.com/emso-eric/metadata-harmonizer/refs/heads/develop/examples/02/meta.yaml"
urllib.request.urlretrieve(data_url, "data.csv")
urllib.request.urlretrieve(meta_url, "meta.yaml")
# Generate dataset from one data file
emh.generate_dataset(["data.csv"], ["meta.yaml"], "dataset.nc")
To generate a dataset from multiple data files:
import emso_metadata_harmonizer as emh
import urllib
# Generate dataset from multiple data files
data1_url = "https://raw.githubusercontent.com/emso-eric/metadata-harmonizer/refs/heads/develop/examples/02/SBE16.csv"
data2_url = "https://raw.githubusercontent.com/emso-eric/metadata-harmonizer/refs/heads/develop/examples/02/SBE37.csv"
meta_url = "https://raw.githubusercontent.com/emso-eric/metadata-harmonizer/refs/heads/develop/examples/02/meta.yaml"
urllib.request.urlretrieve(data1_url, "data1.csv")
urllib.request.urlretrieve(data2_url, "data2.csv")
urllib.request.urlretrieve(meta_url, "meta.yaml")
emh.generate_dataset(["data.csv", "data2.csv"], ["meta.yaml"], "dataset2.nc")
⚙️ ERDDAP Configurator
The ERDDAP Configurator (erddap_config()) helps prepare ERDDAP dataset definitions for NetCDF files, reducing manual
work editing ERDDAP’s XML configurations. It reads NetCDF metadata and generates XML chunk required to register a new
dataset.
import emso_metadata_harmonizer as emh
emh.erddap_config("dataset2.nc", "MyDatasetIdentifier", "/path/to/dataset/files")
To automatically append a new dataset into an existing ERDDAP deployment, the path to the datasets.xml file should
be passed via the datasets_xml_file parameter.
import emso_metadata_harmonizer as emh
emh.erddap_config("dataset.nc", "MyDatasetIdentifier", "/path/to/dataset/files", datasets_xml_file="path/to/datasets.xml")
📈 Metadata Report
The metadata reporting tool assesses the level of compliance of an ERDDAP or NetCDF dataset with the EMSO Metadata Specifications. To test a dataset, use the following syntax:
import emso_metadata_harmonizer as emh
emh.metadata_report("dataset.nc")
Contact info
- author: Enoc Martínez
- version: v1.0.0
- organization: Universitat Politècnica de Catalunya (UPC)
- contact: enoc.martinez@upc.edu
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file emso_metadata_harmonizer-1.0.0a3-py3-none-any.whl.
File metadata
- Download URL: emso_metadata_harmonizer-1.0.0a3-py3-none-any.whl
- Upload date:
- Size: 52.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dac0cf7bcc7d3e8ab1825ec987e29736de313eefdb6106db7d6247e67076a735
|
|
| MD5 |
022d4f1ac5269747e7ca8542ae855145
|
|
| BLAKE2b-256 |
e19f95add4364a2e98c2fac244c284903c66a58d0ec0406e0f8f60a62cd91dcf
|