Skip to main content

Python wrapper for ASF's metadata

Project description

PyPI version

asf_metadata

This repo contains all the Python 3.8 scripts related to manage ASF metadata. These are currently mostly dealing with ISO metadata.

Table of Contents


ISO template

ISO metadata templates can be best maintained in a more user-friendly environment. I have settled for an Excel spreadsheet to achieve that.

ISO metadata

The template mechanism has three different flows. All use an Excel spreadsheet to define the structure of the ISO metadata.

ISO metadata template generation

The ISO metadata's structure is stored in an Excel spreadsheet and contains identifiers for each metadata parameter that is variable. The structure and variables form the template that we can store in two formats: XML and JSON.

Format conversion

In order to convert the metadata between various formats, we need to know the structure of it. That is saved within an Excel spreadsheet.

This functionality is something I anticipate using for the NISAR mission, for which we only get metadata in ISO XML format. Using the structure from the spreadsheet allows to painlessly deal with the namespaces within the XML files.

ISO metadata generation

For the generation of ISO metadata, we need the structure from the Excel spreadsheet. The template parameters need to be exchanged by their actual values. This will be achieved by extracting the necessary information based on the product files, replacing the functionality that existed in the old MapReady framework.


Excel spreadsheet

The Excel spreadsheet has a number of worksheets.

ISO metadata structure

ISO metadata structure worksheet

This worksheet provides the relevant parts of the ISO metadata, most importantly its structure and the namespaces used within that structure.

Namespaces

One of the biggest nuisances about ISO XML metadata is the namespaces.

Namespaces look-up-table

This worksheet links each parameter to its respective namespace.

Namespaces URLs

This worksheet defines the URLs for the individual namespaces needed in the XML declarations.


ISO metadata functions

I developed a number of functions that perform the various tasks outline above.

Extracting information from templates

The template information in the Excel spreadsheet is extracted (using the isoTemplate2lists function) into a number of lists: the flattened template structure, the individual metadata parameters, and their respective values.

Conversion of metadata formats

In order to convert an XML file to JSON format, we need to extract the individual metadata parameters out of the ISO metadata XML file using the get_metadata_values function.

Replacing placeholders with actual values

The templates stored in the Excel spreadsheets contain ISO parameter placeholders. The product_dictionary2values function replaces these placeholders, saved in a product dictionary, with the actual metadata values.

Generating ISO metadata from product files

ISO metadata can be generated by extracting the necessary information based on the product files (using the generate_product_dictionary function). Depending on the product type, specific conversion functions are called:

  • RTC products: gamma_rtc_metadata function

Building metadata structures

The ISO metadata is saved in two different formats: XML and JSON. The XML is using a tree structure that is built with the iso_xml_structure function, while the JSON structure is formed by a complex nested dictionary using the iso_dictionary_structure function.

Writing metadata to file

There are two separate functions to save the metadata to a file format. meta_xml_file is the function that generates an ISO XML metadata file, while meta_json_file saves the metadata in JSON format.

Cleaning up metadata

There are two separate functions to clean up the metadata structures. clean_xml_structure (for XML files) and clean_json_structure (for JSON files) is the function that removes metadata for product files (digital elevation model, incidence angle map, scattering area map) that are only optionally included.


ISO metadata tools

I developed a number of tools that perform the various tasks outline above.

Generate ISO template file from Excel spreadsheet

usage: generate_iso_template.py [-h] [-dem DEM] excelFile isoBase

Generate ISO template file from Excel spreadsheet

positional arguments: excelFile name of the Excel template spreadsheet isoBase basename of the ISO metadata template file

optional arguments: -h, --help show this help message and exit -dem DEM name of DEM template spreadsheet

Generate ISO metadata file from Excel spreadsheet

usage: generate_iso_metadata.py [-h] [-product PRODUCT] [-data DATA] [-dem DEM] meta_file log_file iso_base

Generate ISO metadata file from Excel spreadsheet

positional arguments: meta_file file path of product metadata such as a "manifest.safe" or leader file log_file name of RTC processing log file iso_base basename of the ISO XML metadata file

optional arguments: -h, --help show this help message and exit -product PRODUCT name of the product type (default: gamma_rtc) choices: ["gamma_rtc"] -data DATA name of the data source (default: sentinel) choices: ["sentinel"] -dem DEM DEM type used for terrain correction choices: ["copernicus"]

At the moment, I have only added one function gamma_rtc_log2meta that analyzes GAMMA RTC log files for metadata. The corresponding product type is 'gamma_rtc'.

Author

Rudi Gens

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

asf_metadata-0.2.0.tar.gz (619.1 kB view details)

Uploaded Source

Built Distribution

asf_metadata-0.2.0-py3-none-any.whl (611.9 kB view details)

Uploaded Python 3

File details

Details for the file asf_metadata-0.2.0.tar.gz.

File metadata

  • Download URL: asf_metadata-0.2.0.tar.gz
  • Upload date:
  • Size: 619.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for asf_metadata-0.2.0.tar.gz
Algorithm Hash digest
SHA256 1021ef916064accd857920932a1dc2826e3d470825c4595e715cfae67115d6b5
MD5 de16053c042ef49320064e033332e2d7
BLAKE2b-256 66694ccac2f294bbce3c943d263a1bdb588b674b65f842c46f0a7a5618c84110

See more details on using hashes here.

File details

Details for the file asf_metadata-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: asf_metadata-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 611.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for asf_metadata-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c877bdc296a70a9e0db3f842b347206f07b61c4f901378da0c6ad8828c7114c6
MD5 647c7e3c9a16cf465db50fb899c0378b
BLAKE2b-256 6d2879e5957e67adbe0dc5397809be69dc023893796e145a2e2f3aa41560f283

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page