Skip to main content

Process Slocum glider data

Project description

glide

The Slocum underwater glider processing command line interface.

glide produces quality controlled L2 and L3 datasets from real-time and delayed Slocum glider data. It can generate datasets that meet IOOS Glider DAC standards. It requires that data are first converted to netCDF or csv using dbd2netcdf (or dbd2csv), a very fast Dinkum binary conversion tool.

Our definitions of data processing levels are guided by NASA, the Spray data, and our own experiences working with gliders. We define the following levels:

  • L0: Binary files produced by Slocum gliders include .dbd, .sbd, .ebd, .tbd or their compressed counterparts .dcd, ... etc.
  • L1: NetCDF or csv timeseries of flight and science data generated using dbd2netcdf. Usually named glidername.dbd.nc and glidername.ebd.nc or something similar. No quality control is performed. Data have the same units as in masterdata.
  • L2: Variable units are converted to oceanographic standards. Quality controls are applied. Some missing data are interpolated. Dead reckoned GPS positions are adjusted using surface GPS fixes; valid GPS fixes are also written on a dedicated time_gps dimension. Thermodynamic variables, such as potential density, are derived. Profiles are identified and tagged with profile_id. Depth-averaged velocity is reported on a time_uv dimension. Science and flight variables specified in the configuration file are merged into a single file.
  • L3: The L2 data are binned in depth and separated into profiles. Ancillery datasets may be merged, such as MicroRider data processed using q2netcdf.

We also provide the following intermediate processing outputs that may be useful for debugging issues:

  • L1B: The L1 data are parsed and quality control is performed but science and flight data are not merged.

Installation

glide is now published to PyPI and can be installed using pip.

pip install slocum-glide

Since this is primary a CLI tool, pipx may be preferred for system installation.

pipx install slocum-glide

Note that glide may be slow to run at first try because the environment has to be built.

Usage

flowchart TD;
    sbd[L0 flight: .sbd /.dbd] -->|dbd2netcdf| l1sbd[L1 flight: .sbd.nc];
    tbd[L0 science: .tbd /.ebd] -->|dbd2netcdf| l1tbd[L1 science: .tbd.nc];
    l1sbd --- C[ ]:::empty;
    l1tbd --- C;
    config[config.yml] --- C;
    C -->|glide l2| l2[L2: .l2.nc];
    l2 --- D[ ]:::empty;
    config --- D;
    q[Ancillery: .q.nc] --- D;
    D -->|glide l3| l3[L3: .l3.nc];
    
classDef empty fill:none,stroke:none,color:transparent,width:1px,height:1px;

The expected processing pipeline is described by the chart above. glide requires a configuration file to properly process glider data. If you do not provide a file, the default file will be used. The configuration file specifies which variables to extract from the L1 data and provides flags for unit conversion and quality controls. Variables that are not listed will not be extracted. A lot of the package functionality is documented in the configuration file and users are encourage to reivew it thoroughly.

Assuming that you have already run dbd2netcdf over raw files (e.g. dbd2netcdf -o glider.tbd.nc *.tbd) you can apply the l2 processing to the flight and science data.

glide l2 -o glidername.l2.nc glidername.sbd.nc glidername.tbd.nc 

In case the default configuration needs modification, you may output it and edit as needed.

glide cfg -o my.config.yml

A custom configuration may then be specified as an argument.

glide l2 -o glidername.l2.nc -c my.config.yml glidername.sbd.nc glidername.tbd.nc

The two file arguments also accept shell-style glob patterns in quotes ("..."). glide will concatenate per-segment L1 files for you.

glide l2 -o glidername.l2.nc "glidername-*.sbd.nc" "glidername-*.tbd.nc"

Each flight file must have a science file with the same basename stem (e.g. glider-2025-056-0-27.sbd.nc pairs with glider-2025-056-0-27.tbd.nc); the command aborts on any unpaired file. Pass --skip-unpaired to drop unmatched files with a warning instead.

To perform level 3 processing with a specific bin size use the -b option. Note that binning is performed in depth, not pressure.

glide l3 glidername.l2.nc -o glidername.l3.nc -c glidername.config.yml -b 10

To extract dead-reckoned location data to CSV or just the surface fixes use the gps subcommand.

glide gps glidername.l2.nc -o glidername.gps.csv  # dead-reckoned, interpolated position
glide gps glidername.l2.nc -o glidername.fixes.csv --fixes  # surface GPS fixes only

A flight model may be calibrated against L2 glider data resulting in variables such as angle of attack and through-water speed. Review the configuration yml for more details on the setup.

glider flight -c my.config.yml -o glidername.flight.nc glidername.l2.nc

To view the help for the package, or a specific command, use:

glide --help
glide l2 --help

Real-time workflow and national glider DAC

For real-time applications, especially the production of DAC files, glide is designed to be run on the full concatenated dataset for a deployment, not on individual segment files as they arrive. Re-running glide l2 on real-time data is relatively cheap and fast. The re-run avoids gaps that arise when velocity, GPS, or other state is reported only at the next surfacing and also ensures consistent profile numbering. You can either pre-merge files with dbd2netcdf and pass the resulting single file to glide l2, or pass glob patterns directly to glide l2, as specified above.

IOOS Glider DAC submission

Add --ioos to glide l2 to additionally emit one NGDAC v2-compliant NetCDF file per profile (one descent or one ascent) into the given directory:

glide l2 glider.sbd.nc glider.tbd.nc -o glider.l2.nc \
    --ioos ./dac/ -g glidername -c glidername.config.yml

A profile is emitted only when its containing surface-to-surface segment has a finite depth-averaged velocity — i.e., the closing surfacing has reported. Profiles still awaiting that surfacing are skipped and will be emitted on a future re-run with more concatenated data. Existing files are skipped (the filename encodes the profile start time); pass --force to overwrite.

Per-deployment instrument metadata (CTD make/model/serial, calibration dates, etc.) goes in the instruments: section of config.yml and is placed as scalar variables in each profile file.

Quality control

During L1 to L2 processing we:

  • Drop missing or repeated timestamps.
  • Check data are within valid_min and valid_max limits from the config.
  • Interpolate missing dead-reckoned position and linearly adjust the dead-reckoned position to align with surface fixes.
  • Identify behavioral states such as dive, climb, surface, and drift, and assign profile numbers. This relies on profinder
  • Track per-variable QC flags (*_qc) for variables tagged track_qc in core.yml, including for variables interpolated across the science/flight merge.

We plan to implement more of the standard IOOS QC methods in the future.

Contributing

Collaboration is highly encouraged, and contributions from the community are welcome. To ensure a productive and respectful development process, please follow these guidelines.

  • Open an issue to describe the problem you're addressing or the feature you'd like to implement. Be as detailed as possible. Include relevant context, your motivation, and any initial ideas you may have.
  • Fork the repository and begin working on a solution in a separate branch.
  • When ready, submit a pull request that references the related issue. Keep pull requests focused and limited to a single concern.
  • Please ensure your code follows the existing style and structure of the project.
  • Include tests, as appropriate.

Development

This package is developed with uv.

After cloning this repository, genereate the virtual environment:

uv sync

Run tests:

uv run pytest -v

Format code:

uv run ruff format src tests
uv run ruff check --select I --fix

Type checking:

uv run mypy src tests

Try out glide on the test data:

uv run glide --log-level=debug l2 tests/data/osu684.sbd.csv tests/data/osu684.tbd.csv

By default this will produce a file slocum.l2.nc.

Publishing

Make sure to the bump the version appropriately.

uv version --bump <major/minor/patch>

Remove any existing distributions, build, and publish.

rm -rf dist
uv build
uvx twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

slocum_glide-0.1.3.tar.gz (829.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

slocum_glide-0.1.3-py3-none-any.whl (62.6 kB view details)

Uploaded Python 3

File details

Details for the file slocum_glide-0.1.3.tar.gz.

File metadata

  • Download URL: slocum_glide-0.1.3.tar.gz
  • Upload date:
  • Size: 829.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for slocum_glide-0.1.3.tar.gz
Algorithm Hash digest
SHA256 939e1ae2783957f5ce3a36f1047136d3cf8de5a7f539d0e2970cfdff5c5c6d58
MD5 b7f082811983aa4aa41099684f2b1f3e
BLAKE2b-256 0db435782907afd2764809de8b44d37107f6b86e138634faa7b523cd11181ff7

See more details on using hashes here.

File details

Details for the file slocum_glide-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: slocum_glide-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 62.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for slocum_glide-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 803bb36c03caf0de2230b90eb4aa388e1a7c7fc8a2e0bd2078342cc56a3de638
MD5 ade64d934f13c0aa838383d0c3440502
BLAKE2b-256 260ba30e1a9bc4868f23b3e43577e2f8899cbbff4ff0d2c15c27fc94086ad1f8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page