Skip to main content

A useful tool for post-processing Earth System Model output 'history files' into the time series format.

Project description

Generate Time Series (GenTS)

Available on pypi Docs GitHub License

The GenTS (Generate Time Series) is an open-source Python Package designed to simplify the post-processing of history files into time series files. This package includes streamlined functions that require minimal input to operate and a documented API for custom workflows.

Features

  • Robust Python API with immutable framework that simplifies integration with existing post-processing workflows and development of new ones
  • Command line interface with sensible defaults for common CESM/E3SM cases and flags for edge/custom cases
  • Supports parallel processing for maximizing data throughput
  • Checks existing time series output for integrity to generate only the remaining files needed
  • Adaptive re-chunking for compliance with CMOR (Climate Model Output Rewriter) standards
  • Comprehensive testing suite for ensuring accuracy and reliability
  • ASV benchmarking suite for maintaining performance across releases
  • Docker/Apptainer/Singularity containers for bypassing complicated environment setups
  • Detailed documentation for self-debugging and quickly on-boarding contributors

Installation

GenTS can be installed in a Python environment using pip. This requires a conda, uv, or Python virtual environment for installing GenTS dependencies (namely numpy, netCDF4, and cftime).

For maximum portability and to avoid environment issues, use the containerized version of GenTS.

PyPI

pip install gents

To install from source, please view the ReadTheDocs Documentation.

Container

Apptainer and Singularity container platforms are typically employed over Docker in HPC environments. Luckily, these platforms (and most others) support running directly from Docker images. The form thus varies across institutions and systems:

For Derecho and Casper (NCAR):

module load apptainer
apptainer run --bind /glade/derecho --cleanenv docker://agentoxygen/gents:latest run_gents --help

For TACC Systems:

module load apptainer
apptainer run docker://agentoxygen/gents:latest run_gents --help

For Perlmutter (NERSC):

shifterimg -v pull docker:agentoxygen/gents:latest
shifter --image=docker:agentoxygen/gents:latest run_gents --help

Running GenTS

Note

It is bad practice to run GenTS on a login node due to the large number of IO-heavy processes it can create. Instead, submit GenTS in a batch job or run it interactively on a compute node. Here are reference pages for deploying interactive sessions on compute nodes for popular HPC centers:

Derecho/Casper Instructions

TACC Instructions

Perlmutter Instructions

GenTS comes with a pre-configured CLI that can be run on most CESM model output and E3SM (atm-only) model output by calling run_gents. The CLI is built on a robust API which can also be configured in a Python script or Jupyter Notebook for custom cases/workflows.

CLI

To view options for running in the command line:

run_gents --help

API Example

Example run.py:

from gents.hfcollection import HFCollection
from gents.timeseries import TSCollection


if __name__ == "__main__":
    input_head_dir = "... case directory with model output ..."
    output_head_dir = "... scratch directory to output time series to ..."

    hf_collection = HFCollection(input_head_dir, num_processes=64)
    hf_collection = hf_collection.include(["*/atm/*", "*/ocn/*", "*.h4.*"])

    ts_collection = TSCollection(hf_collection.include_years(0, 5), output_head_dir, num_processes=32)
    ts_collection = ts_collection.apply_overwrite("*")
    ts_collection.execute()

Then execute the script in a Conda or Python virtual environment with gents installed:

python run.py

Or run from the container:

apptainer run docker://agentoxygen/gents:latest python run.py

Contributor/Bug Reporting Guidelines

Please report all issues to the GitHub issue tracker. When submitting a bug, run gents.utils.enable_logging(verbose=True) at the top of your script to include all log output. This will aid in reproducing the bug and quickly developing a solution.

For development, it is recommended to use the Docker method for testing. These tests are automatically run in the GitHub workflow, but should be run before committing changes.

Citation

If our software package helps you with your research, please consider citing it:

In BibTeX:

@Manual{         cummins2026gents,
 title         = {{GenTS}: Generate Time Series Python Package (Software)},
 author        = {Cameron Cummins},
 year          = {2026},
 url           = {https://github.com/AgentOxygen/GenTS}
}

Acknowledgements

The following people made the development of GenTS possible. Thank you!

Portions of this work were supported by the Regional and Global Model Analysis (RGMA) component of the Earth and Environmental System Modeling Program of the U.S. Department of Energy's Office of Biological & Environmental Research (BER) under Lawrence Livermore National Lab subaward DE-AC52-07NA27344, Lawrence Berkeley National Lab subaward DE-AC02-05CH11231, and Pacific Northwest National Lab subaward DE-AC05-76RL01830. This work was also supported by the National Science Foundation (NSF) National Center for Atmospheric Research, which is a major facility sponsored by NSF under Cooperative Agreement No. 1852977.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gents-1.1.0.tar.gz (167.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gents-1.1.0-py3-none-any.whl (62.4 kB view details)

Uploaded Python 3

File details

Details for the file gents-1.1.0.tar.gz.

File metadata

  • Download URL: gents-1.1.0.tar.gz
  • Upload date:
  • Size: 167.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for gents-1.1.0.tar.gz
Algorithm Hash digest
SHA256 b968b41bee9cfa3dfbd3b386e51d90876e28086d3a819a2b75113a7e0aa23204
MD5 2330acc59c99149fdb33d7aaf6b3c831
BLAKE2b-256 2a4495f439efd93093ed5d02ccbb0d5b3044408fccd7c828649d774b1b6ef06c

See more details on using hashes here.

File details

Details for the file gents-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: gents-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 62.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for gents-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 23c1a6f5cc8f1809e53fd0567b7307456a823693e3e47d1f272cad5aed3ed311
MD5 5249d4197c4d61c0a7cd381aabd1980e
BLAKE2b-256 26b4500c2a05a8f99e4804ab5b14bd1a9da36a12fa1b0c7b7852fe89c2a105be

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page