Skip to main content

An intake plugin for parsing an Earth System Model (ESM) catalog and loading netCDF files and/or Zarr stores into Xarray datasets. This is the ACCESS-NRI bleeding edge fork.

Reason this release was yanked:

Pacakaging Mishaps (conda related)

Project description

Intake-esm

Badges

CI GitHub Workflow Status Code Coverage Status pre-commit.ci status
Docs Documentation Status
Package Conda PyPI Versions
License License
Citation Zenodo

Motivation

Computer simulations of the Earth’s climate and weather generate huge amounts of data. These data are often persisted on HPC systems or in the cloud across multiple data assets of a variety of formats (netCDF, zarr, etc...). Finding, investigating, loading these data assets into compute-ready data containers costs time and effort. The data user needs to know what data sets are available, the attributes describing each data set, before loading a specific data set and analyzing it.

Finding, investigating, loading these assets into data array containers such as xarray can be a daunting task due to the large number of files a user may be interested in. Intake-esm aims to address these issues by providing necessary functionality for searching, discovering, data access/loading.

Overview

intake-esm is a data cataloging utility built on top of intake, pandas, polars and xarray, and it's pretty awesome!

  • Opening an ESM catalog definition file: An Earth System Model (ESM) catalog file is a JSON file that conforms to the ESM Collection Specification. When provided a link/path to an esm catalog file, intake-esm establishes a link to a database (CSV file) that contains data assets locations and associated metadata (i.e., which experiment, model, the come from). The catalog JSON file can be stored on a local filesystem or can be hosted on a remote server.

    In [1]: import intake
    
    In [2]: import intake_esm
    
    In [3]: cat_url = intake_esm.tutorial.get_url("google_cmip6")
    
    In [4]: cat = intake.open_esm_datastore(cat_url)
    
    In [5]: cat
    Out[5]: <GOOGLE-CMIP6 catalog with 4 dataset(s) from 261 asset(s>
    
  • Search and Discovery: intake-esm provides functionality to execute queries against the catalog:

    In [5]: cat_subset = cat.search(
       ...:     experiment_id=["historical", "ssp585"],
       ...:     table_id="Oyr",
       ...:     variable_id="o2",
       ...:     grid_label="gn",
       ...: )
    
    In [6]: cat_subset
    Out[6]: <GOOGLE-CMIP6 catalog with 2 dataset(s) from 67 asset(s)>
    
  • Access: when the user is satisfied with the results of their query, they can load data assets (netCDF and/or Zarr stores) into xarray datasets:

      In [7]: dset_dict = cat_subset.to_dataset_dict()
    
      --> The keys in the returned dictionary of datasets are constructed as follows:
              'activity_id.institution_id.source_id.experiment_id.table_id.grid_label'
      |███████████████████████████████████████████████████████████████| 100.00% [2/2 00:18<00:00]
    

See documentation for more information.

Installation

Intake-esm can be installed from PyPI with pip:

python -m pip install intake-esm

It is also available from conda-forge for conda installations:

conda install -c conda-forge intake-esm

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

intake_esm_access-2025.9.16.tar.gz (141.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

intake_esm_access-2025.9.16-py3-none-any.whl (36.2 kB view details)

Uploaded Python 3

File details

Details for the file intake_esm_access-2025.9.16.tar.gz.

File metadata

  • Download URL: intake_esm_access-2025.9.16.tar.gz
  • Upload date:
  • Size: 141.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for intake_esm_access-2025.9.16.tar.gz
Algorithm Hash digest
SHA256 ada489c76c6decf2687318768ce87fbdfa6ae90161b46dfe1c26004cbe1e80ef
MD5 e5726593c0a95bf1fb0d82db6ebe132d
BLAKE2b-256 72c277386c6c920d260a487387457a1932073b2f54c35f1af64bae5ddf1add4f

See more details on using hashes here.

Provenance

The following attestation bundles were made for intake_esm_access-2025.9.16.tar.gz:

Publisher: pypi.yml on ACCESS-NRI/intake-esm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file intake_esm_access-2025.9.16-py3-none-any.whl.

File metadata

File hashes

Hashes for intake_esm_access-2025.9.16-py3-none-any.whl
Algorithm Hash digest
SHA256 61f2a2cb0230e9e08cf1ffb44bc37b9afa64aef4af82e4c1b29b7a3a7dca1e2d
MD5 80c910b9971f92aba891e4426efbc575
BLAKE2b-256 2939c3a28b67a33bce5545d4f5d1fa26470f5feab4393a487c028864a350e0ce

See more details on using hashes here.

Provenance

The following attestation bundles were made for intake_esm_access-2025.9.16-py3-none-any.whl:

Publisher: pypi.yml on ACCESS-NRI/intake-esm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page