An intake plugin for parsing an Earth System Model (ESM) catalog and loading netCDF files and/or Zarr stores into Xarray datasets.

Maintainers

andersy005 jukent mlevy-ncar

Project description

Intake-esm

Motivation

Computer simulations of the Earth’s climate and weather generate huge amounts of data. These data are often persisted on HPC systems or in the cloud across multiple data assets in a variety of formats (netCDF, Zarr, etc.). Finding, investigating, and loading these data assets into compute-ready data containers costs time and effort. Before loading and analyzing a specific data set, the user needs to know which data sets are available and which attributes describe each one.

Loading these assets into data array containers such as xarray can be a daunting task because of the large number of files a user may be interested in. Intake-esm aims to address these issues by providing the functionality needed for searching, discovering, and accessing/loading data.

Overview

intake-esm is a data cataloging utility built on top of intake, pandas, polars and xarray, and it's pretty awesome!

Opening an ESM catalog definition file: An Earth System Model (ESM) catalog file is a JSON file that conforms to the ESM Collection Specification. When provided a link/path to an ESM catalog file, intake-esm establishes a link to a database (CSV file) that contains data asset locations and associated metadata (i.e., which experiment and model they come from). The catalog JSON file can be stored on a local filesystem or hosted on a remote server.
```
In [1]: import intake

In [2]: import intake_esm

In [3]: cat_url = intake_esm.tutorial.get_url("google_cmip6")

In [4]: cat = intake.open_esm_datastore(cat_url)

In [5]: cat
Out[5]: <GOOGLE-CMIP6 catalog with 4 dataset(s) from 261 asset(s)>
```
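
To make the JSON-plus-CSV layout concrete, here is a minimal sketch using only the Python standard library. The top-level field names (`esmcat_version`, `catalog_file`, `attributes`, `assets`) are assumed to follow the ESM Collection Specification; the catalog id, column values, and `gs://example-bucket/...` paths are hypothetical and do not come from any real catalog.

```python
import csv
import io
import json

# Hypothetical catalog definition (the JSON part). All values are illustrative.
catalog_json = """
{
  "esmcat_version": "0.1.0",
  "id": "example-catalog",
  "description": "Toy catalog for illustration only",
  "catalog_file": "example-catalog.csv",
  "attributes": [
    {"column_name": "experiment_id"},
    {"column_name": "source_id"},
    {"column_name": "variable_id"}
  ],
  "assets": {"column_name": "path", "format": "zarr"}
}
"""

# Hypothetical database (the CSV part): one row per data asset.
catalog_csv = """experiment_id,source_id,variable_id,path
historical,MODEL-A,o2,gs://example-bucket/historical/MODEL-A/o2
ssp585,MODEL-A,o2,gs://example-bucket/ssp585/MODEL-A/o2
"""

definition = json.loads(catalog_json)
rows = list(csv.DictReader(io.StringIO(catalog_csv)))

# The definition names the column that holds each asset's location.
path_column = definition["assets"]["column_name"]
asset_paths = [row[path_column] for row in rows]

# The remaining listed columns are the searchable metadata attributes.
attribute_columns = [attr["column_name"] for attr in definition["attributes"]]

print(asset_paths)
print(attribute_columns)
```

In practice intake-esm reads both parts for you; the sketch only shows how the definition file describes the columns of the asset table.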

Search and Discovery: intake-esm provides functionality to execute queries against the catalog:

```
In [5]: cat_subset = cat.search(
   ...:     experiment_id=["historical", "ssp585"],
   ...:     table_id="Oyr",
   ...:     variable_id="o2",
   ...:     grid_label="gn",
   ...: )

In [6]: cat_subset
Out[6]: <GOOGLE-CMIP6 catalog with 4 dataset(s) from 261 asset(s)>
```
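
The query semantics can be pictured with a small stand-in that works on plain dictionaries. This is a conceptual sketch, not intake-esm's implementation: it assumes that a list of values matches any of its items and that different keywords are combined with AND. The `search` helper and the sample rows are hypothetical.

```python
def search(rows, **query):
    """Return rows matching every keyword; a list value matches any of its items."""
    def matches(row):
        for column, wanted in query.items():
            # Normalize a single value to a one-element list.
            allowed = wanted if isinstance(wanted, (list, tuple, set)) else [wanted]
            if row.get(column) not in allowed:
                return False
        return True

    return [row for row in rows if matches(row)]


# Hypothetical catalog rows (one per asset), for illustration only.
rows = [
    {"experiment_id": "historical", "table_id": "Oyr", "variable_id": "o2", "grid_label": "gn"},
    {"experiment_id": "ssp585", "table_id": "Oyr", "variable_id": "o2", "grid_label": "gn"},
    {"experiment_id": "ssp585", "table_id": "Amon", "variable_id": "tas", "grid_label": "gn"},
    {"experiment_id": "piControl", "table_id": "Oyr", "variable_id": "o2", "grid_label": "gn"},
]

subset = search(
    rows,
    experiment_id=["historical", "ssp585"],
    table_id="Oyr",
    variable_id="o2",
    grid_label="gn",
)
print([row["experiment_id"] for row in subset])
```

Only the first two rows satisfy every condition, so the subset contains the `historical` and `ssp585` entries.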

Access: when the user is satisfied with the results of their query, they can load data assets (netCDF and/or Zarr stores) into xarray datasets:

```
In [7]: dset_dict = cat_subset.to_dataset_dict()

--> The keys in the returned dictionary of datasets are constructed as follows:
        'activity_id.institution_id.source_id.experiment_id.table_id.grid_label'
|███████████████████████████████████████████████████████████████| 100.00% [2/2 00:18<00:00]
```
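
The key pattern above suggests how several assets can end up in one dataset: rows that share the same values in the key columns map to the same dictionary key. The following sketch illustrates that idea on plain dictionaries. The `group_assets` helper, the row values, and the paths are hypothetical, and the grouping behaviour is an assumption made for illustration rather than a description of intake-esm internals.

```python
from collections import defaultdict

# Columns that make up the dataset key, in the order shown in the message above.
KEY_COLUMNS = [
    "activity_id",
    "institution_id",
    "source_id",
    "experiment_id",
    "table_id",
    "grid_label",
]


def group_assets(rows, key_columns=KEY_COLUMNS, sep="."):
    """Map each dot-joined key to the list of asset paths that share it."""
    groups = defaultdict(list)
    for row in rows:
        key = sep.join(row[column] for column in key_columns)
        groups[key].append(row["path"])
    return dict(groups)


# Hypothetical rows: two assets share one key, a third has a different experiment.
base = {
    "activity_id": "CMIP",
    "institution_id": "EXAMPLE-INST",
    "source_id": "MODEL-A",
    "table_id": "Oyr",
    "grid_label": "gn",
}
rows = [
    {**base, "experiment_id": "historical", "path": "store-1"},
    {**base, "experiment_id": "historical", "path": "store-2"},
    {**base, "experiment_id": "ssp585", "path": "store-3"},
]

groups = group_assets(rows)
for key, paths in groups.items():
    print(key, paths)
```

Here three assets collapse into two keys, which mirrors how a catalog can report fewer datasets than assets.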

See documentation for more information.

Installation

Intake-esm can be installed from PyPI with pip:

```
python -m pip install intake-esm
```

It is also available from conda-forge for conda installations:

```
conda install -c conda-forge intake-esm
```
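
After installing, one way to confirm which version ended up in the active environment is to query the installed distribution metadata. This is a small sketch using the standard library's `importlib.metadata`; the `installed_version` helper is a hypothetical name introduced here, not part of intake-esm.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string for a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints a version string if intake-esm is installed in this environment, else None.
print(installed_version("intake-esm"))
```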

Release history

Version	Release date
2025.7.9 (this version)	Jul 10, 2025
2025.2.3	Feb 6, 2025
2024.2.6	Feb 6, 2024
2023.11.10	Nov 10, 2023
2023.10.27	Oct 27, 2023
2023.7.7	Jul 7, 2023
2023.6.14	Jun 14, 2023
2023.4.20	Apr 20, 2023
2022.9.18	Sep 18, 2022
2021.8.17	Aug 17, 2021
2021.1.15	Jan 15, 2021
2020.12.18	Dec 18, 2020
2020.11.4	Nov 4, 2020
2020.8.15	Aug 15, 2020
2020.6.11	Jun 11, 2020
2020.5.21	May 21, 2020
2020.5.1	May 1, 2020
2020.3.16.2	Mar 26, 2020
2020.3.16.1	Mar 18, 2020
2020.3.16	Mar 16, 2020
2019.12.13	Dec 13, 2019
2019.10.15	Oct 15, 2019
2019.8.23	Aug 23, 2019
2019.8.5	Aug 5, 2019
2019.5.11	May 12, 2019
2019.4.26.1	Apr 26, 2019
2019.4.26	Apr 26, 2019
2019.2.28	Feb 27, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

intake_esm-2025.7.9.tar.gz (116.4 kB)

Uploaded Jul 10, 2025 Source

Built Distribution

intake_esm-2025.7.9-py3-none-any.whl (33.6 kB)

Uploaded Jul 10, 2025 Python 3

File details

Details for the file intake_esm-2025.7.9.tar.gz.

File metadata

Download URL: intake_esm-2025.7.9.tar.gz
Upload date: Jul 10, 2025
Size: 116.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for intake_esm-2025.7.9.tar.gz
Algorithm	Hash digest
SHA256	`d82a61e13c11a01a6c50cfebc885bb4c41fc943d907b70a131cef32734458c8b`
MD5	`ee3cbbc7de7c1f46e8a04ff3aac73c59`
BLAKE2b-256	`00550be2d4d30b03336b8e40582bec80f5473e92db8882052554c29b67ba5cac`
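
A downloaded file can be checked against the published SHA256 digest with the standard library's `hashlib`. The sketch below reads the file in chunks so that large archives do not need to fit in memory. The helper names are hypothetical, and the commented usage assumes the source distribution was saved to the current directory under its original file name.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Return True if the file's SHA256 digest equals the published one."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# SHA256 digest listed above for intake_esm-2025.7.9.tar.gz.
EXPECTED_SDIST_SHA256 = "d82a61e13c11a01a6c50cfebc885bb4c41fc943d907b70a131cef32734458c8b"

# Example usage (assumes the file exists in the current directory):
# print(matches_published_hash("intake_esm-2025.7.9.tar.gz", EXPECTED_SDIST_SHA256))
```

The same digest appears as the subject digest in the provenance statement for this file, so a match ties the download to the attested artifact.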

Provenance

The following attestation bundles were made for intake_esm-2025.7.9.tar.gz:

Publisher: pypi.yml on intake/intake-esm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: intake_esm-2025.7.9.tar.gz
- Subject digest: d82a61e13c11a01a6c50cfebc885bb4c41fc943d907b70a131cef32734458c8b
- Sigstore transparency entry: 269415549
- Sigstore integration time: Jul 10, 2025
Source repository:
- Permalink: intake/intake-esm@443ccf39c37e175bc20e2cd87ea55db8fc8ffcb4
- Branch / Tag: refs/tags/v2025.7.9
- Owner: https://github.com/intake
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi.yml@443ccf39c37e175bc20e2cd87ea55db8fc8ffcb4
- Trigger Event: release

File details

Details for the file intake_esm-2025.7.9-py3-none-any.whl.

File metadata

Download URL: intake_esm-2025.7.9-py3-none-any.whl
Upload date: Jul 10, 2025
Size: 33.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for intake_esm-2025.7.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`eb444e0bda55c40af9fd15f8fcecd04571e43eeaaaab46b963b353c3e01dab45`
MD5	`290019aecf89e7e1a7b2347ec9f18947`
BLAKE2b-256	`a55e20de2f054d670a617386debd28b9f28548a2d80e346ffc776a29b7f15f57`

Provenance

The following attestation bundles were made for intake_esm-2025.7.9-py3-none-any.whl:

Publisher: pypi.yml on intake/intake-esm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: intake_esm-2025.7.9-py3-none-any.whl
- Subject digest: eb444e0bda55c40af9fd15f8fcecd04571e43eeaaaab46b963b353c3e01dab45
- Sigstore transparency entry: 269415572
- Sigstore integration time: Jul 10, 2025
Source repository:
- Permalink: intake/intake-esm@443ccf39c37e175bc20e2cd87ea55db8fc8ffcb4
- Branch / Tag: refs/tags/v2025.7.9
- Owner: https://github.com/intake
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: pypi.yml@443ccf39c37e175bc20e2cd87ea55db8fc8ffcb4
- Trigger Event: release
