intake-virtual-icechunk

An intake plugin for building and reading Icechunk stores via VirtualiZarr and intake-esm.

Concept

The goal is a pipeline that takes a pre-built intake-esm datastore and produces a single virtual Icechunk store that mirrors its structure:

  1. Open a pre-built intake-esm datastore with intake-esm.
  2. For each dataset in the catalog, open the constituent files with VirtualiZarr to create virtual references — no data is copied.
  3. Write each dataset as a named Zarr group inside one Icechunk store, using the catalog's groupby_attrs to derive the group name.
  4. Expose the result through an intake driver (virtual_icechunk) that hides Icechunk-specific details (sessions, stores, branches). The interface is a hybrid of an esm-datastore and an xarray.Dataset: it defaults to xarray semantics wherever possible and falls back to esm-datastore conventions only where necessary (e.g. catalog search and group selection).

The end result is one Icechunk store, one group per dataset, fully virtual (no data duplication), and accessible via intake.open_virtual_icechunk().
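
As an illustration of the "one group per dataset" idea, the sketch below groups catalog rows by a list of groupby_attrs and joins the attribute values into a group name. It uses only the standard library and is not the package's implementation. The rows, the column names (model, experiment, variable, path), the helper group_files and the "." separator are all assumptions made for this example; intake-esm joins groupby_attrs values with "." by default to form dataset keys, but whether this package names its Zarr groups the same way is not stated here.

```python
from collections import defaultdict

# Hypothetical catalog rows (one per file); the column names and paths are
# invented for this example and do not come from a real catalog.
ROWS = [
    {"model": "ACCESS-ESM1-5", "experiment": "historical", "variable": "tas", "path": "/data/tas_1850.nc"},
    {"model": "ACCESS-ESM1-5", "experiment": "historical", "variable": "tas", "path": "/data/tas_1900.nc"},
    {"model": "ACCESS-ESM1-5", "experiment": "historical", "variable": "pr", "path": "/data/pr_1850.nc"},
]


def group_files(rows, groupby_attrs, sep="."):
    """Map each derived group name to the file paths that make up that dataset.

    The name is built by joining the row's groupby_attrs values with `sep`.
    The separator is an assumption, chosen to match intake-esm's default keys.
    """
    groups = defaultdict(list)
    for row in rows:
        name = sep.join(str(row[attr]) for attr in groupby_attrs)
        groups[name].append(row["path"])
    return dict(groups)


groups = group_files(ROWS, ["model", "experiment", "variable"])
for name, paths in groups.items():
    print(name, paths)
```

In the pipeline described above, each key of such a mapping would become one named group in the store, and each list of paths would be the set of files opened with VirtualiZarr for that group.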

This package provides two things:

  1. Building (IcechunkStoreBuilder) — given a pre-built intake-esm catalog, creates virtual references with VirtualiZarr and writes each dataset as a named Zarr group inside a single Icechunk store.
  2. Reading (IcechunkSource) — an intake driver for opening a group from an Icechunk store as an xarray.Dataset via intake.open_virtual_icechunk().

Installation

pip install intake-virtual-icechunk

License

Apache-2.0. See LICENSE for details.
