
Recompress zarr chunks in-place


mysquishy

"I shall call him Squishy and he shall be mine and he shall be my Squishy." — Dory


Change the compression of a zarr array in-place.

This works by loading each chunk's bytes, decoding them with the current compressor, re-encoding them with the new one, and saving them back in place. At the end, the .zarray file at the root is updated.
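The load, decode, recode, save loop can be sketched with the standard library alone. This is a minimal illustration, not mysquishy's actual implementation: the function name `recompress_sketch`, its `decode`/`encode` callables, and the flat directory layout (zarr v2, chunk files next to `.zarray`, no filters) are all assumptions made for the example.

```python
import json
import os
import zlib


def recompress_sketch(root, decode, encode, new_compressor_config):
    """Rewrite every chunk file under `root`, then update `.zarray`.

    Hypothetical helper. Assumes a zarr v2 directory store whose chunk
    files sit directly beside the `.zarray` file, with no filters.
    """
    nbytes_before = nbytes_after = 0
    for name in sorted(os.listdir(root)):
        path = os.path.join(root, name)
        # Skip metadata (.zarray, .zattrs) and anything that is not a file.
        if name.startswith(".") or not os.path.isfile(path):
            continue
        with open(path, "rb") as f:
            old = f.read()             # load the chunk bytes
        new = encode(decode(old))      # decode, then recode
        with open(path, "wb") as f:
            f.write(new)               # save in place
        nbytes_before += len(old)
        nbytes_after += len(new)

    # Only after every chunk is rewritten, record the new compressor.
    meta_path = os.path.join(root, ".zarray")
    with open(meta_path) as f:
        meta = json.load(f)
    meta["compressor"] = new_compressor_config
    with open(meta_path, "w") as f:
        json.dump(meta, f, indent=4)
    return nbytes_before, nbytes_after
```

For instance, moving an array from zlib level 1 to level 9 would pass `zlib.decompress` as `decode`, `lambda raw: zlib.compress(raw, 9)` as `encode`, and `{"id": "zlib", "level": 9}` as the new config. Between the first chunk write and the final metadata write, the chunks and `.zarray` disagree, which is why an interrupted run leaves the array unreadable.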

Note that this only works for single arrays: there should be a .zarray file at the root. It is left as an exercise for the reader to figure out how to recompress every array in a zarr group.
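One possible starting point for that exercise is to walk the group directory and collect every subdirectory that holds a `.zarray` file. The helper below is a hypothetical sketch (the name `find_arrays` is not part of mysquishy) and assumes the group is a zarr v2 directory store on the local filesystem.

```python
import os


def find_arrays(group_root):
    """Yield each directory under `group_root` that contains a `.zarray` file."""
    for dirpath, dirnames, filenames in os.walk(group_root):
        if ".zarray" in filenames:
            # An array holds only chunks, so do not descend any further.
            dirnames[:] = []
            yield dirpath
```

Each yielded path could then be passed to `mysquishy.squish` in a loop, one array at a time.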

Contributions are very welcome.

⚠️ BAD SQUISHY! ⚠️


This is currently just a proof of concept. If it is interrupted, it will leave your zarr array in an inconsistent, broken state, and there may be bugs. For example, it makes no effort to account for missing keys. As above, contributions are very welcome, but in the meantime be careful: make a tiny zarr example similar to your data and check that it works there before running it on real data.
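One cautious way to run that check is to recompress a throwaway copy, so the tiny example itself stays intact for comparison. The helper below is a hypothetical sketch, not part of mysquishy; `squish_copy` and its `squish` argument are names chosen for illustration, and it assumes the array is a local directory small enough to copy.

```python
import os
import shutil
import tempfile


def squish_copy(array_path, squish):
    """Run `squish` on a temporary copy of `array_path`; the original is untouched."""
    with tempfile.TemporaryDirectory() as scratch:
        name = os.path.basename(os.path.normpath(array_path))
        copy_path = os.path.join(scratch, name)
        shutil.copytree(array_path, copy_path)
        # The copy is deleted when the block exits; only the result is kept.
        return squish(copy_path)
```

Calling `squish_copy('path/to/tiny.zarr', mysquishy.squish)` would return the same byte counts as a real run without modifying the original directory.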

Usage

From Python:

import mysquishy
nbytes_before, nbytes_after = mysquishy.squish('path/to/array.zarr')

From the command line:

mysquishy path/to/array.zarr

