Utilities for working with ZARR data and AIND metadata

These details have not been verified by PyPI

Project links

Project description

aind-zarr-utils

Code Style Interrogate Coverage Python

A Python utility library developed by Allen Institute for Neural Dynamics for working with ZARR files and AIND metadata. This package enables converting ZARR datasets to SimpleITK and ANTs images, processing neuroimaging annotation data from Neuroglancer, and handling anatomical coordinate transformations.

Key Features

ZARR ↔ Image Conversion: Convert ZARR datasets to SimpleITK and ANTs images with proper coordinate system handling
Neuroglancer Integration: Process annotation layers and coordinate transforms from Neuroglancer
Coordinate Transformations: Handle point transformations from image space to anatomical space (LPS coordinates)
Multi-source JSON Reading: Unified JSON loading from local files, HTTP URLs, and S3 URIs
Pipeline-specific Corrections: Version-based spatial domain corrections for pipeline compatibility
S3 Integration: Built-in support for AWS S3 with caching and anonymous access
CCF Registration: Pipeline-specific coordinate transformations and CCF registration utilities

Usage

aind_zarr_utils provides an anatomically aware platform for interacting with AIND Zarr images and their corresponding metadata.

To load an image, first read the metadata:

from aind_s3_cache.json_utils import get_json
metadata_path = 'path_to_metadata'
metadata = {'acquisition':get_json(metadata_path)}

you can then use the "zarr_to_ants" or "zarr_to_sitk" to load the zarr as an ANTs image or SITK image, respectively. i.e.:

from aind_zarr_utils.zarr import zarr_to_ants,zarr_to_sitk
channel_path = 'path_to_image'
level = 3 # Zarr level to read
ants_image = zarr_to_ants(channel_path, metadata, level=level)
sitk_image = zarr_to_sitk(channel_path, metadata, level=level)

In some cases, it is useful to interact with point data in image space, without explicitly loading the underlying image. In this case, aind_zarr_utils allows for the use of a "stub" image. Here, an empty sitk image will be loaded. This image will have origin, direction, and spacing that match the full image, but takes up minimal space in memory. Once a stub image is loaded, regular SITK functions for moving from, for example, indices to physical space can be used.

from aind_zarr_utils.zarr import zarr_to_sitk_stub
sitk_stub = zarr_to_sitk_stub(channel_path,metadata,level

Stub images are particularly useful when dealing with points annotated in image space. aind_zarr_utils includes functionality for reading neuroglancer annotations. Since neuroglancer itself is not anatomically aware, this operation also depends on knowledge of the metadata. i.e.:

neuroglancer_json = get_json('path_to_neuroglancer.json')
points_neuroglancer_indicies,_ = neuroglancer_annotations_to_indices(neuroglancer_json)
points_image_anatomical = annotation_indices_to_anatomical(stub,points_neuroglancer_indicies)

Basic ZARR to Image Conversion

from aind_zarr_utils import zarr_to_ants, zarr_to_sitk, zarr_to_sitk_stub

# Convert ZARR to ANTs image with anatomical coordinates
ants_img = zarr_to_ants(zarr_uri, metadata, level=3, scale_unit="millimeter")

# Convert ZARR to SimpleITK image
sitk_img = zarr_to_sitk(zarr_uri, metadata, level=3, scale_unit="millimeter")

# Create stub image for coordinate transformations only
stub_img = zarr_to_sitk_stub(zarr_uri, metadata, level=0)

Processing Neuroglancer Annotations

from aind_zarr_utils import neuroglancer_annotations_to_indices, neuroglancer_annotations_to_anatomical

# Get points in voxel indices
annotations, descriptions = neuroglancer_annotations_to_indices(neuroglancer_data)

# Transform to anatomical coordinates (LPS)
physical_points, descriptions = neuroglancer_annotations_to_anatomical(
    neuroglancer_data, zarr_uri, metadata, scale_unit="millimeter"
)

Multi-source JSON Loading

from aind_zarr_utils import get_json

# Automatically handles local files, URLs, and S3 URIs
data = get_json("s3://aind-open-data/path/to/file.json")
data = get_json("https://example.com/data.json")
data = get_json("/local/path/data.json")

Pipeline-specific Domain Corrections

from aind_zarr_utils import mimic_pipeline_zarr_to_anatomical_stub, neuroglancer_to_ccf

# Create stub with pipeline version-specific corrections
pipeline_stub = mimic_pipeline_zarr_to_anatomical_stub(
    zarr_uri, metadata, processing_data
)

# Transform neuroglancer annotations with pipeline-corrected spatial properties
points_ccf, descriptions = neuroglancer_to_ccf(
    neuroglancer_data, zarr_uri, metadata, processing_data,
    template_used="SmartSPIM-template_2024-05-16_11-26-14"
)

Installation

From PyPI (when available)

pip install aind-zarr-utils

From Source

For development or latest features:

git clone https://github.com/AllenNeuralDynamics/aind-zarr-utils.git
cd aind-zarr-utils
pip install -e .

Development Installation

For contributing to the project:

git clone https://github.com/AllenNeuralDynamics/aind-zarr-utils.git
cd aind-zarr-utils
pip install -e .[dev]
# or with uv:
uv sync

Requirements

Python ≥3.10
Core dependencies: NumPy, ome-zarr, SimpleITK, antspyx, requests
AIND ecosystem: aind-anatomical-utils, aind-registration-utils
Cloud: boto3, s3fs for AWS S3 integration

Key Concepts

Coordinate Systems

LPS (Left-Posterior-Superior): Standard output coordinate system for all functions
Coordinate transformations: Explicit handling between neuroimaging orientations (RAS/LPS)
Neuroglancer points: Assumed to be in order z, y, x, t (only z, y, x returned)

Data Sources

S3 Integration: Primary bucket aind-open-data with anonymous access support
ZARR Data: Multi-resolution support with automatic unit conversions
Pipeline Compatibility: Version-specific spatial domain corrections

Architecture

Modular Design: Separate modules for ZARR conversion, neuroglancer processing, annotations, JSON utilities
Caching: S3 resource caching with ETag-based validation
Error Handling: Comprehensive validation and meaningful error messages

Contributing

Development Workflow

Please test your changes using the full linting and testing suite:

./scripts/run_linters_and_checks.sh -c

Or run individual commands:

uv run --frozen ruff format          # Code formatting
uv run --frozen ruff check           # Linting
uv run --frozen mypy                 # Type checking
uv run --frozen interrogate -v       # Documentation coverage
uv run --frozen codespell --check-filenames  # Spell checking
uv run --frozen pytest --cov aind_zarr_utils # Tests with coverage

Pull requests

For internal members, please create a branch. For external members, please fork the repository and open a pull request from the fork. We'll primarily use Angular style for commit messages. Roughly, they should follow the pattern:

<type>(<scope>): <short summary>

where scope (optional) describes the packages affected by the code changes and type (mandatory) is one of:

build: Changes that affect build tools or external dependencies (example scopes: pyproject.toml, setup.py)
ci: Changes to our CI configuration files and scripts (examples: .github/workflows/ci.yml)
docs: Documentation only changes
feat: A new feature
fix: A bugfix
perf: A code change that improves performance
refactor: A code change that neither fixes a bug nor adds a feature
test: Adding missing tests or correcting existing tests

Semantic Release

The table below, from semantic release, shows which commit message gets you which release type when semantic-release runs (using the default configuration):

Commit message	Release type
`fix(pencil): stop graphite breaking when too much pressure applied`	~~Patch~~ Fix Release, Default release
`feat(pencil): add 'graphiteWidth' option`	~~Minor~~ Feature Release
`perf(pencil): remove graphiteWidth option` `BREAKING CHANGE: The graphiteWidth option has been removed.` `The default graphite width of 10mm is always used for performance reasons.`	~~Major~~ Breaking Release (Note that the `BREAKING CHANGE:` token must be in the footer of the commit)

Documentation

To generate the rst files source files for documentation, run

sphinx-apidoc -o docs/source/ src

Then to create the documentation HTML files, run

sphinx-build -b html docs/source/ docs/build/html

More info on sphinx installation can be found here.

Read the Docs Deployment

Note: Private repositories require Read the Docs for Business account. The following instructions are for a public repo.

The following are required to import and build documentations on Read the Docs:

A Read the Docs user account connected to Github. See here for more details.
Read the Docs needs elevated permissions to perform certain operations that ensure that the workflow is as smooth as possible, like installing webhooks. If you are not the owner of the repo, you may have to request elevated permissions from the owner/admin.
A .readthedocs.yaml file in the root directory of the repo. Here is a basic template:

# Read the Docs configuration file
# See https://docs.readthedocs.io/en/stable/config-file/v2.html for details

# Required
version: 2

# Set the OS, Python version, and other tools you might need
build:
  os: ubuntu-24.04
  tools:
    python: "3.13"

# Path to a Sphinx configuration file.
sphinx:
  configuration: docs/source/conf.py

# Declare the Python requirements required to build your documentation
python:
  install:
    - method: pip
      path: .
      extra_requirements:
        - dev

Here are the steps for building docs in Read the Docs. See here for detailed instructions:

From Read the Docs dashboard, click on Add project.
For automatic configuration, select Configure automatically and type the name of the repo. A repo with public visibility should appear as you type.
Follow the subsequent steps.
For manual configuration, select Configure manually and follow the subsequent steps

Once a project is created successfully, you will be able to configure/modify the project's settings; such as Default version, Default branch etc.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.14.1

Apr 28, 2026

0.14.0

Feb 11, 2026

0.13.2

Dec 10, 2025

0.13.1

Dec 10, 2025

This version

0.13.0

Dec 10, 2025

0.12.0

Dec 3, 2025

0.11.6

Dec 3, 2025

0.11.5

Nov 7, 2025

0.11.4

Nov 6, 2025

0.11.3

Nov 4, 2025

0.11.2

Oct 24, 2025

0.11.1

Oct 21, 2025

0.11.0

Oct 19, 2025

0.10.4

Oct 14, 2025

0.10.3

Oct 10, 2025

0.10.2

Oct 9, 2025

0.10.1

Oct 9, 2025

0.10.0

Oct 9, 2025

0.9.0

Oct 8, 2025

0.8.0

Sep 29, 2025

0.7.0

Sep 28, 2025

0.6.0

Sep 19, 2025

0.5.1

Sep 18, 2025

0.5.0

Sep 17, 2025

0.4.0

Sep 17, 2025

0.3.1

Sep 11, 2025

0.3.0

Sep 10, 2025

0.2.0

Sep 9, 2025

0.1.4

Aug 7, 2025

0.1.3

Jul 29, 2025

0.1.2

Jul 1, 2025

0.1.1

Jun 30, 2025

0.1.0

Jun 30, 2025

0.0.11

Jun 30, 2025

0.0.10

Jun 26, 2025

0.0.9

Jun 26, 2025

0.0.8

Jun 26, 2025

0.0.7

Jun 26, 2025

0.0.6

Jun 26, 2025

0.0.5

Jun 13, 2025

0.0.4

Apr 24, 2025

0.0.3

Apr 23, 2025

0.0.2

Apr 23, 2025

0.0.1

Apr 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aind_zarr_utils-0.13.0.tar.gz (352.2 kB view details)

Uploaded Dec 10, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

aind_zarr_utils-0.13.0-py3-none-any.whl (37.4 kB view details)

Uploaded Dec 10, 2025 Python 3

File details

Details for the file aind_zarr_utils-0.13.0.tar.gz.

File metadata

Download URL: aind_zarr_utils-0.13.0.tar.gz
Upload date: Dec 10, 2025
Size: 352.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.17 {"installer":{"name":"uv","version":"0.9.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for aind_zarr_utils-0.13.0.tar.gz
Algorithm	Hash digest
SHA256	`ac0cc8f44821de3dee76062516b5c5bbd4ded2fff440142fc9e3c23c1e153498`
MD5	`ee3f38315ebc09398f7b8c74702f479d`
BLAKE2b-256	`ee94c7947f2749c00d094084ce31eef3158b3dbc5f4f32519a9512ba1edc1757`

See more details on using hashes here.

File details

Details for the file aind_zarr_utils-0.13.0-py3-none-any.whl.

File metadata

Download URL: aind_zarr_utils-0.13.0-py3-none-any.whl
Upload date: Dec 10, 2025
Size: 37.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.17 {"installer":{"name":"uv","version":"0.9.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for aind_zarr_utils-0.13.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c70974b5234771f3d1f6bed84b7d1af8f1070df8ddc52df3549c627b1b943969`
MD5	`f6a72da8d8a1f275c6ddf9c162886cc3`
BLAKE2b-256	`9842db90c8461d3023fc55d63c3ea41f307b57fd55dfdb6e736a1464afb6733e`

See more details on using hashes here.

aind-zarr-utils 0.13.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

aind-zarr-utils

Key Features

Usage

Basic ZARR to Image Conversion

Processing Neuroglancer Annotations

Multi-source JSON Loading

Pipeline-specific Domain Corrections

Installation

From PyPI (when available)

From Source

Development Installation

Requirements

Key Concepts

Coordinate Systems

Data Sources

Architecture

Contributing

Development Workflow

Pull requests

Semantic Release

Documentation

Read the Docs Deployment

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes