
A BioIO reader plugin for reading Zarr files in the OME format.


bioio-ome-zarr

Build Status · PyPI version · License · Python 3.11–3.13

A BioIO reader plugin for reading OME-Zarr images using ome-zarr.


Documentation

See the full documentation on our GitHub Pages site; the generic usage and installation instructions there apply to this package.

Information about the base reader this package relies on can be found in the bioio-base repository.

Installation

Stable Release: pip install bioio-ome-zarr
Development Head: pip install git+https://github.com/bioio-devs/bioio-ome-zarr.git

Example Usage (see full documentation for more examples)

Install bioio-ome-zarr alongside bioio:

pip install bioio bioio-ome-zarr

This example shows a simple use case: accessing the pixel data of an image by explicitly passing this Reader into the BioImage. Passing the Reader to the BioImage instance is optional; bioio automatically detects installed plug-ins and selects the most recently installed plug-in that supports the given file.

from bioio import BioImage
import bioio_ome_zarr

img = BioImage("my_file.zarr", reader=bioio_ome_zarr.Reader)
img.data

Reading from AWS S3

To read from private S3 buckets, credentials must be configured. Public buckets can be accessed without credentials.

from bioio import BioImage
path = "https://allencell.s3.amazonaws.com/aics/nuc-morph-dataset/hipsc_fov_nuclei_timelapse_dataset/hipsc_fov_nuclei_timelapse_data_used_for_analysis/baseline_colonies_fov_timelapse_dataset/20200323_09_small/raw.ome.zarr"
image = BioImage(path)
print(image.get_image_dask_data())

Writing OME-Zarr Stores

The OMEZarrWriter can write both Zarr v2 (NGFF 0.4) and Zarr v3 (NGFF 0.5) formats.

Basic writer example (2D YX)

from bioio_ome_zarr.writers import OMEZarrWriter
import numpy as np

# Minimal 2D example (Y, X)
data = np.random.randint(0, 255, size=(64, 64), dtype=np.uint8)

writer = OMEZarrWriter(
    store="basic.zarr",
    level_shapes=(64, 64),   # (Y, X)
    dtype=data.dtype,
)

# Write the data to the store
writer.write_full_volume(data)

5D (TCZYX), with one extra resolution level

from bioio_ome_zarr.writers import OMEZarrWriter, Channel
import numpy as np

level_shapes = [
    (2, 3, 4, 256, 256),  # L0 full res
    (2, 3, 4, 128, 128),  # L1 downsampled Y/X by 2
]

data = np.random.randint(0, 255, size=level_shapes[0], dtype=np.uint8)
channels = [Channel(label=f"c{i}", color="FF0000") for i in range(data.shape[1])]

writer = OMEZarrWriter(
    store="output.zarr",
    level_shapes=level_shapes,
    dtype=data.dtype,
    zarr_format=3,  # 2 for Zarr v2
    channels=channels,
    axes_names=["t", "c", "z", "y", "x"],
    axes_types=["time", "channel", "space", "space", "space"],
    axes_units=[None, None, "micrometer", "micrometer", "micrometer"],
)

writer.write_full_volume(data)
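The second level above halves Y and X relative to level 0. A small helper can generate such per-level shapes; this is a hypothetical sketch, not part of the bioio-ome-zarr API:

```python
# Hypothetical helper (not part of bioio-ome-zarr): build per-level shapes
# by halving the trailing Y/X axes, matching the level_shapes list above.
def pyramid_level_shapes(base_shape, num_levels):
    """Return num_levels shapes; each extra level halves the last two axes."""
    levels = [tuple(base_shape)]
    for _ in range(num_levels - 1):
        prev = levels[-1]
        levels.append(prev[:-2] + (max(1, prev[-2] // 2), max(1, prev[-1] // 2)))
    return levels
```

For example, `pyramid_level_shapes((2, 3, 4, 256, 256), 2)` reproduces the two-level list used in the example above.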

Full writer parameters and API

| Parameter | Type | Description |
|---|---|---|
| store | str or zarr.storage.StoreLike | Filesystem path, fsspec URL, or Store-like for the root group. |
| level_shapes | Sequence[int] or Sequence[Sequence[int]] | Either a single N‑D shape (one level) or an explicit per‑level list of shapes (level 0 first). Examples above. |
| dtype | np.dtype or str | On‑disk dtype (e.g., uint8, uint16). |
| chunk_shape | Sequence[int] or Sequence[Sequence[int]] or None | Chunk shape: single (applied to all levels) or per‑level. If None, a ≈16 MiB chunk is suggested per level via multiscale_chunk_size_from_memory_target. |
| shard_shape | Sequence[int] or Sequence[Sequence[int]] or None | Zarr v3 only. Single or per‑level shard shapes. Each shard dim must be an integer multiple of the corresponding chunk dim. |
| compressor | BloscCodec (v3) or numcodecs.abc.Codec (v2) or None | Compression codec. Defaults to zstd + bitshuffle. |
| zarr_format | Literal[2, 3] | Target Zarr format: 2 (NGFF 0.4) or 3 (NGFF 0.5). Default 3. |
| image_name | str or None | Name used in multiscales metadata. Default: "Image". |
| channels | list[Channel] or None | OMERO‑style channel metadata. |
| rdefs | dict or None | OMERO rendering defaults. |
| creator_info | dict or None | Optional creator block (NGFF 0.5). |
| root_transform | dict[str, Any] or None | Optional transform placed at the multiscale root. |
| axes_names | list[str] or None | Axis names; defaults to the last N of ["t","c","z","y","x"]. |
| axes_types | list[str] or None | Axis types; defaults to ["time","channel","space", …]. |
| axes_units | list[str] or None | Physical units per axis. |
| physical_pixel_size | list[float] or None | Level‑0 physical scale per axis. |
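The shard_shape constraint (each shard dimension an integer multiple of the matching chunk dimension) is easy to validate up front. A minimal sketch, not part of the package API:

```python
# Illustrative validator (not package code): Zarr v3 shards must tile
# whole chunks, so every shard dim must be a multiple of the chunk dim.
def shard_matches_chunks(shard_shape, chunk_shape):
    return len(shard_shape) == len(chunk_shape) and all(
        s % c == 0 for s, c in zip(shard_shape, chunk_shape)
    )
```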

Methods

  • write_full_volume(input_data: np.ndarray | dask.array.Array) -> None
    Write level‑0 and all declared pyramid levels. NumPy arrays are wrapped into Dask using level‑0 chunks.

  • write_timepoints(data: np.ndarray | dask.array.Array, *, start_T_src=0, start_T_dest=0, total_T: int | None = None) -> None
    Stream along the T axis from data into the store. Spatial axes are downsampled for lower levels; T/C are preserved.

  • preview_metadata() -> dict[str, Any]
    Returns the NGFF metadata dict(s) that would be written (no IO).

Writing a full volume (NumPy or Dask)

# NumPy (wrapped automatically)
writer.write_full_volume(data)

# Or pass an explicit Dask array
import dask.array as da
writer.write_full_volume(da.from_array(data, chunks=(1, 1, 1, 64, 64)))

Writing timepoints in batches (streaming along T)

# Suppose your writer axes include "T"; write timepoints in flexible batches
from bioio import BioImage
import dask.array as da

bioimg = BioImage("/path/to/any/bioimage")
data = bioimg.get_image_dask_data()

# Write the entire timeseries at once
writer.write_timepoints(data)

# Write in 5-timepoint batches
for t in range(0, data.shape[0], 5):
    writer.write_timepoints(
        data,
        start_T_src=t,
        start_T_dest=t,
        total_T=5,
    )

# Write source timepoints [10:20] into destination positions [50:60]
writer.write_timepoints(
    data,
    start_T_src=10,
    start_T_dest=50,
    total_T=10,
)
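The batching loop above advances the source and destination offsets in lockstep. The index arithmetic can be sketched on its own; the parameter names mirror write_timepoints, but this helper is hypothetical, not part of the package:

```python
# Hypothetical helper: compute (start_T_src, start_T_dest, total_T) triples
# for writing a T axis of length total_T in fixed-size batches.
def timepoint_batches(total_T, batch_size, dest_offset=0):
    batches = []
    for t in range(0, total_T, batch_size):
        count = min(batch_size, total_T - t)  # last batch may be short
        batches.append((t, dest_offset + t, count))
    return batches
```

For a 13-frame series written in batches of 5, this yields two full batches and one short batch of 3.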

Custom chunking per level

# Provide one chunk shape per level; each must match the array ndim
import numpy as np

from bioio_ome_zarr.writers import OMEZarrWriter

chunk_shape = (
    (1, 1, 1, 64, 64),  # level 0
    (1, 1, 1, 32, 32),  # level 1
)
writer = OMEZarrWriter(
    store="custom_chunks.zarr",
    level_shapes=[(1, 1, 2, 256, 256), (1, 1, 2, 128, 128)],
    dtype="uint16",
    zarr_format=3,
    chunk_shape=chunk_shape,
)

# Example data matching the declared shape
arr = np.random.randint(0, 65535, size=(1, 1, 2, 256, 256), dtype=np.uint16)
writer.write_full_volume(arr)

Sharded arrays (v3 only)

import numpy as np
from zarr.codecs import BloscCodec, BloscShuffle

from bioio_ome_zarr.writers import OMEZarrWriter

writer = OMEZarrWriter(
    store="sharded_v3.zarr",
    level_shapes=[(1, 1, 16, 1024, 1024), (1, 1, 16, 512, 512)],
    dtype="uint8",
    zarr_format=3,
    chunk_shape=[(1, 1, 1, 128, 128), (1, 1, 1, 128, 128)],
    shard_shape=[(1, 1, 1, 256, 256), (1, 1, 1, 256, 256)],
    compressor=BloscCodec(cname="zstd", clevel=5, shuffle=BloscShuffle.bitshuffle),
)

writer.write_full_volume(
    np.random.randint(0, 255, size=(1, 1, 16, 1024, 1024), dtype=np.uint8)
)

Targeting Zarr v2 explicitly (NGFF 0.4)

import numcodecs
import numpy as np

from bioio_ome_zarr.writers import OMEZarrWriter

writer = OMEZarrWriter(
    store="target_v2.zarr",
    level_shapes=[(2, 1, 4, 256, 256), (2, 1, 4, 128, 128)],
    dtype="uint8",
    zarr_format=2,  # write NGFF 0.4
    compressor=numcodecs.Blosc(
        cname="zstd", clevel=3, shuffle=numcodecs.Blosc.BITSHUFFLE
    ),
)

writer.write_full_volume(
    np.random.randint(0, 255, size=(2, 1, 4, 256, 256), dtype=np.uint8)
)

Writing to S3 (or any fsspec URL)

# Requires credentials for private buckets; public buckets can be anonymous
import numpy as np

from bioio_ome_zarr.writers import OMEZarrWriter
writer = OMEZarrWriter(
    store="s3://my-bucket/path/to/out.zarr",
    level_shapes=(1, 2, 8, 2048, 2048),  # single level (TCZYX), no pyramid
    dtype="uint16",
    zarr_format=3,
)

writer.write_full_volume(
    np.random.randint(0, 65535, size=(1, 2, 8, 2048, 2048), dtype=np.uint16)
)

Writer Utility Functions

multiscale_chunk_size_from_memory_target(level_shapes, dtype, memory_target) -> list[tuple[int, ...]]
Suggests per-level chunk shapes that each fit within a fixed byte budget.

  • Works for any ndim (2…5).
  • Prioritizes the highest-index axis first (grow X, then Y, then Z, then C, then T).

Example: 16 MiB budget on large pyramids (rightmost-axis first)

from bioio_ome_zarr.writers.utils import multiscale_chunk_size_from_memory_target

# 4D (C, Z, Y, X) across 5 levels
level_shapes = [
    (8, 64, 4096, 4096),
    (8, 64, 2048, 2048),
    (8, 64, 1024, 1024),
    (8, 64,  512,  512),
    (8, 64,  256,  256),
]

# 16 MiB target
chunks = multiscale_chunk_size_from_memory_target(level_shapes, "uint16", 16 << 20)

# Resulting chunks (rightmost axes grow first):
# [
#     (1,  1, 2048, 4096),
#     (1,  2, 2048, 2048),
#     (1,  8, 1024, 1024),
#     (1, 32,  512,  512),
#     (2, 64,  256,  256),
# ]
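The greedy idea behind these numbers can be sketched in a few lines: grow the rightmost axis to fill the element budget, then carry the remainder leftward. This is a simplification, not the package implementation, though for the shapes above it reproduces the listed chunks:

```python
# Simplified sketch (NOT the package implementation) of rightmost-axis-first
# chunk sizing: fill X, then Y, then Z, then C, then T within a byte budget.
ITEMSIZE = {"uint8": 1, "uint16": 2, "uint32": 4, "float32": 4, "float64": 8}

def suggest_chunks(level_shapes, dtype, memory_target):
    results = []
    for shape in level_shapes:
        remaining = memory_target // ITEMSIZE[dtype]  # element budget
        chunk = [1] * len(shape)
        for axis in range(len(shape) - 1, -1, -1):  # rightmost axis first
            chunk[axis] = min(shape[axis], remaining)
            remaining //= chunk[axis]
        results.append(tuple(chunk))
    return results
```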

add_zarr_level(existing_zarr, scale_factors, compressor=None, t_batch=4) -> None
Appends a new resolution level to an existing v2 OME-Zarr store, writing in time (T) batches.

  • scale_factors: per-axis scale relative to the previous highest level (tuple of length 5 for T, C, Z, Y, X).
  • Automatically determines appropriate chunk size using multiscale_chunk_size_from_memory_target.
  • Updates the multiscales metadata block with the new level's path and transformations.
  • Example:

from bioio_ome_zarr.writers import add_zarr_level
import numcodecs

add_zarr_level(
    "my_existing.zarr",
    scale_factors=(1, 1, 0.5, 0.5, 0.5),
    compressor=numcodecs.Blosc(
        cname="zstd", clevel=3, shuffle=numcodecs.Blosc.BITSHUFFLE
    ),
)
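One plausible reading of scale_factors (an assumption; consult the API docs) is that the appended level's shape is the previous highest level's shape scaled per axis:

```python
import math

# Hypothetical arithmetic: scale each axis of the previous highest level,
# flooring and clamping to at least 1. The package may round differently.
def scaled_level_shape(prev_shape, scale_factors):
    return tuple(
        max(1, math.floor(dim * factor))
        for dim, factor in zip(prev_shape, scale_factors)
    )
```

Under this reading, scale_factors=(1, 1, 0.5, 0.5, 0.5) halves Z, Y, and X while leaving T and C unchanged.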

Using Config Presets

Config presets make it easy to get started with OMEZarrWriter without needing to know all of its options. They inspect your input data and return a configuration dictionary that you can pass directly into the writer.

Visualization preset

The visualization preset (get_default_config_for_viz) creates a multiscale pyramid (full resolution plus downsampled levels along Y/X) suitable for interactive browsing.

import numpy as np
from bioio_ome_zarr.writers import (
    OMEZarrWriter,
    get_default_config_for_viz,
)

data = np.zeros((1, 1, 4, 64, 64), dtype="uint16")

cfg = get_default_config_for_viz(data)
writer = OMEZarrWriter("output.zarr", **cfg)
writer.write_full_volume(data)

This produces a Zarr store with the original data and additional lower-resolution levels for visualization.

Machine learning preset

The ML preset (get_default_config_for_ml) writes only the full-resolution data, chunked to optimize for patch-wise access often used in training pipelines.

Editing Zarrs

bioio-ome-zarr provides a utility for editing metadata of an existing OME-Zarr store in-place without rewriting image data.

The function:

from bioio_ome_zarr.writers import edit_metadata

allows you to modify common metadata fields such as:

  • Image name
  • Channel metadata
  • Rendering definitions (rdefs)
  • Axis names, types, and units
  • Physical pixel size (with automatic pyramid propagation)
  • Root-level coordinate transforms
  • Creator / provenance information (NGFF v0.5)

Changing Axis Metadata (e.g. ZTX → TYX)

Sometimes an image was written with incorrect or placeholder axis metadata. For example, you may have a dataset whose axes were labeled as:

Z, T, X

but the data actually represents:

T, Y, X

You can correct the axis metadata in-place:

edit_metadata(
    "my_image.ome.zarr",
    axes_names=["t", "y", "x"],
    axes_types=["time", "space", "space"],
    axes_units=["second", "micrometer", "micrometer"],
)

⚠️ Important: This operation updates metadata only.

If the actual array order needs to change (for example, true ZTX data must become TYX data), you must rewrite the array data before updating metadata.

Updating Physical Pixel Size

To change the physical pixel size of the base resolution:

edit_metadata(
    "my_image.ome.zarr",
    physical_pixel_size=[1.0, 1.0, 0.5, 0.108, 0.108],
)

This will:

  • Update the base resolution scale
  • Automatically propagate scale changes to all pyramid levels
  • Preserve relative downsampling ratios
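The propagation rule above amounts to multiplying the new base scale by each level's existing downsampling ratio. A sketch of that arithmetic (not package code):

```python
# Illustrative sketch: each pyramid level's scale is the new base scale
# times that level's per-axis downsampling ratio relative to level 0.
def propagate_scales(new_base_scale, level_ratios):
    return [
        tuple(base * ratio for base, ratio in zip(new_base_scale, ratios))
        for ratios in level_ratios
    ]
```

A level downsampled 2x in Y/X keeps exactly twice the base Y/X scale after the edit, so relative ratios are preserved.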

Updating Channel Metadata

Channel metadata is written into the OMERO metadata block.

from bioio_ome_zarr.writers import Channel

channels = [
    Channel(label="DAPI", color="#0000FF"),
    Channel(label="GFP", color="#00FF00"),
]

edit_metadata(
    "my_image.ome.zarr",
    channels=channels,
)

Setting Creator Metadata (NGFF v0.5)

For NGFF v0.5 stores:

edit_metadata(
    "my_image.ome.zarr",
    creator_info={
        "name": "bioio-ome-zarr",
        "version": "3.1.0",
    },
)

Notes

  • The Zarr store is modified in-place.
  • No image data is rewritten/rechunked.

Issues

Click here to view all open issues in the bioio-devs organization at once, or check this repository's issue tab.

Development

See CONTRIBUTING.md for information related to developing the code.
