Skip to main content

OME-TIFF and OME-ZARR writer APIs designed for microscopy acquisition

Project description

ome-writers

License PyPI Python Version CI codecov codspeed

OME-TIFF and OME-ZARR writer APIs designed for microscopy acquisition.

Purpose

ome-writers provides a unified interface for writing microscopy image data to OME-compliant formats (OME-TIFF and OME-Zarr) using various different backends. It is designed for streaming acquisition: receiving 2D camera frames one at a time and writing them to multi-dimensional arrays with proper metadata.

The core problem ome-writers solves:

Map a stream of 2D frames (arriving in acquisition order) to storage locations in multi-dimensional arrays, while generating OME-compliant metadata for both TIFF and Zarr formats.

Installation

You can install ome-writers via pip. You must also select select at least one backend extra:

pip install ome-writers[<backend>]

...where <backend> is a comma-separated list of one or more of the following:

  • tensorstore — Uses tensorstore, supports OME-Zarr v0.5.
  • acquire-zarr — Uses acquire-zarr, supports OME-Zarr v0.5.
  • zarr-python — Uses zarr-python, supports OME-Zarr v0.5.
  • zarrs-python — Uses zarrs-python, supports OME-Zarr v0.5.
  • tifffile — Uses tifffile, supports OME-TIFF.
  • all — install all backends.

[!Note] All zarr-backends use yaozarrs to generate OME-Zarr metadata and create zarr hierarchies (only array-writing is handled by the selected backend).

(Developers using uv sync will end up with all backends installed by default.)

Basic Usage

from ome_writers import AcquisitionSettings, Dimension, create_stream

settings = AcquisitionSettings(
    root_path="example_5d_image.ome.zarr",
    dimensions=[
        Dimension(name="t", count=10, chunk_size=1, type="time"),
        Dimension(name="c", count=2, chunk_size=1, type="channel"),
        Dimension(name="z", count=5, chunk_size=1, type="space", scale=5),
        Dimension(name="y", count=256, chunk_size=64, type="space", scale=0.1),
        Dimension(name="x", count=256, chunk_size=64, type="space", scale=0.1),
    ],
    dtype="uint16",
    overwrite=True,
    backend="auto",
)

with create_stream(settings) as stream:
    for frame in ...:
        stream.append(frame)

High-Level Architecture

┌───────────────────────┐      ┌─────────────────┐      ┌───────────────────────┐
│  AcquisitionSettings  │─────▶│   FrameRouter   │─────▶│  ArrayBackend         │
│                       │      │                 │      │                       │
│  Declarative model    │      │  __next__() ->  │      │  write(pos,idx,frame) │
│  of acquisition order │      │    (pos, idx)   │      │  finalize()           │
└───────────────────────┘      └─────────────────┘      └───────────────────────┘

AcquisitionSettings (schema.py)

The schema is the declarative description of what to create. In addition to other storage details such as data types, chunking, compression, and other metadata, it must fully describe the dimensionality of the data and the exact order in which frames will arrive.

Explicit non-goal: ome-writers does not attempt to handle non-deterministic acquisition patterns (e.g., event-driven acquisitions where data shape is unknown ahead of time). However, we do support an unbounded first dimension (e.g., time or whatever). For this case, we recommend a flat 3D structure (e.g., FYX with unbounded F) where F is "any frame", storing metadata for mapping frames to logical dimensions externally.

It answers:

  • What dimensions exist? (T, C, Z, Y, X, positions, plates, etc.)
  • What is the acquisition order? (how will frames arrive)
  • What is the storage order? (how should axes be arranged on disk)
  • Data types, chunking, compression, sharding, etc.

The schema separates acquisition order (the order dimensions appear in the dimensions list) from storage order (controlled by the storage_order field). This allows data to arrive in one order (e.g., TZCYX) but be stored in another (e.g., TCZYX for NGFF compliance).

FrameRouter

The router is the stateful iterator that maps frame numbers to storage locations. It:

  1. Reads the schema to understand both acquisition and storage order
  2. Maintains iteration state (which frame are we on?)
  3. Computes the permutation from acquisition order to storage order
  4. Yields (position_key, storage_index) tuples for each frame

The router is the only component that knows about both orderings. It iterates in acquisition order (because that's how frames arrive) and emits storage-order indices (because that's what backends need).

ArrayBackend

Backends are format-specific writers that handle the actual I/O. They:

  1. Create arrays/files based on the schema
  2. Write frames to specified locations
  3. Generate format-appropriate metadata
  4. Handle finalization (flushing, closing)

Supported backends:

  • tensorstore — OME-Zarr v0.5 via yaozarrs
  • zarr-python — OME-Zarr v0.5 via yaozarrs
  • acquire-zarr — OME-Zarr v0.5 via yaozarrs
  • tifffile — OME-TIFF

Backends receive indices in storage order and don't need to know about acquisition order.

Design Principles

  1. Schema is declarative — describes the target structure, not how to build it
  2. Router handles the mapping — single place for acquisition→storage order logic
  3. Backends are simple adapters — receive storage-order indices, write bytes
  4. Position is a meta-dimension — appears in iteration but becomes separate arrays/files, not an array axis

Why this layer of abstraction?

The separation of schema, router, and backend allows us to leave the performance-critical tasks to C++ libraries (like tensorstore, acquire-zarr), while keeping "fiddly" metadata logic and frame routing in Python (where it's easier to maintain).

The API of this library is heavily inspired by the acquire-zarr API (declare deterministic experiment with schema, append frames with single append() calls). But we also:

  • want to support both zarr and tiff formats (OME-TIFF)
  • want to support other zarr array libraries, such as tensorstore.
  • want to take advantage of Python for metadata management (e.g. ome-types for OME-XML generation and yaozarrs for OME-Zarr metadata)

Supported Use Cases

  • Single 5D image (TCZYX or any permutation) — the common case
  • Multi-position acquisition — separate arrays/files per stage position
  • Well plates — hierarchical plate/well/field structure with explicit acquisition order
  • Unbounded first dimension — e.g., streaming time-lapse with unknown total frames

Currently Unsupported Edge Cases

  • Jagged arrays: E.g.
    • one channel does Z-stacks while another does single planes. In other words, the outer array is regular, but some inner frames are missing/skipped.
    • different positions have different shapes (nT, nZ, etc), such as is possible when using subsequences in useq-schema. (maybe this is just the responsibility of the user to create multiple streams).
  • Multi-camera setups, particularly with different image shapes or data types. (here too... the caller could just call append() in the right order for each buffer)
  • What happens if you want to skip a frame at runtime, maybe append(None)?

Contributing

We welcome contributions to ome-writers! See our contributing guide for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ome_writers-0.1.0rc1.tar.gz (86.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ome_writers-0.1.0rc1-py3-none-any.whl (50.6 kB view details)

Uploaded Python 3

File details

Details for the file ome_writers-0.1.0rc1.tar.gz.

File metadata

  • Download URL: ome_writers-0.1.0rc1.tar.gz
  • Upload date:
  • Size: 86.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ome_writers-0.1.0rc1.tar.gz
Algorithm Hash digest
SHA256 de9cb1ec02068941904079641f343ade3d528ecf343d61c6893294b47b6cf09a
MD5 01580177e5caefe9973a8c6a0a549604
BLAKE2b-256 11ffaba7c95b9056ad50db0496c9420ff87eaad6436bc163d68a2e35b7effd8b

See more details on using hashes here.

Provenance

The following attestation bundles were made for ome_writers-0.1.0rc1.tar.gz:

Publisher: ci.yml on pymmcore-plus/ome-writers

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ome_writers-0.1.0rc1-py3-none-any.whl.

File metadata

  • Download URL: ome_writers-0.1.0rc1-py3-none-any.whl
  • Upload date:
  • Size: 50.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ome_writers-0.1.0rc1-py3-none-any.whl
Algorithm Hash digest
SHA256 8b2a03dca3f3a1e7501609b4813c66f41633fac53a6e63c153ac1035b1729188
MD5 f51fcf80b7fcd227794b3e58d5b756fc
BLAKE2b-256 2f58224775582eab87da57d985a2fb1d52e069bec042112a19a2a602a28994b4

See more details on using hashes here.

Provenance

The following attestation bundles were made for ome_writers-0.1.0rc1-py3-none-any.whl:

Publisher: ci.yml on pymmcore-plus/ome-writers

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page