A BioIO conversion tool for going between image formats
Project description
bioio-conversion
A BioIO conversion tool for going between image formats.
Documentation
See the full documentation on our GitHub Pages site:
https://bioio-devs.github.io/bioio-conversion
Installation
Install from PyPI along with core BioIO and plugins:
pip install bioio-conversion
Python Package Usage
Available Converters
-
OmeZarrConverter
-
Purpose: Convert any BioImage-supported input (TIFF, CZI, ND2, etc.) into an OME-Zarr store.
-
Features:
- Multi-scene export (
scenes=0, list, orNone= all) - Flexible multiscale pyramid options (
level_shapes,num_levels,downsample_z) - Chunk-size tuning (
chunk_shape,memory_target,shard_shape) - Metadata options (
channels,axes_names,axes_units,axes_types,physical_pixel_size) - Output format (
zarr_format= 2 or 3) - Optional auto Dask cluster
- Multi-scene export (
-
Import path:
from bioio_conversion.converters import OmeZarrConverter
-
-
BatchConverter
-
Purpose: Orchestrate batch conversions of many files (CSV, directory crawl, or explicit list).
-
Features:
- Factory methods:
from_csv(),from_directory(),from_list() - Shared
default_optsfor per-job overrides - Dispatch jobs via
.run_jobs()
- Factory methods:
-
Import path:
from bioio_conversion.converters import BatchConverter
-
Example: OmeZarrConverter
Minimal usage
from bioio_conversion.converters import OmeZarrConverter
conv = OmeZarrConverter(
source='image.tiff',
destination='out_dir'
)
conv.convert()
Advanced usage: full control
from bioio_conversion.converters import OmeZarrConverter
from zarr.codecs import BloscCodec
conv = OmeZarrConverter(
source='multi_scene.czi',
destination='zarr_output',
scenes=None,
name='experiment1',
tbatch=2,
num_levels=3,
downsample_z=True,
chunk_shape=(1,1,16,256,256),
shard_shape=(1,1,128,1024,1024),
memory_target=32*1024*1024,
dtype='uint16',
compressor=BloscCodec(),
zarr_format=3,
)
conv.convert()
Explicit level_shapes
conv = OmeZarrConverter(
source="image_tczyx.tif",
destination="out_tczyx",
level_shapes=[
(1, 3, 5, 325, 475),
(1, 3, 2, 162, 238),
(1, 3, 1, 81, 119),
],
)
conv.convert()
Channel metadata
from bioio_ome_zarr.writers import Channel
channels = [
Channel(label="DAPI", color="#0000FF", active=True,
window={"min":100, "max":2000, "start":200, "end":1200}),
Channel(label="GFP", color="#00FF00", active=True),
Channel(label="TRITC", color="#FF0000", active=False),
]
conv = OmeZarrConverter(
source="multi_channel.czi",
destination="out_channels",
channels=channels,
)
conv.convert()
Axes & physical pixel sizes
conv = OmeZarrConverter(
source="custom_axes.tif",
destination="out_axes",
axes_names=["t","c","z","y","x"],
axes_types=["time","channel","space","space","space"],
axes_units=[None, None, "micrometer","micrometer","micrometer"],
physical_pixel_size=[1.0, 1.0, 0.4, 0.108, 0.108],
)
conv.convert()
Example with fewer dimensions (3D ZYX)
conv = OmeZarrConverter(
source="volume_zyx.tif",
destination="out_zyx",
num_levels=2,
downsample_z=True,
)
conv.convert()
CSV-driven batch conversion
The CSV file should have a header row that names the job parameters. At minimum, include a source column (path to each input image). You may also include per-job overrides for any converter option (e.g. destination, scenes, tbatch, num_levels, downsample_z, level_shapes, memory_target, dtype, channel_names, etc.). Values in each row will be merged with the default_opts you passed to BatchConverter.
from bioio_conversion import BatchConverter
bc = BatchConverter(
converter_key='ome-zarr',
default_opts={
'destination': 'batch_out',
'tbatch': 4,
}
)
jobs = bc.from_csv('jobs.csv') # parse CSV into job dicts
bc.run_jobs(jobs)
Directory-driven batch conversion
from bioio_conversion import BatchConverter
bc = BatchConverter(default_opts={
'destination': 'dir_out',
})
jobs = bc.from_directory(
'/data/images',
max_depth=2,
pattern='*.tif'
)
bc.run_jobs(jobs)
List-driven batch conversion
from bioio_conversion import BatchConverter
paths = ['/data/a.czi', '/data/b.czi', '/data/c.zarr']
bc = BatchConverter(default_opts={
'destination': 'list_out',
'scenes': 0
})
jobs = bc.from_list(paths)
bc.run_jobs(jobs)
Command-Line Interface
bioio-convert – single-file conversion
Convert a single image file to OME-Zarr using the configured backend (default: OME-Zarr).
bioio-convert SOURCE -d DESTINATION [options]
SOURCE is the input image file (e.g. .czi, .ome.tiff, .nd2).
Core options
source(positional): input image path-d,--destination: output directory for.ome.zarr(required)-n,--name: base name for the output (defaults to a value derived from the input)-s,--scenes: scene(s) to export (e.g.0or0,2). If omitted, the converter/writer default is used (“all scenes”).--tbatch: number of timepoints per write batch.--start-t-src: source T index at which to begin reading (0-based).--start-t-dest: destination T index at which to begin writing (0-based).
Multiscale (pyramid) options
-
--level-shapes: semicolon-separated per-level shapes (level 0 first). Each tuple must have one integer per axis.- Example:
--level-shapes "1,3,5,512,512;1,3,5,256,256;1,3,5,128,128"
- Example:
-
--num-levels: total number of pyramid levels (including level 0). If provided (and--level-shapesis not), a half-pyramid is built in X/Y (and optionally Z). -
--downsample-z: when used with--num-levels, also halves the Z dimension at each level if a Z axis exists.
Chunking / sharding (advanced)
--chunk-shape: single chunk shape tuple applied to all levels (e.g.1,1,16,256,256).--chunk-shape-per-level: semicolon-separated chunk shapes per level. Overrides--chunk-shapeand--memory-target.--memory-target: approximate in-memory byte budget used to derive per-level chunk shapes when explicit chunk shapes are not provided.--shard-shape: single shard shape tuple for Zarr v3 (e.g.1,1,128,1024,1024).--shard-shape-per-level: semicolon-separated shard shapes per level (Zarr v3). Overrides--shard-shape.
Writer / metadata options
-
--dtype: output dtype override (e.g.uint16,float32). If omitted, the reader’s native dtype is used. -
--physical-pixel-sizes: comma-separated floats (one per axis, level 0 only). Example for(t,c,z,y,x):--physical-pixel-sizes 1.0,1.0,0.4,0.108,0.108 -
--zarr-format: target Zarr version:2≈ NGFF 0.43≈ NGFF 0.5 If omitted, the writer’s default is used (3≈ NGFF 0.5).
Channel display options
These only take effect when --channel-labels is provided. All lists must
align by channel index.
--channel-labels: comma-separated channel names (e.g.DAPI,GFP,TRITC).--channel-colors: comma-separated colors (CSS color names or hex codes). Example:"#0000FF,#00FF00,#FF0000".--channel-actives: booleans for channel visibility (e.g.true,true,false).--channel-coefficients: per-channel intensity coefficients (e.g.1,0.8,1.2).--channel-families: intensity family names per channel (e.g.linear,sRGB,sRGB).--channel-inverted: booleans for inverted display per channel.--channel-window-min,--channel-window-max,--channel-window-start,--channel-window-end: per-channel windowing values. Only used when any window value is provided.
Axis metadata options
--axes-names: comma-separated axis names in native axis order. Example:t,c,z,y,x.--axes-types: comma-separated axis semantic types (e.g.time,channel,space,space,space).--axes-units: comma-separated axis units, in the same order as--axes-names. Usenone,null, or a blank position for missing units. Example for(t,c,z,y,x):s,,um,um,um.
bioio-convert examples
Basic usage
bioio-convert image.tif -d out_dir
Custom name
bioio-convert sample.czi -d out_dir -n my_run
Export all scenes
bioio-convert multi_scene.ome.tiff -d zarr_out
Export specific scenes
bioio-convert multi_scene.ome.tiff -d zarr_out -s 0,2
Simple half-pyramid (XY only)
bioio-convert volume.tif -d out_xy --num-levels 3
Simple half-pyramid (XYZ)
bioio-convert volume_tczyx.tif -d out_xyz --num-levels 3 --downsample-z
Explicit level shapes
bioio-convert image.tif -d out_explicit \
--level-shapes "1,3,5,325,475;1,3,2,162,238;1,3,1,81,119"
Dtype and chunking
bioio-convert image.tif -d out_dir \
--dtype uint16 \
--memory-target 33554432
Custom channels
bioio-convert image_with_channels.czi -d out_dir \
--channel-labels DAPI,GFP,TRITC \
--channel-colors "#0000FF,#00FF00,#FF0000" \
--channel-actives true,true,false
Axis metadata
bioio-convert image_tczyx.tif -d out_axes \
--axes-names t,c,z,y,x \
--axes-types time,channel,space,space,space \
--axes-units s,,um,um,um
Physical pixel sizes
bioio-convert image.tif -d out_dir \
--physical-pixel-sizes 1.0,1.0,0.4,0.108,0.108
bioio-batch-convert – batch conversion
Batch mode: convert many files via CSV, directory walk, or an explicit list
of paths. All of the shared OME-Zarr options listed for bioio-convert
(--num-levels, --chunk-shape, --channel-*, axis metadata, etc.) are
also accepted here and act as defaults for every job.
bioio-batch-convert --mode [csv|dir|list] [mode options] [shared options]
Mode selection
-
-m,--mode [csv|dir|list](required):csv: read jobs from a CSV file.dir: scan a directory tree for input files.list: use an explicit list of paths from the command line.
Mode-specific options
-
CSV mode (
--mode csv)--csv-file: path to a CSV describing jobs (one row per job). Each column name maps to anOmeZarrConverterinit argument (e.g.source,destination,scenes,tbatch, etc.). Values are parsed by the batch loader; per-row values override shared defaults from the CLI.
-
Directory mode (
--mode dir)--directory/--dir: root directory to scan.--depth: maximum recursion depth (0 = only top-level files).--pattern: glob pattern used when scanning (e.g. "*.czi").
-
List mode (
--mode list)--paths: explicit input file paths (repeatable).
Shared conversion options
All of the bioio-convert options (destination, multiscale, chunking,
channels, axes, etc.) can be passed to bioio-batch-convert. They are
converted via build_ome_zarr_init_opts(...) and applied as defaults to
every job created by the BatchConverter. CSV columns that match a given
argument override the shared defaults on a per-job basis.
bioio-batch-convert examples
CSV mode
bioio-batch-convert \
--mode csv \
--csv-file jobs.csv \
--destination batch_out \
--tbatch 4 \
--dtype uint16 \
--num-levels 3
Directory mode
bioio-batch-convert \
--mode dir \
--directory data/ \
--depth 2 \
--pattern '*.czi' \
--destination output_zarr \
--level-shapes "1,3,5,325,475;1,3,2,162,238;1,3,1,81,119"
List mode
bioio-batch-convert \
--mode list \
--paths a.czi b.czi c.tiff \
--destination list_out \
--name batch_run \
--num-levels 2 \
--downsample-z
License & Issues
BSD 3-Clause https://bioio-devs.github.io/bioio-conversion/LICENSE
Report bugs at: https://github.com/bioio-devs/bioio-conversion/issues
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bioio_conversion-1.0.0.tar.gz.
File metadata
- Download URL: bioio_conversion-1.0.0.tar.gz
- Upload date:
- Size: 33.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aab7caf90589932939c667d08e37cb5e21585727c6a139efebd5effddbfb38da
|
|
| MD5 |
49ae3605f7cd860353eed3530ece6934
|
|
| BLAKE2b-256 |
111c7278f9d88ac676e5d31a15e4efe0f74da3af7736acb61732ca0fbc7f8d8a
|
Provenance
The following attestation bundles were made for bioio_conversion-1.0.0.tar.gz:
Publisher:
ci.yml on bioio-devs/bioio-conversion
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
bioio_conversion-1.0.0.tar.gz -
Subject digest:
aab7caf90589932939c667d08e37cb5e21585727c6a139efebd5effddbfb38da - Sigstore transparency entry: 992601189
- Sigstore integration time:
-
Permalink:
bioio-devs/bioio-conversion@cdc2981b718e611fa62f1ef33ae7e576281c89d8 -
Branch / Tag:
refs/tags/v1.0.0 - Owner: https://github.com/bioio-devs
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@cdc2981b718e611fa62f1ef33ae7e576281c89d8 -
Trigger Event:
push
-
Statement type:
File details
Details for the file bioio_conversion-1.0.0-py3-none-any.whl.
File metadata
- Download URL: bioio_conversion-1.0.0-py3-none-any.whl
- Upload date:
- Size: 25.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7fd22062f648d851e210ad124586246c5c799d2a749a177b0294763bdbb74223
|
|
| MD5 |
292af569cb8a11bfde30a27f5403dc9d
|
|
| BLAKE2b-256 |
dcc8275286c116bcc6c8729aae4b47904638c1bd2e22b7d6bd588df75c233106
|
Provenance
The following attestation bundles were made for bioio_conversion-1.0.0-py3-none-any.whl:
Publisher:
ci.yml on bioio-devs/bioio-conversion
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
bioio_conversion-1.0.0-py3-none-any.whl -
Subject digest:
7fd22062f648d851e210ad124586246c5c799d2a749a177b0294763bdbb74223 - Sigstore transparency entry: 992601190
- Sigstore integration time:
-
Permalink:
bioio-devs/bioio-conversion@cdc2981b718e611fa62f1ef33ae7e576281c89d8 -
Branch / Tag:
refs/tags/v1.0.0 - Owner: https://github.com/bioio-devs
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@cdc2981b718e611fa62f1ef33ae7e576281c89d8 -
Trigger Event:
push
-
Statement type: