
A Python package for aligning and stitching light sheet fluorescence microscopy images

Project description

Rhapso

This is the official code base for Rhapso, a modular Python toolkit for the alignment and stitching of large-scale microscopy datasets.





Summary

Rhapso is a set of Python components used to register, align, and stitch large-scale, overlapping, tile-based, multiscale microscopy datasets. Its stateless components can run on a single machine or scale out across cloud-based clusters.

Rhapso is published on PyPI.

Rhapso was developed by the Allen Institute for Neural Dynamics.


Contact

Have a question or want to contribute? Please open an issue.


Supported Features

  • Interest Point Detection - DoG-based feature detection
  • Interest Point Matching - Descriptor-based RANSAC to match feature points
  • Global Optimization - Globally align matched features between tile pairs
  • Affine Fusion - Fuse tiles using the generated alignments (up to affine), with multiscale support
  • Validation and Visualization Tools - Validate component-specific results for the best output
  • ZARR - Zarr data as input
  • TIFF - TIFF data as input
  • AWS - AWS S3-based input/output and Ray-based EC2 instances
  • Scale - Tested on 200 TB of data without downsampling


Layout

Rhapso/
└── Rhapso/
    ├── data_prep/                          # Custom data loaders
    ├── detection/
    ├── evaluation/
    ├── affine_fusion/
    ├── multiscale/
    ├── image_split/
    ├── matching/
    ├── pipelines/
    │   └── ray/
    │       ├── aws/
    │       │   ├── config/                 # Cluster templates (edit for your account)
    │       │   ├── alignment_pipeline.py   # AWS Ray pipeline entry point
    │       │   └── fusion_pipeline.py      # AWS Ray pipeline entry point
    │       ├── local/
    │       │   ├── alignment_pipeline.py   # Local Ray pipeline entry point
    │       │   └── fusion_pipeline.py      # Local Ray pipeline entry point
    │       ├── param/                      # Run parameter files (customize per run)
    │       ├── interest_point_detection.py # Detection pipeline script
    │       ├── interest_point_matching.py  # Matching pipeline script
    │       ├── solver.py                   # Global solver pipeline script
    │       ├── affine_fusion.py            # Affine fusion pipeline script
    │       └── multiscale.py               # Multiscale pipeline script
    ├── solver/
    └── visualization/                      # Validation tools


Installation

Option 1: Install from PyPI (recommended)

# create and activate a virtual environment
python -m venv .venv && source .venv/bin/activate
# or: conda create -n rhapso python=3.10 && conda activate rhapso

# install Rhapso from PyPI
pip install Rhapso

Option 2: Install from GitHub (developers)

# clone the repo
git clone https://github.com/AllenNeuralDynamics/Rhapso.git

# create and activate a virtual environment
python -m venv .venv && source .venv/bin/activate
# or: conda create -n rhapso python=3.11 && conda activate rhapso

# install deps
pip install -r requirements.txt


How to Start

Rhapso is driven by pipeline scripts.

  • Each pipeline script has at minimum an associated param file (e.g. in Rhapso/pipelines/ray/param/).
  • If you are running on a cluster, you’ll also have a Ray cluster config (e.g. in Rhapso/pipelines/ray/aws/config/).

A good way to get started:

  1. Pick a template pipeline script
    For example:

    • Rhapso/pipelines/ray/local/alignment_pipeline.py (local)
    • Rhapso/pipelines/ray/aws/alignment_pipeline.py (AWS/Ray cluster)
  2. Point it to your param file
    Update the with open("...param.yml") line so it reads your own parameter YAML.

  3. (Optional) Point it to your cluster config
    If you’re using AWS/Ray, update the cluster config path.

  4. Edit the params to match your dataset
    Paths, downsampling, thresholds, matching/solver settings, etc.

  5. Run the pipeline
    The pipeline script will call the Rhapso components (detection, matching, solver, fusion) in the order defined in the script using the parameters you configured.
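The flow described above can be sketched as a minimal driver. The stage names, the param keys, and the `run_pipeline` helper below are illustrative, not the actual Rhapso API:

```python
import yaml

def run_pipeline(param_path):
    """Toy pipeline driver: load one parameter YAML, then visit the
    components in the fixed order the real pipeline scripts use."""
    with open(param_path, "r") as file:
        config = yaml.safe_load(file)
    results = []
    for stage in ("detection", "matching", "solver", "fusion"):
        # Each stage would call the corresponding Rhapso component with
        # its section of the config; here we just record the dispatch.
        results.append((stage, config.get(stage, {})))
    return results

# Example param file content (keys are illustrative):
#   detection:
#     sigma: 1.8
#     threshold: 0.001
```

The real pipeline scripts inline this logic rather than wrapping it in a function, but the shape is the same: one YAML in, components called in order.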



Try Rhapso on Sample Data

The quickest way to get familiar with Rhapso is to run it on a real dataset. We have a small (10GB) Z1 example hosted in a public S3 bucket, so you can access it without special permissions. It’s a good starting point to copy and adapt for your own alignment workflows.

XML (input)

  • s3://aind-open-data/HCR_802704_2025-08-30_02-00-00_processed_2025-10-01_21-09-24/image_tile_alignment/single_channel_xmls/channel_488.xml

Image prefix (referenced by the XML)

  • s3://aind-open-data/HCR_802704_2025-08-30_02-00-00_processed_2025-10-01_21-09-24/image_radial_correction/

Note: Occasionally we clean up our aind-open-data bucket. If you find this dataset does not exist, please create an issue and we will replace it.



High Level Approach to Registration, Alignment, and Fusion

This process has a lot of knobs and variations, but when tuned correctly it can handle a broad range of datasets.

First, figure out what type of alignment you need.

  • Do the tiles need to be shifted (translated) into place?
  • If so, you’ll likely want to start with a rigid alignment.

Once you’ve run the rigid step, how does your data look?

  • Did the required translations shrink to an acceptable level?
  • If not, try again with new parameters, keeping the questions above in mind.

At this point, the translational part of your alignment should be in good shape. Now ask: do the tiles still need non-translational corrections such as scaling or shear? If so, you likely need an affine alignment next.

Your dataset should be correctly aligned at this point. If not, there are a number of possible reasons; we list the common causes below and will keep the list up to date.

There is a special case in some datasets where the z-stack is very large. In this case, you can use the split-dataset utility, which splits each tile into chunks. Then you can run a split-affine alignment, allowing more precise local transformations without being constrained by a single global model.
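To make the "did the required translations shrink?" check concrete: fit a rigid transform to matched points from a tile pair and look at the residuals. This is a plain numpy sketch (a Kabsch least-squares fit), not the Rhapso solver:

```python
import numpy as np

def fit_rigid(src, dst):
    """Least-squares rigid (rotation + translation) fit between matched
    (N, 3) point sets via the Kabsch algorithm."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    # Reflection guard: keep the rotation proper (det(R) = +1).
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t

def residuals(src, dst, R, t):
    """Per-match distance after applying the fitted transform; large
    values flag tile pairs that still need work."""
    return np.linalg.norm(src @ R.T + t - dst, axis=1)
```

If the residuals stay large after the rigid step, revisit the detection and matching parameters before moving on to affine.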

Common Causes of Poor Alignment

  • Not enough quality matches (adjust the sigma and threshold parameters until you have enough)
  • Inconsistent-looking data (parameters are applied globally across the dataset)
  • Large translations needed (extend the search radius)
  • Translations that extend beyond the overlapping span (increase the overlap)


Performance

Interest Point Detection Performance Example (130 TB Zarr dataset)

| Environment          | Resources           | Avg runtime |
| :------------------- | :------------------ | :---------- |
| Local single machine | 10 CPU, 10 GB RAM   | ~120 min    |
| AWS Ray cluster      | 560 CPU, 4.4 TB RAM | ~30 min     |

*Actual times vary by pipeline components, dataset size, tiling, and parameter choices.*

Ray

Ray is a Python framework for parallel and distributed computing. It lets you run regular Python functions in parallel on a single machine or scale them out to a cluster (e.g., AWS) with minimal code changes. In Rhapso, we use Ray to process large-scale datasets.

  • Convert a function into a distributed task with @ray.remote
  • Control scheduling with resource hints (CPUs, memory)

[!TIP] Ray schedules greedily by default, and each task reserves 1 CPU, so if you fire many tasks, Ray will try to run as many as your machine advertises, which is often too many for a laptop. Throttle concurrency explicitly so you don't overload your system. Use your machine's activity monitor to track this locally, or the Ray dashboard on your cluster:

  • Cap by CPUs:
    @ray.remote(num_cpus=3)   # each task is scheduled once 3 CPUs are available
    
  • Cap by memory and CPUs if tasks are RAM-heavy (memory is in bytes):
    @ray.remote(num_cpus=2, memory=4 * 1024**3)  # 4 GiB and 2 CPUs per task
    
  • No cap on resources:
    @ray.remote               # defaults to 1 CPU per task
    
  • Good local default:
    @ray.remote(num_cpus=2)
    


Run Locally with Ray

1. Edit or create param file (templates in codebase)

Rhapso/Rhapso/pipelines/ray/param/

2. Update alignment pipeline script to point to param file

with open("Rhapso/pipelines/ray/param/your_param_file.yml", "r") as file:
    config = yaml.safe_load(file)

3. Run local alignment pipeline script

python Rhapso/pipelines/ray/local/alignment_pipeline.py


Run on AWS Cluster with Ray

1. Edit/create param file (templates in codebase)

Rhapso/pipelines/ray/param/

2. Update alignment pipeline script to point to param file

with open("Rhapso/pipelines/ray/param/your_param_file.yml", "r") as file:
    config = yaml.safe_load(file)

3. Edit/create config file (templates in codebase)

Rhapso/pipelines/ray/aws/config/

4. Update alignment pipeline script to point to config file

unified_yml = "your_cluster_config_file_name.yml"

5. Run AWS alignment pipeline script

python Rhapso/pipelines/ray/aws/alignment_pipeline.py

[!TIP]

  • The pipeline script is set to always spin the cluster down, but it is good practice to double-check in the AWS console.
  • If run parameters seem to be cached ("sticky") between runs, you may have forgotten to spin down your old cluster.

Access Ray Dashboard

This is a great place to tune your cluster's performance.

  1. Find public IP of head node.
  2. SSH into the head node, substituting your PEM file path and the head node's public IP.
    ssh -i /your/path/to/ssh/key.pem -L 8265:localhost:8265 ubuntu@public.ip.address
    
  3. Go to dashboard.
    http://localhost:8265
    


Parameters

Detection

| Parameter          | Feature / step         | What it does                                                                                  | Typical range\*                   |
| :----------------- | :--------------------- | :-------------------------------------------------------------------------------------------- | :-------------------------------- |
| `dsxy`             | Downsampling (XY)      | Reduces XY resolution before detection; speeds up & denoises, but raises minimum feature size | 16                                |
| `dsz`              | Downsampling (Z)       | Reduces Z resolution; often lower than XY due to anisotropy                                   | 16                                |
| `min_intensity`    | Normalization          | Lower bound for intensity normalization prior to DoG                                          | 1                                 |
| `max_intensity`    | Normalization          | Upper bound for intensity normalization prior to DoG                                          | 5                                 |
| `sigma`            | DoG blur               | Gaussian blur scale (sets feature size); higher = smoother, fewer peaks                       | 1.5 - 2.5                         |
| `threshold`        | Peak detection (DoG)   | Peak threshold (initial min peak ≈ `threshold / 3`); higher = fewer, stronger peaks           | 0.0008 - 0.05                     |
| `median_filter`    | Pre-filter (XY)        | Median filter size to suppress speckle/isolated noise before DoG                              | 1-10                              |
| `combine_distance` | Post-merge (DoG peaks) | Merge radius (voxels) to de-duplicate nearby detections                                       | 0.5                               |
| `chunks_per_bound` | Tiling/parallelism     | Sub-partitions per tile/bound; higher improves parallelism but adds overhead                  | 12-18                             |
| `max_spots`        | Post-cap               | Maximum detections per bound to prevent domination by dense regions                           | 8,000 - 10,000                    |
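The detection stage's core idea (blur at two scales, subtract, keep strong local maxima) can be sketched in 2D numpy. The sigma ratio, kernel radius, and helper names below are illustrative; the real detector works on downsampled 3D chunks:

```python
import numpy as np

def gaussian_blur_2d(img, sigma):
    """Separable Gaussian blur built from np.convolve (illustrative;
    production code would use an optimized filter)."""
    radius = int(3 * sigma)
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-x**2 / (2 * sigma**2))
    kernel /= kernel.sum()
    rows = np.apply_along_axis(lambda r: np.convolve(r, kernel, mode="same"), 1, img)
    return np.apply_along_axis(lambda c: np.convolve(c, kernel, mode="same"), 0, rows)

def dog_peaks(img, sigma=2.0, threshold=0.01):
    """Difference-of-Gaussians peak detection: blur at two scales,
    subtract, and keep local maxima above the threshold."""
    dog = gaussian_blur_2d(img, sigma) - gaussian_blur_2d(img, sigma * 1.6)
    peaks = []
    for y in range(1, dog.shape[0] - 1):
        for x in range(1, dog.shape[1] - 1):
            patch = dog[y - 1:y + 2, x - 1:x + 2]
            if dog[y, x] >= threshold and dog[y, x] == patch.max():
                peaks.append((y, x))
    return peaks
```

Raising `sigma` smooths out small features (fewer peaks); raising `threshold` keeps only the strongest ones, which is the same trade-off the table above describes.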

Matching

Candidate Selection
| Parameter                      | Feature / step      | What it does                                                      | Typical range  |
| :----------------------------- | :------------------ | :---------------------------------------------------------------- | :------------- |
| `num_neighbors`                | Candidate search    | Number of nearest neighbors to consider per point                 | 3              |
| `redundancy`                   | Candidate search    | Extra neighbors added for robustness beyond `num_neighbors`       | 0 - 1          |
| `significance`                 | Ratio test          | Strictness of descriptor ratio test; larger = stricter acceptance | 3              |
| `search_radius`                | Spatial gating      | Max spatial distance for candidate matches (in downsampled units) | 100 - 300      |
| `num_required_neighbors`       | Candidate filtering | Minimum neighbors required to keep a candidate point              | 3              |
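A brute-force sketch of candidate selection under `num_neighbors` and `search_radius` (at scale a KD-tree would be used; the function name and return format are illustrative):

```python
import numpy as np

def candidate_neighbors(points_a, points_b, num_neighbors=3, search_radius=100.0):
    """For each point in A, return the indices of up to num_neighbors
    points in B that lie within search_radius (brute-force distances)."""
    dists = np.linalg.norm(points_a[:, None, :] - points_b[None, :, :], axis=2)
    candidates = []
    for row in dists:
        order = np.argsort(row)
        # Spatial gating: keep only neighbors inside the search radius.
        candidates.append([j for j in order[:num_neighbors] if row[j] <= search_radius])
    return candidates
```

Note that `search_radius` is applied in downsampled units, so it must be scaled consistently with `dsxy`/`dsz`.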

RANSAC
| Parameter                     | Feature / step       | What it does                                                      | Typical range  |
| :---------------------------- | :------------------- | :---------------------------------------------------------------- | :------------- |
| `model_min_matches`           | RANSAC               | Minimum correspondences to estimate a rigid transform             | 18 – 32        |
| `inlier_factor`               | RANSAC               | Inlier tolerance scaling; larger = looser inlier threshold        | 30 – 100       |
| `lambda_value`                | RANSAC               | Regularization strength during model fitting                      | 0.05 – 0.1     |
| `num_iterations`              | RANSAC               | Number of RANSAC trials; higher = more robust, slower             | 10,000         |
| `regularization_weight`       | RANSAC               | Weight applied to the regularization term                         | 1.0            |
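The RANSAC loop can be illustrated with a toy pure-translation model (the real matcher fits rigid/affine models; the parameter names here are simplified stand-ins):

```python
import numpy as np

def ransac_translation(src, dst, num_iterations=500, inlier_tol=2.0,
                       min_matches=3, seed=0):
    """Toy RANSAC: each trial picks one correspondence, proposes
    dst - src as the shift, and counts how many other matches agree
    within inlier_tol. The best consensus set is refit at the end."""
    rng = np.random.default_rng(seed)
    best_inliers = np.zeros(len(src), dtype=bool)
    for _ in range(num_iterations):
        i = rng.integers(len(src))
        shift = dst[i] - src[i]
        inliers = np.linalg.norm(src + shift - dst, axis=1) <= inlier_tol
        if inliers.sum() > best_inliers.sum():
            best_inliers = inliers
    if best_inliers.sum() < min_matches:
        return None, best_inliers  # not enough support for a model
    # Refit on all inliers for the final estimate.
    shift = (dst[best_inliers] - src[best_inliers]).mean(axis=0)
    return shift, best_inliers
```

More iterations raise the chance of sampling an all-inlier model (robustness) at the cost of runtime, which is exactly the `num_iterations` trade-off in the table.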


Solver

| Parameter            | Feature / step | What it does                                                       | Typical range       |
| :------------------- | :------------- | :----------------------------------------------------------------- | :------------------ |
| `relative_threshold` | Graph pruning  | Reject edges with residuals above dataset-relative cutoff          | 3.5                 |
| `absolute_threshold` | Graph pruning  | Reject edges above an absolute error bound (detection-space units) | 7.0                 |
| `min_matches`        | Graph pruning  | Minimum matches required to retain an edge between tiles           | 3                   |
| `damp`               | Optimization   | Damping for iterative solver; higher can stabilize tough cases     | 1.0                 |
| `max_iterations`     | Optimization   | Upper bound on solver iterations                                   | 10,000              |
| `max_allowed_error`  | Optimization   | Overall error cap; `inf` disables hard stop by error               | `inf`               |
| `max_plateauwidth`   | Early stopping | Stagnation window before stopping on no improvement                | 200                 |
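A sketch of how the three pruning parameters might interact; the edge representation and the median-based relative cutoff are assumptions for illustration, not the solver's exact rule:

```python
import numpy as np

def prune_edges(edges, relative_threshold=3.5, absolute_threshold=7.0,
                min_matches=3):
    """Keep a tile-pair edge only if its residual is below both the
    absolute cutoff and relative_threshold times the median residual,
    and it has at least min_matches supporting matches."""
    median = np.median([e["residual"] for e in edges])
    return [
        e for e in edges
        if e["residual"] <= absolute_threshold
        and e["residual"] <= relative_threshold * median
        and e["num_matches"] >= min_matches
    ]
```

Pruning bad edges before optimization keeps a few outlier tile pairs from dragging the global solution.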

Fusion

| Parameter            | Feature / step | What it does                                                       | Typical range       |
| :------------------- | :------------- | :----------------------------------------------------------------- | :------------------ |
| `block_size`         | Blocking       | Cell size per task (x, y, z)                                       | 256, 256, 256       |
| `intensity_range`    | Normalization  | Range of intensity values                                          | 0, 65535            |
| `block_scale`        | Blocking       | Scaling of the cell size                                           | 2, 2, 1             |
| `overlap_strategy`   | Blending       | Strategy when more than one view contributes (`avg_blend` or `lowest_view_wins`) | avg_blend           |
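The `avg_blend` strategy can be sketched as a weighted average over contributing views; the uniform weights and the `avg_blend` helper below are illustrative (real fusion weights usually taper toward tile edges):

```python
import numpy as np

def avg_blend(blocks, weights):
    """Average-blend fusion for overlapping views: each output voxel is
    the weight-normalized sum of every contributing view."""
    acc = np.zeros(blocks[0].shape, dtype=np.float64)
    wsum = np.zeros(weights[0].shape, dtype=np.float64)
    for block, w in zip(blocks, weights):
        acc += block * w
        wsum += w
    # Where no view contributes, output 0; otherwise normalize.
    return np.where(wsum > 0, acc / np.maximum(wsum, 1e-12), 0.0)
```

A `lowest_view_wins` strategy would instead pick a single contributing view per voxel rather than averaging.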

Multiscale

| Parameter               | Feature / step | What it does                                                       | Typical range       |
| :---------------------- | :------------- | :----------------------------------------------------------------- | :------------------ |
| `multiscale_chunk_size` | Output layout  | Output cell size                                                   | 128, 128, 128       |
| `voxel_size`            | Metadata       | Voxel size of the data in z, y, x                                  | 1.0, 0.748, 0.748   |
| `n_lvls`                | Pyramid        | Number of multiscale levels, including the base level              | 7                   |
| `scale_factor`          | Pyramid        | Scaling factor per level                                           | [2,2,2],...num lvls |
| `target_block_size_mb`  | Parallelism    | Per-worker block size                                              | 256                 |
| `base_level`            | Pyramid        | Existing base-resolution level                                     | 0                   |
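Multiscale generation is conceptually repeated downsampling from the base level; this numpy sketch (`build_pyramid` is a hypothetical helper, and mean pooling is an assumption) shows how `n_lvls` and `scale_factor` interact:

```python
import numpy as np

def build_pyramid(volume, n_lvls=3, scale_factor=(2, 2, 2)):
    """Build a multiscale pyramid (including the base level) by
    block-mean downsampling with the given per-axis (z, y, x) factors."""
    levels = [volume]
    for _ in range(n_lvls - 1):
        v = levels[-1]
        fz, fy, fx = scale_factor
        # Trim so each axis divides evenly, then block-average.
        z = (v.shape[0] // fz) * fz
        y = (v.shape[1] // fy) * fy
        x = (v.shape[2] // fx) * fx
        v = (v[:z, :y, :x]
             .reshape(z // fz, fz, y // fy, fy, x // fx, fx)
             .mean(axis=(1, 3, 5)))
        levels.append(v)
    return levels
```

Each additional level shrinks the volume by the scale factor per axis, so 7 levels with [2,2,2] spans a 64x reduction per axis from the base resolution.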



Tuning Guide

  • Start with Detection. The quality and density of interest points strongly determine alignment outcomes.

  • Target Counts (exaSPIM): ~25–35k points per tile in dense regions; ~10k for sparser tiles. Going much higher usually increases runtime without meaningful accuracy gains.

  • Inspect Early. After detection, run the visualization script and verify that peaks form clustered shapes/lines with a good spatial spread—a good sign for robust rigid matches.

  • Rigid → Affine Dependency. Weak rigid matches produce poor rigid transforms, which then degrade affine matching (points don’t land close enough). If tiles fail to align:

    • Check match counts for the problem tile and its neighbors.
    • Adjust high-impact detection knobs—sigma, threshold, and median_filter—within sensible ranges.
    • Revisit max_spots and combine_distance to balance density vs. duplicate detections.


Build Package

Using the Built .whl File

  1. Build the .whl file in the root of this repo:
cd /path/to/Rhapso
pip install setuptools wheel
python setup.py sdist bdist_wheel

The .whl file will appear in the dist directory. To ensure compatibility, do not rename it (e.g., rhapso-0.1-py3-none-any.whl).




