U-Net deep-learning tool for echocardiographic ROI segmentation and de-identification

EchoROI — U-Net ROI Segmentation for Echocardiography

A lightweight U-Net model that segments the region of interest (ROI) in echocardiography frames — removing scanner chrome, ECG traces, and text overlays so that downstream models receive only clinically relevant pixels.

The model was trained on 1,355 annotated echocardiographic frames spanning four-chamber, parasternal, and subcostal views across eight datasets, and achieves a mean Dice coefficient of 0.9880 on the held-out validation split.

Paper: see paper/paper.md for the full JOSS-style manuscript.

Key Features

Feature	Detail
Architecture	Standard U-Net (31 M params, 4 encoder/decoder levels)
Input	256 × 256 × 1 grayscale (aspect-ratio preserving, zero-padded)
Output	256 × 256 × 1 binary mask (sigmoid, threshold 0.5)
Formats	Keras (`.keras`, 373 MB) and ONNX (`.onnx`, 124 MB)
Performance	Mean Dice 0.9880 on held-out validation set
ONNX Runtime	Cross-platform inference — no TensorFlow dependency
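
The Input row above describes an aspect-ratio preserving resize with zero padding. A minimal sketch of that step follows, assuming centred padding, nearest-neighbour resampling, and 8-bit input scaled to [0, 1]. None of these choices is confirmed by the source, and `letterbox_gray` is a hypothetical helper, not part of the `echoroi` API.

```python
import numpy as np

def letterbox_gray(frame: np.ndarray, size: int = 256) -> np.ndarray:
    """Resize a 2-D grayscale frame to size x size without distorting it.

    Hypothetical helper: nearest-neighbour sampling keeps the sketch free of
    image libraries; the package itself may interpolate differently.
    """
    h, w = frame.shape
    scale = size / max(h, w)                 # longest side maps to `size`
    new_h = max(1, round(h * scale))
    new_w = max(1, round(w * scale))

    # Map each output row/column back to its nearest source pixel.
    rows = np.minimum((np.arange(new_h) / scale).astype(int), h - 1)
    cols = np.minimum((np.arange(new_w) / scale).astype(int), w - 1)
    resized = frame[rows][:, cols]

    if resized.dtype == np.uint8:            # assumption: 8-bit input
        resized = resized / 255.0
    resized = resized.astype(np.float32)

    # Zero-padded canvas with the resized frame centred (assumed placement).
    canvas = np.zeros((size, size), dtype=np.float32)
    top, left = (size - new_h) // 2, (size - new_w) // 2
    canvas[top:top + new_h, left:left + new_w] = resized
    return canvas[None, ..., None]           # shape (1, size, size, 1)
```

A 480 × 640 frame, for example, is scaled to 192 × 256 and padded with 32 rows of zeros above and below.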

Model Architecture

EchoROI uses a standard U-Net adapted for scan-sector segmentation with 256 × 256 grayscale input, same-padding convolutions to preserve sector geometry, and dropout regularisation to reduce overfitting on a small heterogeneous training set. Additional implementation details, intended use, and known limitations are summarised in MODEL_CARD.md.

Figure: Reference U-Net architecture used by EchoROI. The model follows a standard encoder-decoder U-Net layout with same-padding convolutions, dropout regularisation, and a single-channel sigmoid output for binary scan-sector segmentation.

Loss Function

The model is trained with a composite BCE + Dice + Total Variation loss:

$$\mathcal{L} = w_\text{bce}\,\text{BCE} + w_\text{dice}\,\text{DiceLoss} + \alpha_\text{tv}\,\text{TV}(\hat{y})$$

Term	Purpose	Weight
BCE	Stable per-pixel classification gradient	1.0
Dice	Region-overlap optimisation; robust to class imbalance	1.0
Total Variation	Penalises high-frequency mask edges → smooth sector boundaries	1 × 10⁻⁴

The TV regulariser is the key ingredient for producing the smooth, fan-shaped sector boundaries typical of ultrasound probes. BCE alone can produce noisy boundaries; adding a region-based loss (Dice/Jaccard) improves overlap but does not explicitly enforce spatial smoothness. The TV term fills this gap by penalising large pixel-to-pixel differences in the predicted mask, yielding clean, continuous boundaries even on a small heterogeneous training set. Implementation: echoroi/model.py.
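
To make the three terms concrete, here is a NumPy sketch of the composite loss on a single mask pair. It is an illustration only: the actual training loss lives in echoroi/model.py and is not reproduced here. The anisotropic form of TV (summed absolute differences between neighbouring pixels), the soft Dice formulation, and the `eps` smoothing constant are assumptions.

```python
import numpy as np

def composite_loss(y_true, y_pred, w_bce=1.0, w_dice=1.0, alpha_tv=1e-4, eps=1e-7):
    """BCE + Dice + TV for one (H, W) mask pair with values in [0, 1]."""
    y_true = np.asarray(y_true, dtype=np.float64)
    # Clip predictions away from 0 and 1 so the logarithms stay finite.
    y_pred = np.clip(np.asarray(y_pred, dtype=np.float64), eps, 1.0 - eps)

    # Binary cross-entropy, averaged over pixels.
    bce = -np.mean(y_true * np.log(y_pred) + (1.0 - y_true) * np.log(1.0 - y_pred))

    # Soft Dice loss: 1 - 2|A.B| / (|A| + |B|).
    intersection = np.sum(y_true * y_pred)
    dice = (2.0 * intersection + eps) / (np.sum(y_true) + np.sum(y_pred) + eps)
    dice_loss = 1.0 - dice

    # Anisotropic total variation of the prediction (assumed form).
    tv = (np.abs(np.diff(y_pred, axis=0)).sum()
          + np.abs(np.diff(y_pred, axis=1)).sum())

    return w_bce * bce + w_dice * dice_loss + alpha_tv * tv
```

Because TV is summed over pixels while BCE and Dice are averaged or normalised, the small weight of 1 × 10⁻⁴ keeps the smoothness term from dominating the other two.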

Training Data Summary

The reference model was trained on 1,355 manually annotated echocardiographic frame-mask pairs drawn from public and institutional sources. Masks were created in LabelMe by outlining the visible scan sector while excluding padding, borders, and display graphics. Only one representative frame per cine loop was used for training because sector geometry is typically static within a clip.

Dataset	Frames	Access
MIMIC-IV-ECHO	403	PhysioNet
EchoNet-Dynamic	145	Stanford
EchoNet-Paediatric	263	Stanford
CACTUS (A4C subset)	38	Open access
EchoCP	60	Kaggle
Private dataset (consented)	50	Institutional
CardiacUDC	247	Kaggle
HMC-QU	149	By request
Total	1,355

The full citation list for these datasets is given in paper/paper.md.

Quick Start

# Clone
git clone https://github.com/Kamlin-MD/UNET-Echocardiography-ROI-segmentation.git
cd UNET-Echocardiography-ROI-segmentation

# Install
pip install -e ".[dev]"

# Run inference on a single image
python -c "
from echoroi import UNetPredictor
predictor = UNetPredictor('models/echoroi_unified.keras')
mask = predictor.predict_single_image('path/to/frame.png')  # (256,256,1) array
"

ONNX Inference (no TensorFlow)

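import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession("models/echoroi_unified.onnx")
input_name = sess.get_inputs()[0].name  # look up the input name instead of hard-coding it

# image: (1, 256, 256, 1) float32, normalised to [0, 1]; replace this placeholder with a real frame
image = np.zeros((1, 256, 256, 1), dtype=np.float32)
mask = sess.run(None, {input_name: image})[0]  # (1, 256, 256, 1) model output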
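
Once the session has returned its output, the mask can be used to blank everything outside the scan sector. The sketch below assumes the output is a sigmoid probability map and applies the 0.5 threshold from the Key Features table. `apply_roi` is a hypothetical helper name, not a function of the `echoroi` package.

```python
import numpy as np

def apply_roi(frame: np.ndarray, prob: np.ndarray, threshold: float = 0.5) -> np.ndarray:
    """Zero out every pixel outside the predicted scan sector.

    frame: (256, 256) model-input frame.
    prob:  model output of shape (1, 256, 256, 1).
    """
    mask = np.squeeze(prob) > threshold          # boolean (256, 256) mask
    return np.where(mask, frame, 0).astype(frame.dtype)
```

If the exported model already emits a binary mask, thresholding at 0.5 leaves it unchanged.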

DICOM Preprocessing Pipeline

Notebook 04_dataset_preprocessing.ipynb provides a complete, configurable pipeline for batch-processing echocardiography DICOM datasets using the ONNX model — no TensorFlow required.

What it does

DICOM files (recursive discovery)
  → Extract frames
  → Optional adaptive stride (e.g. normalise all clips to 32 frames)
  → Resize to 256×256 (aspect-ratio preserving, zero-padded)
  → Select representative frame (highest Shannon entropy)
  → ONNX ROI inference → broadcast mask to all frames
  → LV-focused square crop → resize to 112×112
  → Save as compressed NPZ
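
Three of the steps above (representative-frame selection, mask broadcasting, and the square crop) can be sketched in NumPy. This is not the notebook's code. The histogram-based entropy with 256 bins is an assumption, and the crop shown here is a plain square centred on the mask's bounding box, used as a stand-in because the source does not describe how the LV-focused crop is computed. All function names are hypothetical.

```python
import numpy as np

def shannon_entropy(frame: np.ndarray, bins: int = 256) -> float:
    """Shannon entropy (bits) of the intensity histogram of a frame in [0, 1]."""
    hist, _ = np.histogram(frame, bins=bins, range=(0.0, 1.0))
    p = hist[hist > 0] / hist.sum()
    return float(-(p * np.log2(p)).sum())

def representative_index(clip: np.ndarray) -> int:
    """Index of the highest-entropy frame in a (T, H, W) clip."""
    return int(np.argmax([shannon_entropy(f) for f in clip]))

def mask_and_crop(clip: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Broadcast one (H, W) mask over all frames, then square-crop around it."""
    masked = clip * mask[None, :, :]             # same mask for every frame
    ys, xs = np.nonzero(mask)
    if ys.size == 0:                             # empty mask: nothing to crop
        return masked
    y0, y1 = ys.min(), ys.max() + 1
    x0, x1 = xs.min(), xs.max() + 1
    h, w = mask.shape
    side = min(max(y1 - y0, x1 - x0), h, w)      # square that covers the bbox
    cy, cx = (y0 + y1) // 2, (x0 + x1) // 2
    # Keep the square inside the image.
    top = int(np.clip(cy - side // 2, 0, h - side))
    left = int(np.clip(cx - side // 2, 0, w - side))
    return masked[:, top:top + side, left:left + side]
```

Blank or uniformly grey frames have an entropy of zero under this definition, so they are never chosen over a frame with real image content.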

Key configuration options

CONFIG = {
    'target_frames': 32,    # None = keep original frame count; int = adaptive stride
    'max_files':     None,  # None = process all; int = limit for test runs
    'final_size':    (112, 112),
    'use_gpu':       False, # set True for CUDA acceleration
}
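
As an illustration of what `target_frames` implies, the sketch below picks evenly spaced frame indices so that any clip ends up with the requested length. The use of `np.linspace`, and the choice to repeat frames when a clip is shorter than the target, are assumptions. The notebook may handle these cases differently, and `adaptive_stride_indices` is a hypothetical name.

```python
import numpy as np

def adaptive_stride_indices(n_frames: int, target_frames=None) -> np.ndarray:
    """Frame indices that resample a clip of n_frames to target_frames."""
    if target_frames is None:                    # keep the original frame count
        return np.arange(n_frames)
    # Evenly spaced positions across the clip; short clips repeat frames.
    return np.linspace(0, n_frames - 1, num=target_frames).round().astype(int)
```

With a `(T, H, W)` array named `clip`, the resampled clip would be `clip[adaptive_stride_indices(len(clip), CONFIG['target_frames'])]`.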

Features

Single-file demo with step-by-step visualisation (input → ROI overlay → cropped output)
Batch processor with progress tracking, error handling, and summary statistics
NPZ inspector to verify saved outputs
Representative frame selection via Shannon entropy (avoids blank/transition frames)
Hardware acceleration — CUDA, CoreML, or CPU via ONNX Runtime providers
Optional dependency installer cell for quick environment setup
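
For the hardware-acceleration item, one way to choose ONNX Runtime execution providers is sketched below. The provider strings are the standard ONNX Runtime names; the preference order and the `pick_providers` helper are assumptions and not taken from the notebook.

```python
def pick_providers(available, use_gpu=False):
    """Order ONNX Runtime execution providers by preference, CPU always last."""
    preferred = ["CUDAExecutionProvider", "CoreMLExecutionProvider"] if use_gpu else []
    chosen = [p for p in preferred if p in available]
    return chosen + ["CPUExecutionProvider"]

# Usage with ONNX Runtime installed:
#   import onnxruntime as ort
#   providers = pick_providers(ort.get_available_providers(), use_gpu=True)
#   sess = ort.InferenceSession("models/echoroi_unified.onnx", providers=providers)
```

Keeping the CPU provider at the end of the list gives ONNX Runtime a fallback when an accelerator is requested but not available.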

Repository Structure

EchoROI/
├── data/
│   ├── images/          # 1,355 training images (PNG)
│   └── masks/           # 1,355 binary masks (PNG, from LabelMe)
├── models/
│   ├── echoroi_unified.keras   # Trained Keras model (373 MB)
│   └── echoroi_unified.onnx    # ONNX export (124 MB)
├── notebooks/
│   ├── 01_training_and_evaluation.ipynb # Training & evaluation
│   ├── 02_onnx_conversion.ipynb         # ONNX export & validation
│   ├── 03_inference_demo.ipynb          # Inference & visualisation
│   └── 04_dataset_preprocessing.ipynb   # DICOM preprocessing pipeline
├── echoroi/              # Python package
│   ├── model.py          # U-Net architecture
│   ├── preprocessing.py  # Image preprocessing
│   └── inference.py      # Prediction utilities
├── paper/
│   ├── paper.md          # JOSS manuscript
│   └── paper.bib         # References
├── tests/                # 23 unit tests
└── scripts/              # CLI utilities

Notebooks

#	Notebook	Description
01	Training & Evaluation	End-to-end training, augmentation, evaluation
02	ONNX Conversion	Export, validation, Keras-vs-ONNX comparison
03	Inference Demo	Inference, visualisation, ROI extraction
04	Dataset Preprocessing	DICOM → NPZ pipeline using ONNX model
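
The NPZ files produced by notebook 04 can be written and checked with plain NumPy. The sketch below is a guess at the general shape of that step: the array names `frames` and `mask` are hypothetical, and the keys stored by the notebook may differ.

```python
import os
import tempfile
import numpy as np

def save_clip(path, frames, mask):
    """Write a processed clip and its ROI mask to a compressed NPZ archive."""
    np.savez_compressed(path,
                        frames=frames.astype(np.float32),
                        mask=mask.astype(np.uint8))

def inspect_npz(path):
    """Return {array name: (shape, dtype)} for every array in the archive."""
    with np.load(path) as data:
        return {name: (data[name].shape, str(data[name].dtype))
                for name in data.files}

# Round trip on a dummy 32-frame, 112 x 112 clip.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "clip.npz")
    save_clip(demo_path, np.zeros((32, 112, 112)), np.ones((112, 112)))
    demo_info = inspect_npz(demo_path)
```

Running `inspect_npz` on a real output file is a quick way to see which keys the notebook actually stores.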

Testing

pytest tests/ -v

The 23 unit tests cover model architecture, preprocessing, inference, and I/O.

Note for macOS (Apple Silicon) users

The tensorflow-metal GPU plugin can deadlock inside Jupyter kernels on some Apple Silicon configurations. The inference notebook (03_inference_demo.ipynb) disables GPU devices automatically so that all operations run on the CPU. This has no practical impact — inference on 256 × 256 images takes less than 1 second per frame on CPU.

If you are not using Jupyter (e.g. running via the CLI or a Python script), the Metal GPU works normally.
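
For reference, hiding GPU devices from TensorFlow can be done with `tf.config.set_visible_devices`. The sketch below shows the general technique and is not copied from the notebook; `force_cpu` is a hypothetical helper. The call has to run before TensorFlow initialises its devices, which is why it belongs at the top of a notebook.

```python
def force_cpu() -> bool:
    """Hide all GPUs from TensorFlow; return True if the setting was applied."""
    try:
        import tensorflow as tf
    except ImportError:
        return False                     # TensorFlow not installed: nothing to do
    try:
        tf.config.set_visible_devices([], "GPU")
    except RuntimeError:
        # Devices were already initialised, so visibility can no longer change.
        return False
    return True
```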

How to Cite

If you use EchoROI in your research, please cite:

@article{ekambaram2026echoroi,
  title   = {{EchoROI}: A {U-Net}-based Python Tool for Echocardiographic
             {ROI} Segmentation and De-identification},
  author  = {Ekambaram, Kamlin and Arnab, Anurag and Herbst, Philip and
             Theart, Rensu},
  journal = {Journal of Open Source Software},
  year    = {2026}
}

License

MIT — see LICENSE.

Version 0.1.0, released Mar 21, 2026.
