Skip to main content

A standardized benchmark of ultrasound image analysis tasks.

Project description

UltraBench

arXiv

UltraBench is a standardized benchmark of 14 different convex ultrasound image classification and semantic segmentation tasks (7 classification and 7 segmentation tasks) drawn from the 10 different publicly available datasets covering 11 regions of the body. This repository contains the processing scripts to transform raw datasets from their original sources into a standardized format for training and evaluating machine learning models.

This benchmark was published alongside the article Revisiting Data Augmentation for Ultrasound Images. If you use this benchmark in your research please cite this article and the articles for each of the included datasets. Instructions for how to do so are provided at the end.

Regions of the body covered by UltraBench

Motivation

A common limitation of studies on ultrasound image analysis using machine learning is a lack of evaluations across multiple domains. However, this is made difficult by a lack of ultrasound image analysis tasks in existing medical image analysis benchmarks, such as MedMNIST, MedSegBench and the Medical Segmentation Decathlon. To address this issue, we created UltraBench. Our aim is to make it easier for researchers by standardizing the preprocessing of many publicly available ultrasound datasets and providing tools (e.g., dataset implementations for common deep learning frameworks) that (a) reduce the effort required to use them and (b) make it easier for researchers to compare results between papers.

Installation

Install UltraBench using:

pip install ultrabench

Getting Started

UltraBench is packaged as a commandline application. After installing, run ultrabench --help to get started.

For more information, checkout the Documentation on the wiki!

Current Datasets & Tasks

The following table contains the list of supported datasets and tasks. Click on the links to checkout the appropriate sections of the documentation or on the article/repository links to visit the original publications. We do not have the rights to republish the datasets, but links to the original articles and datasets are included here and in the Documentation.

Dataset Region Classification Tasks Segmentation Tasks
Annotated Ultrasound Liver (article, data) Liver Liver mass classification Liver segmentation, Liver mass segmentation
Butterfly (data) Multi-region Region classification
CAMUS (article, data) Heart Image quality classification Cardiac region segmentation
Dataset of B-mode fatty liver ultrasound images (article, data) Liver NFLD classification
Gallbladder Cancer Ultrasound (GBCU) (article, data) Gallbladder Gallbladder tumor classification
Multi-Modality Ovarian Tumor Ultrasound (MMOTU) (article, data) Ovaries Ovarian tumor classifcation Ovarian tumor segmentation
Open Kidney Ultrasound (article, data) Kidney Kidney capsule segmentation
Point-of-care Ultrasound (POCUS) (article, data) Lung COVID-19 classification
PSFHS (article, data) Fetus Fetal head and pubic symphysis segmentation
Stanford Thyroid (data) Thyroid Thyroid nodule segmentation

Contributing

We would love for this benchmark to grow and flourish into a resource that anyone with an interest in machine learning for ultrasound analysis can pick up and use quickly and easily. Any help addressing bugs, contributing new datasets or tasks, or any other improvements are welcome and appreiciated! I only ask that you respect the community guidelines laid out in the CODE_OF_CONDUCT.md. For more information on how to contribute, checkout out the Documentation.

To ensure that your code meets the style guidelines etc., make sure you install the optional development dependencies:

pip install '.[dev]'
nbstripout --install
pre-commit install

Questions

If you have any questions, please open an issue or contact us via email.

How to Cite UltraBench

If you use UltraBench in your research, please cite our article Revisiting Data Augmentation for Ultrasound Images and the articles for each of the datasets. This ensures that the work of the authors of the original datasets is properly acknowledged, helps more people find the benchmark and encourages us to continue maintaining and improving it!

@misc{tupper2025,
      title={Revisiting Data Augmentation for Ultrasound Images}, 
      author={Adam Tupper and Christian Gagné},
      year={2025},
      eprint={2501.13193},
      archivePrefix={arXiv},
      primaryClass={eess.IV},
      url={https://arxiv.org/abs/2501.13193}, 
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ultrabench-1.0.0.tar.gz (5.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ultrabench-1.0.0-py3-none-any.whl (33.0 kB view details)

Uploaded Python 3

File details

Details for the file ultrabench-1.0.0.tar.gz.

File metadata

  • Download URL: ultrabench-1.0.0.tar.gz
  • Upload date:
  • Size: 5.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.2

File hashes

Hashes for ultrabench-1.0.0.tar.gz
Algorithm Hash digest
SHA256 9ea455fa11b4f2fd8dd66b7800e5a97743143095f9d94f4d4d5e569020a77f13
MD5 3b36da487c5fb1ec38dd0281b878450c
BLAKE2b-256 9d315469958094f974659cd13eb8af859466985a72f046723b6db57e3afada29

See more details on using hashes here.

File details

Details for the file ultrabench-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: ultrabench-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 33.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.2

File hashes

Hashes for ultrabench-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fda957fb412bfdfb6dcd7b3a085930f18aecf141b1b4b44f812ab524606fb978
MD5 ddf944402212a3861fac7a3f3c70800d
BLAKE2b-256 d0421b01c1a5ebb2c9d376a89977b4a6afe5eab4770ade01cf0dee9255d16889

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page