A Label-Free and Data-Free Synthesis Engine and Training Framework for Vascular Segmentation of sOCT Data with PyTorch.
Project description
oct_vesselseg
Characterized by minimal priors and high variance sampling, this project builds on the emerging field of synthesis-based training by proposing an entirely data-free synthesis engine for training a Unet in the task of vascular labeling in sOCT data (mus modality). This project employs domain-randomized synthesis to create structured vessel and neural parenchyma labels, textures, and artifacts. It creates volumetric imaging data similar to, but not emulative of 3D sOCT data, with corresponding (perfect) ground truth labels for vasculature. The package contains a training module employing an on-the-fly image synthesis procedure to create a virtually infinite number of unique volumetric training data.
Table of Contents
Introduction
This project focuses on generating synthetic datasets for training a 3D U-Net in the task of vasculature segmentation in OCT data. Using a cubic spline synthesis pipeline first established in SynthSpline, and many data augmentation techniques from Cornucopia, this project employs domain-randomized synthesis to create structured labels, textures, and artifacts, enhancing the training of neural networks for vascular segmentation.
Check out our papers!
📜 July 2024: Journal Article Preprint
📜 April 2024: Conference/Short paper from Medical Imaging with Deep Learning Conference
Getting Started
Hard requirements include Cppyy~=2.3
and Python~=3.9
Installation
It is suggested that you create and activate a new mamba environment with python 3.9. You can learn how to install mamba by following the instructions provided in the Miniforge repo.
mamba create -n oct_vesselseg python=3.9
mamba activate oct_vesselseg
In order to synthesize vascular labels from splines, we will need to install the code from the synthspline repo.
pip install git+https://github.com/balbasty/synthspline.git#f78ba23
We need to identify and set our cuda version to make sure we install the right prebuilt wheel for cupy. You can find your cuda version by running the command nvcc --version
or nvidia-smi
.
export CUDA_VERSION=<cuda-version> # OR nvidia-smi
Finally, we can install oct_vesselseg from pypi.
pip install oct_vesselseg
Configuration
Identify the directory you want all oct_vesselseg related files to go into (you might need to make a new directory), and run the following command. This will add a line to your ~/.bashrc file to set a global environment variable OCT_VESSELSEG_BASE_DIR
. I recommend this to be an empty directory (or if you specify a non-existent directory, one will be made for you). PLEASE ENSURE IT IS A FULL (ABSOLUTE) PATH
oct_vesselseg configure
Usage
Vessel Synthesis
The vesselsynth
command is used to generate synthetic vascular labels that may be used, in conjunction with artifact and noise models, to train a 3D U-Net. These labels will be saved in the synthetic_data
directory within OCT_VESSELSEG_BASE_DIR
. This engine uses cubic splines and randomized domain-specific parameters to create highly variable yet structured vascular geometries.
Basic Usage
To generate a synthetic vascular dataset with default parameters, use:
oct_vesselseg vesselsynth
There are many different parameters you can specify with this command using these flags:
Customizing the Vessel Label Synthesis Engine
You can customize the vessel label synthesis engine with several flags, each corresponding to a specific aspect of the geometry of the volume generated, a single vascular tree, or individual branches. Below is a summary of some of the most important flags
-
--shape
: This is the shape of the volume (input data) to be synthesized. This does not need to be a perfect cube. This will also be the shape of the UNet you will train. -
--voxel-size
: This is the spatial resolution of the data to be synthesized measured in units of $\frac{mm^3}{voxel}$. The default value is $0.02 \frac{mm^3}{voxel}$. -
--tree-levels
: Sampler bounds for the number of hierarchical levels in the vascular tree. The level of a branch in a vascular tree refers to the distance (in terms of number of branches) from the root node to the node from which the branch in question originates. This quantity is sampled from a discrete uniform distribution for each tree that is created.- Root Branch (level 0): The root branch is at level 0.
- First Branch (level 1): The children of the root branch are at level 1.
- Second level (level 2): The grandchildren of the root are at level 2.
-
--tree-density
: Sampler bounds for the number of trees (or root points) per volume in units of $\frac{trees}{mm^3}$. This quantity is sampled from a uniform distribution for each volume that is created. -
--tree-root-radius
: Sampler bounds for the radius of the first branch (root, Level 0). This quantity is sampled from a uniform distribution for each tree that is created. -
--branch-tortuosity
: Sampler bounds for the tortuosity of a given branch. This quantity is sampled from a uniform distribution for each branch that is created. Tortuosity is defined as such:- $tortuosity = \frac{cord}{length}$
-
--branch-radius-ratio
: Sampler bounds for the ratio of the radius of the child branch compared to the radius of the parent branch. This is sampled from a uniform distribution for each child branch that is created. -
--branch-radius-change
: Sampler bounds for a multiplicative variation in radius along the legth of a vessel. This is sampled from a uniform distribution for each branch. -
--branch-children
: Sampler bounds for the number of children per parent. This is sampled from a discrete uniform distribution for each parent branch.
Example
To generate a dataset with data of shape (256, 256, 256) voxels, a resolution of 0.65 $\frac{mm^3}{voxel}$, and highly tortuous vessels:
oct_vesselseg vesselsynth --shape 256,256,256 --voxel_size 0.65 --branch-tortuosity 4.0,5.0
OCT Image Synthesis
The imagesynth
command generates synthetic OCT images which are made on-the-fly during the training process. By saving them to OCT_VESSELSEG_BASE_DIR/synthetic_data/exp000*/sample_vols
with this command, this allows us to visualize our desired noise and artifact parameters before using them to train the network.
Basic Usage
To generate OCT images with default settings, run:
oct_vesselseg imagesynth
Customizing the Image Synthesis Engine
You can customize the synthesis process using the following parameter flags:
-
--data-experiment-n
: Specifies the vesselsynth data experiment number for loading volumetric dat. -
--n-samples
: The number of synthetic OCT images to generate. -
--parenchyma-classes
: The upper bound for the number of classes of parenchyma (neural tissue) in the image synthesis. -
--parenchyma-shape
: The upper bound for the number of control points defining the shape of each parenchyma class. -
--vessel_intensity
: The bounds for the attenuation of the vessel intensities when blending them onto the parenchyma. Lower magnitude results in vessels that are darker than their surrounding tissue/background. -
--vessel_texture
: Optionally apply intra-vascular textures and artifacts. -
--image-gamma
: The bounds for non-linear contrast adjustment. Higher values increase contrast, while lower values decrease it. -
--image-z-decay
: The upper bound for z decay, roughly approximating the banding artifact due to serial sectioning. -
--image-speckle
: The bounds for speckle noise parameters. -
--image-spheres
: Whether to add spherical artifacts to the image. -
--image-banding
: Optionally to apply slabwise banding (z-decay) artifacts to the image. -
--image-dc-offset
: Optionally add a small DC offset to the parenchyma tensor.
Example Usage
Here's how you might use the imagesynth
command to turn 20 vascular labels into volumes with 7 classes (distinct intensities) of neural parenchyma, vessels that are much darker than the background tissue, and with low overall contrast:
oct_vesselseg imagesynth --n_samples 20 --parenchyma_classes 7 --vessel_intensity 0.1,0.2 --image_gamma 0.5,0.75
Training
Train the model on the vessel labels and on-the-fly OCT image synthesis. The models will go into a subdirectory of OCT_VESSELSEG_BASE_DIR
called models
.
oct_vesselseg train
Inference
Run inference on a compatable NIfTI file. Th
oct_vesselseg test --in-path <path-to-NIfTI>
Results
Here we provide some examples of synthetic vasculature generated by this method:
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgements
- SynthSeg, SynthSpline, and Cornucopia, for the inspiration and methodological foundation.
- Much of the computation resources required for this research was performed on computational hardware generously provided by the Massachusetts Life Sciences Center.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file oct_vesselseg-0.0.5.tar.gz
.
File metadata
- Download URL: oct_vesselseg-0.0.5.tar.gz
- Upload date:
- Size: 42.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.10.0 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/1.0.0 urllib3/1.26.19 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e859597dbd18c92b8510dc0e8627ce9769e798ca191e28c27fb1a3a32849fad8 |
|
MD5 | aedb8b5a06ff53b2094eb895dcbd8668 |
|
BLAKE2b-256 | bdae95fad1edfacba65b52af3e02e7c4be51a8e81937bc12602dbc9993276dd9 |
File details
Details for the file oct_vesselseg-0.0.5-py3-none-any.whl
.
File metadata
- Download URL: oct_vesselseg-0.0.5-py3-none-any.whl
- Upload date:
- Size: 42.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.8.0 pkginfo/1.10.0 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/1.0.0 urllib3/1.26.19 tqdm/4.64.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.5 CPython/3.6.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 73f4bb53acab67594fa92a4faf2353286ee7edbc8da376d1cbb2e20ad61d594f |
|
MD5 | 6e6922aebc3d68d30156cef6e259a54b |
|
BLAKE2b-256 | 4bf60a8fd4d8524dfa1b43a5d899e5fa0e2870edccc1a6c512485f7f3cd66ee5 |