Skip to main content

Napari plugin for the Pentachrome histology pipeline: VSI extraction, nnUNet inference, statistics.

Project description

pentachrome-plugin

Napari plugin for the Pentachrome histology pipeline. The public plugin exposes a single clinician-facing widget, Guided Analysis, that walks through the whole workflow in one pane:

  1. Extract — pull tissue-region TIFFs out of Olympus .vsi files.
  2. Detect — run the trained nnUNet Epithelium / MultiStructure models on those images and load colorized masks back into the viewer.
  3. Measure — per-region statistics (thickness, composition, cell densities), with CSV export.

Each phase also exists as a separate advanced widget (VsiExtractorWidget, NnUnetInferenceWidget, AnalysisWidget) used during development. These are not registered in the public menu — see From source (development).

Source and issues: https://github.com/dtsilis7/Pentrachrome-Pipeline

Requirements: Windows, Python 3.10, napari >= 0.4.18, a JDK 17, the Java/bioformats stack, and the nnUNet model weights. The Java stack and the weights are installed separately (see below) — they can't come from a plain pip install.

Installation (Windows, PowerShell)

A working install has three parts, in this order:

  1. A conda environment (Python 3.10) with NumPy<2, a JDK, and the Java/bioformats stack
  2. The plugin itself
  3. The nnUNet model weights (downloaded separately)

⚠️ pip install pentachrome-plugin on its own is not enough. It gives you the napari UI, but VSI extraction fails without the Java/bioformats stack and inference fails without the model weights. Installing through napari's plugin manager only does the pip part — you still need steps 1 and 3 in the same env.

Step 1 — environment + Java/bioformats stack

openjdk and numpy<2 come from conda-forge, but python-javabridge and python-bioformats are not packaged on conda-forge — they come from pip. Keeping Python at 3.10 lets pip grab the prebuilt python-javabridge wheel instead of compiling it.

conda create -n pentachrome python=3.10 -y
conda activate pentachrome
conda install -c conda-forge openjdk=17 "numpy<2" -y    # JDK + NumPy 1.x FIRST
pip install python-javabridge python-bioformats          # Java wrappers (pip, not conda)

Order matters: numpy<2 must be in place before javabridge installs, because the javabridge C extension is built against the NumPy 1.x ABI. If pip can't find a prebuilt wheel for your Python and falls back to compiling, add --no-build-isolation to the pip line (so the build sees the pinned NumPy) and make sure the MS C++ Build Tools and the JDK are present.

Step 2 — the plugin

pip install pentachrome-plugin
pip install nnunetv2          # required for the Detect step

You can also install the plugin through napari's Plugins -> Install/Uninstall Plugins dialog (search "pentachrome"), but that only covers this step — you still need Step 1 and Step 3 in the same environment.

Step 3 — model weights

Download the nnUNet weights and point the widget at them — see Model weights below.

Verify

python -m napari

In napari, open Plugins -> Guided Analysis — the widget should load and show its Extract / Analyze / Statistics steps.

From source (development)

For working on the plugin itself, do Step 1 above, then install editable from a checkout instead of from PyPI (cd into the plugin directory first, or pass the absolute path):

conda activate pentachrome
cd "...\pentachrome_plugin"
pip install -e .

The three phase widgets are de-registered from the public menu. To open them standalone during development, run the repo's dev_widgets.py — it docks the Extractor / Inference / Statistics widgets as tabs:

python dev_widgets.py

Launch

conda activate pentachrome   # or whichever env you installed into
python -m napari

In napari: Plugins -> Guided Analysis.

The per-phase sections below (nnUNet Inference, Mask Statistics, etc.) describe the underlying widgets, which the Guided Analysis pane drives end-to-end. Their Plugins -> ... menu references apply only to the dev widgets opened via dev_widgets.py; end users reach the same functionality through Guided Analysis.

Model weights

The nnUNet weights (~900 MB) aren't bundled in the PyPI package. Download [nnunet_results.zip] from the releases page, unzip it, and point the inference widget's nnUNet results field at the extracted folder (the one containing Dataset001_Epithelium and Dataset002_MultiStructure).

nnUNet Inference (Phase 2)

Requires nnunetv2 installed in the same environment (the nnUNetv2_predict CLI must be on PATH) — this is covered by Step 2 of Installation above.

Workflow:

  1. Load TIFFs into napari (e.g. via Phase 1's auto-load checkbox, or drag-and-drop).
  2. Open Plugins -> nnUNet Inference.
  3. Select one or more image layers in the list.
  4. Tick Epithelium, MultiStructure, or both.
  5. Set Output folder (where raw + colorized masks go) and nnUNet results (folder containing Dataset001_Epithelium and Dataset002_MultiStructure). The results path auto-fills if nnUNet_Training/nnUNet_results/results is found.
  6. Pick Device (cpu or cuda) and click Analyze.

Speed vs quality (important on CPU)

nnUNet inference on a laptop CPU is slow because every image goes through folds × mirror augmentations × sliding-window patches forward passes. With defaults that can be 20+ passes per image. The widget exposes three knobs in the Speed / quality group:

Knob Default What it does
Epithelium folds Fold 0 only Use 1 of the 5 trained folds for Dataset001. All 5 ensembled is best quality but ~5x slower. Dataset002 only has fold 0 trained, so it's always 1 fold.
Disable test-time mirroring on Passes --disable_tta. Skips the 4 mirror augmentations the model normally averages over. ~4x faster, small accuracy hit.
Sliding-window step 0.5 Passes -step_size. Larger = fewer overlapping patches = faster but rougher tile borders. Try 0.7 for a middle ground.

With all three defaults on a CPU laptop, one ROI tile should take a few minutes instead of 30+. Switch to All 5 folds + TTA on once you've moved to a GPU box.

Continuing from the extractor

The two widgets are linked through two small bridges, so you can run Extract -> Analyze in a single napari session without re-picking files:

  • When the extractor auto-loads a TIFF as a viewer layer, it stashes the on-disk path on layer.metadata['source_tiff']. The inference widget reads that during staging and copies the original file into _staging_input/ rather than re-saving the in-memory array, important for 15k x 15k tiles.
  • When an extraction completes, the inference widget's "Use last extractor output" button pre-fills the output folder to <extractor_output_root>/_inference, so masks land next to the per-VSI subfolders the extractor created.

Both bridges are in-process only (see _session.py); they reset when napari closes.

Outputs land in:

<output_folder>/
  _staging_input/            # nnUNet-named (_0000.tif) copies of the selected layers
  epithelium_raw/            # binary masks from Dataset001
  epithelium_colored/        # RGB colorized masks (red epithelium)
  multistructure_raw/        # 6-class masks from Dataset002
  multistructure_colored/    # RGB colorized masks (Elastin/Collagen/Nuclei/Mucins/Membrane/Goblets)

Colorized masks are added to the viewer as RGB image layers when the run finishes.

nnUNet inference architecture

Same subprocess pattern as Phase 1. The widget never imports torch or nnUNetv2 directly; it spawns _inference_worker.py which:

  • sets nnUNet_results to the configured results dir,
  • calls nnUNetv2_predict once per enabled model (folds 0-4 for Epithelium, fold 0 for MultiStructure, matching run_inference.py),
  • colorizes the resulting integer masks with the palettes from colorize_masks.py / compare_grid.py,
  • streams JSON-line events on stdout for the widget's progress bar and log.

How it works

  • The widget itself never touches the JVM. When you click Extract ROIs, it spawns _vsi_worker.py as a separate Python process.
  • That worker process starts the bioformats JVM, loops over the VSI files using TileMaskStitcher (reused from VSI_Handler/tile_mask_stitcher.py), writes numbered TIFFs into <output_root>/<vsi_basename>/, and emits JSON-line progress events on stdout.
  • The widget streams those events on a background thread and updates the progress bar / log without blocking the UI.
  • When the worker exits, the JVM dies with it. The next extraction batch starts a fresh JVM in a fresh process - this avoids the "JVM cannot be restarted" pitfall during a long napari session.

Defaults

The parameter defaults mirror Processing_VSI_Files.py:

Parameter Default
Series 6
Tile width / height 15000
Threshold 50
Min ROI area 150000
Merge margin 1000
Extra crop margin 100

Layout

pentachrome_plugin/
  pyproject.toml
  README.md
  src/pentachrome_plugin/
    __init__.py
    napari.yaml             # napari manifest
    _session.py             # in-process cross-widget state (extractor -> inference -> analysis)
    _widget.py              # VsiExtractorWidget (Phase 1)
    _vsi_worker.py          # VSI subprocess entrypoint
    _inference_widget.py    # NnUnetInferenceWidget (Phase 2)
    _inference_worker.py    # nnUNet subprocess entrypoint
    _analysis_widget.py     # AnalysisWidget (Phase 3, in-process)

Phase 3 (Mask Statistics) lives alongside these and registers through napari.yaml.

Mask Statistics (Phase 3)

Pure in-process; no subprocess needed (no JVM, no torch). Reuses EpithelialAnalysis/Analyzers/ (Descriptors.py, Thickness.py), so the same metrics that fed the original region_summary.csv show up in the widget.

Workflow:

  1. Run Phase 2 first so epithelium_raw/ and multistructure_raw/ exist.
  2. Open Plugins -> Mask Statistics.
  3. Select one or more image layers in the list (their names must match the mask filenames in epithelium_raw/ / multistructure_raw/; if the inference widget staged them, that's already true).
  4. Click Use last inference output (or browse).
  5. Tweak Pixel size, Region dilation, Min epithelium area if needed (defaults match Main.py).
  6. Click Analyze.

For each detected epithelial region the widget reports:

Column What it is
Area (mm^2) Region area after the 50 um dilation
Thickness mean/std (um) Medial-axis thickness of (membrane within eroded region) U goblets U nuclei
Elastin / Collagen / Other % Fraction of stained structure pixels, same definition as compute_structure_percentages
Mucin % Mucin pixels as a fraction of the epithelium area (not of total structure pixels)
Nuclei / mm^2 and Goblets / mm^2 Density per mm^2 of epithelium, goblet hyperplasia is a classic COPD readout
Nuclei (n), Goblets (n) Raw counts inside the region

A bold (all regions) row appended per image gives area-weighted means of the percentages / thickness and totals for the counts. Export CSV... saves the whole table (per-region rows + aggregate rows).

The elastin organization score (ElastinAnalyzer.determine_organized_region) from Main.py is intentionally not yet exposed, it's much heavier (skan + shapely + ROI polygons) and will land as a separate toggle.

Class isolation

A "Class isolation" group at the top of the widget lets you view a single class (or a combination) without rerunning anything:

  1. Pick a source layer (the original TIFF, not a colorized mask).
  2. Tick one or more of Elastin, Collagen, Nuclei, Mucins, Cell Membrane, Goblets, Epithelium.
  3. Click one of:
    • Show as mask — adds a new layer that's white everywhere except the ticked classes, colored with the same palette as the inference widget.
    • Show on original — adds a copy of the original image with all pixels outside the ticked classes turned white. Useful for sanity- checking the segmentation against the stain.
  4. Clear isolated layers removes everything this panel added in one go.

Masks are read on demand from the inference output folder; the original layer's pixels are taken from the viewer.

License

This project is licensed under the MIT License, see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pentachrome_plugin-0.4.4.tar.gz (75.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pentachrome_plugin-0.4.4-py3-none-any.whl (77.4 kB view details)

Uploaded Python 3

File details

Details for the file pentachrome_plugin-0.4.4.tar.gz.

File metadata

  • Download URL: pentachrome_plugin-0.4.4.tar.gz
  • Upload date:
  • Size: 75.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.20

File hashes

Hashes for pentachrome_plugin-0.4.4.tar.gz
Algorithm Hash digest
SHA256 eff458461fd3545d08ffd30a587b788661b5c651a7a3b46ed310dbe6c02af2a2
MD5 567a012aaca2b51f12b6781f5e4a2de0
BLAKE2b-256 7b773ea418c2df3faed0f65e1ddcda0fb6218d814e15b91c1f8b961c7ac68c70

See more details on using hashes here.

File details

Details for the file pentachrome_plugin-0.4.4-py3-none-any.whl.

File metadata

File hashes

Hashes for pentachrome_plugin-0.4.4-py3-none-any.whl
Algorithm Hash digest
SHA256 fa55ccef2121be5be2253665372c7cabf973fae0326af39237cf555543620815
MD5 4bd0f9656fcec4a66f6caeb69bc0e8a6
BLAKE2b-256 eabeb982584bb544db33a51cc42050ac840ccd0d17eab91ea12097d1c648c713

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page