Official package to compute metrics for the BraTS segmentation challenges.

These details have not been verified by PyPI

Project links

Repository

Project description

BraTS Evaluation

The Brain TumorS aka Brain Tumor Segmentation (BraTS) challenge is a globally recognized community benchmark for the evaluation of automated segmentation algorithms in neuro-oncology. Over the years, BraTS has expanded to encompass a variety of specialized tasks, including:

Glioma Segmentation: The flagship task, focusing on the delineation of distinct sub-regions (e.g., enhancing tumor, tumor core, and whole tumor) in adult gliomas.
Pediatric Tumor Segmentation: Targeting brain tumors in pediatric patients, addressing the distinct anatomical and pathological characteristics seen in this population.
Brain Metastasis Segmentation: Focusing on the detection and segmentation of metastatic brain lesions, which are often small, numerous, and anatomically diverse.
Meningioma Segmentation: Evaluating the accurate boundary delineation of meningiomas, the most common primary central nervous system tumor.

Robust, and rigorous evaluation of segmentation algorithms across these diverse tasks is essential to accurately gauge clinical applicability and algorithmic performance.

Panoptica: Instance-Wise Evaluation

Panoptica is a comprehensive Python library designed to bridge the gap between global semantic evaluation and clinical necessity by enabling rigorous instance-wise and lesion-wise quantification. While traditional metrics like the whole-volume Dice score often mask critical individual detection errors, Panoptica isolates and evaluates discrete structures such as tumor subregions through a robust pipeline of instance approximation, matching, and evaluation.

It computes a comprehensive suite of vital detection and segmentation metrics like:

Detection metics True Positive, False Positive, and False Negative detection rates,
Instance-specific overlap metrics including Intersection over Union (IoU), instance-level Dice scores, and Average Precision (AP).
Instance-specific distance metrics such as Normalized surface distance (NSD), and Hausdorff distance (HD95).
This makes Panoptica a reliable tool for benchmarking deep learning models in medical image segmentation tasks, standardizing clinical research pipelines, and ensuring that medical image segmentation models are evaluated on their true clinical utility rather than just gross volumetric overlap.

Installation

pip install BraTS-evaluation

This installs the brats_evaluation Python package and exposes two console scripts: brats-evaluate and brats-parse-metrics.

Usage

The evaluation pipeline runs in two steps: produce a JSON summary with brats-evaluate, then turn that JSON into a CSV report with brats-parse-metrics. Either step can also be driven from Python.

1. Run the evaluation (`brats-evaluate`)

This command evaluates prediction NIfTI files against reference (ground truth) NIfTI files using the Panoptica framework.

Command:

brats-evaluate \
    --config mets \
    --ref_path /path/to/reference/niftis/ \
    --pred_path /path/to/prediction/niftis/ \
    --summary_json ./panoptica_evaluation_summary.json

Use --config with a bundled config name (mets, gli, ped, MenRT, MenPre, GoAT), or --config_path to point at a custom YAML file.

Arguments:

--config: Name of a bundled Panoptica config (mets, gli, ped, MenRT, MenPre, GoAT).
--config_path: Path to a custom Panoptica configuration YAML file (mutually exclusive with --config).
--ref_path: Path to the directory containing reference (ground truth) NIfTI files.
--pred_path: Path to the directory containing prediction NIfTI files.
--summary_json: (Optional) Output path for the JSON file summarizing all evaluation metrics. Default: ./panoptica_evaluation_summary.json.
--num_subjects: (Optional) Number of subjects to process (e.g. --num_subjects 5). Useful for quick testing. If omitted, all subjects are processed.

2. Parse the results (`brats-parse-metrics`)

Once the evaluation is complete, a JSON file will be created which includes all the quantified metrics. In order to extract only the metrics which are used for the BraTS Leaderboard and ranking, use the parser command to extract these metrics into a clean CSV format.

The parser supports two commands: seg (for all segmentation tasks except for the Metastasis) and mets (for only the Metastasis task which needs both segmentation and detection metrics).

Command (Basic Segmentation Metrics):

brats-parse-metrics seg \
    --json_path ./panoptica_evaluation_summary.json \
    --output_csv_path ./parsed_panoptica_seg_stats.csv

Command (Metastasis/Detailed Instance Metrics):

brats-parse-metrics mets \
    --json_path ./panoptica_evaluation_summary.json \
    --vol_threshold 20.0 \
    --overlap_threshold 0.1 \
    --output_csv_path ./parsed_panoptica_mets_stats.csv

Arguments for mets command:

--vol_threshold: Volume threshold to differentiate between large and small lesions (e.g., 20.0 voxels/mm3 depending on your config).
--overlap_threshold: Dice score threshold to classify small lesions as True Positive (TP) or False Negative (FN).

Python library

Call the evaluator directly from your own Python code:

from panoptica import Panoptica_Evaluator
from brats_evaluation import config_path, evaluate_single_exam

# Bundled configs: "mets", "gli", "ped", "MenRT", "MenPre", "GoAT"
evaluator = Panoptica_Evaluator.load_from_config(str(config_path("mets")))
results = evaluate_single_exam(
    prediction_filepath="path/to/pred.nii.gz",
    reference_filepath="path/to/ref.nii.gz",
    subject_identifier="case-001",
    evaluator=evaluator,
)
print(results)

For a runnable, end-to-end example using the bundled sample data see ./example/programmatic_example.py.

Example notebook

For a complete, step-by-step walkthrough of the evaluation and parsing process, see the Jupyter notebook at ./example/brats_mets.ipynb.

Modifying the pipeline

If you want to tweak the evaluation logic or the Panoptica configs, clone the repo and install with Poetry.

Create and activate a Python environment using either conda or the built-in venv:

# Option 1 — conda
conda create -n brats_eval python=3.10
conda activate brats_eval

# Option 2 — venv (no conda required)
python3.10 -m venv .venv
source .venv/bin/activate   # Windows: .venv\Scripts\activate

Then clone the repo and install with Poetry:

git clone https://github.com/BraTS/BraTS_evaluation.git
cd BraTS_evaluation
poetry install

If Poetry is not yet available, install it via either route:

conda install -c conda-forge poetry (for conda users — keeps Poetry inside the env)
curl -sSL https://install.python-poetry.org | python3 - (official standalone installer)

References

BraTS Challenge: Brain TumorS (BraTS) Challenge
Panoptica Library: Panoptica evaluation framework

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

0.0.8

Jun 4, 2026

0.0.7

Jun 2, 2026

This version

0.0.6

May 26, 2026

0.0.5

May 26, 2026

0.0.4

May 22, 2026

0.0.3

May 22, 2026

0.0.2

May 22, 2026

0.0.1

May 22, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

brats_evaluation-0.0.6-py3-none-any.whl (17.9 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file brats_evaluation-0.0.6-py3-none-any.whl.

File metadata

Download URL: brats_evaluation-0.0.6-py3-none-any.whl
Upload date: May 26, 2026
Size: 17.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for brats_evaluation-0.0.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9111f04036bb01ccdb47b9968dec04f52402dbc6f83953a673193c0afe2ae150`
MD5	`3f46fd00f8c01ea1b6ff1c4f0b1ab6b8`
BLAKE2b-256	`c38c10d6fe383f49ea1064adee0021a927116dfba4aa500fb0fd7aea6bb7f334`

See more details on using hashes here.

BraTS-evaluation 0.0.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BraTS Evaluation

Panoptica: Instance-Wise Evaluation

Installation

Usage

1. Run the evaluation (`brats-evaluate`)

2. Parse the results (`brats-parse-metrics`)

Python library

Example notebook

Modifying the pipeline

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

BraTS-evaluation 0.0.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BraTS Evaluation

Panoptica: Instance-Wise Evaluation

Installation

Usage

1. Run the evaluation (brats-evaluate)

2. Parse the results (brats-parse-metrics)

Python library

Example notebook

Modifying the pipeline

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

1. Run the evaluation (`brats-evaluate`)

2. Parse the results (`brats-parse-metrics`)