Skip to main content

Dataset validation toolkit for neurology brain imaging (NIfTI)

Project description

DOI

NeuroTK: Dataset Validation for Neurology Brain Imaging

Motivation

Neurology brain imaging datasets are heterogeneous and frequently contain inconsistencies. Geometry, spacing, orientation, and annotation issues occur commonly across CT and MRI collections. These problems often surface late in modeling, when remediation is costly and compromises reproducibility. NeuroTK surfaces issues early, explicitly, and reproducibly to support dataset hygiene prior to analysis.

Scope

NeuroTK focuses on dataset quality assurance prior to downstream analysis. It provides dataset-level and file-level validation with structural and geometric consistency checks, and assessment of annotation presence and integrity.

  • Dataset-level and file-level validation
  • Structural and geometric consistency checks
  • Annotation presence and integrity assessment

NeuroTK does not modify scientific data.

Installation

pip install neurotk

Quickstart

neurotk validate --images imagesTr --labels labelsTr --out report.json

Inputs are expected as flat directories of NIfTI files, and filenames must match exactly for image–label pairing.

dataset/
  imagesTr/
    case_001.nii.gz
    case_002.nii.gz
  labelsTr/
    case_001.nii.gz
    case_002.nii.gz

Output

NeuroTK emits a JSON report containing a dataset-level summary, per-file diagnostics, and explicit listings of detected issues.

{
  "summary": {"num_images": 100, "files_with_issues": 7},
  "files": {"case_001.nii.gz": {"issues": ["label_missing"]}}
}

Citation

If you use NeuroTK in your research, please cite it as follows:

@software{neurotk,
  title  = {NeuroTK: Dataset Validation for Neurology Brain Imaging},
  author = {Sakshi Rathi},
  year   = {2026},
  url    = {https://github.com/SakshiRa/neurotk},
  note   = {Open-source toolkit for dataset validation and quality assurance in neurology brain imaging}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

neurotk-0.1.0.tar.gz (10.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

neurotk-0.1.0-py3-none-any.whl (11.9 kB view details)

Uploaded Python 3

File details

Details for the file neurotk-0.1.0.tar.gz.

File metadata

  • Download URL: neurotk-0.1.0.tar.gz
  • Upload date:
  • Size: 10.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.11

File hashes

Hashes for neurotk-0.1.0.tar.gz
Algorithm Hash digest
SHA256 a13b79ac378ce2e93dfd582149956a04b7026f5bb736b577f3329b0778d290c4
MD5 f9eae0e6fa9e9b54c91a6ec9e90bc7a8
BLAKE2b-256 d72d68bee9d8a1dfda334d1ccb6af94a8152ab242a6e3f1c861be329c92dd7ec

See more details on using hashes here.

File details

Details for the file neurotk-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: neurotk-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 11.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.11

File hashes

Hashes for neurotk-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5036482958f83efe3da2f4a4d4a70c583fb7e5484a09f756ff46ad89f4771922
MD5 aa32734bf34e0a2718d65186ee013892
BLAKE2b-256 9d48463e20109febd6f09875031228e2c5061981c19de757b91a226855e583b3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page