Reusable training, evaluation, and Grad-CAM utilities for CSV-backed image classification.

Project description

visionkit

visionkit is a small PyTorch package for image-classification projects where image metadata is stored in a CSV file and the image files live in a directory. It provides a reusable workflow for training, evaluation, prediction tables, and Grad-CAM visualization.

It turns the notebook workflow into a package that can be reused with different CSV files and image folders:

Load image filenames, labels, and train/validation splits from a CSV
Train torchvision models such as EfficientNet or ResNet
Save checkpoints, training logs, prediction CSVs, and metric CSVs
Generate Grad-CAM heatmaps and overlay images

Quick Start

Prepare an image directory and a CSV file.

your_project/
  data/
    images/
      case001.png
      case002.png
      case003.png
    meta.csv
  outputs/

The minimal meta.csv should look like this:

filename,label,split
case001.png,F,train
case002.png,N,train
case003.png,F,validation
case004.png,N,validation

Train a model:

visionkit train \
  --csv data/meta.csv \
  --image-dir data/images \
  --output-dir outputs/experiment_001 \
  --classes F N \
  --positive-class N \
  --epochs 30

Evaluate the validation split:

visionkit evaluate \
  --csv data/meta.csv \
  --image-dir data/images \
  --output-dir outputs/experiment_001 \
  --checkpoint outputs/experiment_001/models/best.pt \
  --classes F N \
  --positive-class N \
  --split validation

Generate Grad-CAM overlays:

visionkit gradcam \
  --csv data/meta.csv \
  --image-dir data/images \
  --output-dir outputs/experiment_001 \
  --checkpoint outputs/experiment_001/models/best.pt \
  --classes F N \
  --positive-class N \
  --split validation \
  --target-class N

Required CSV

For CLI training, the CSV usually needs these three columns.

column	required	example	meaning
`filename`	yes	`case001.png`	Image path relative to `--image-dir`
`label`	yes	`F` / `N`	Class label
`split`	yes	`train` / `validation`	Which split the row belongs to

The default column names are filename, label, and split. If your CSV uses different names, pass them explicitly:

visionkit train \
  --csv data/metadata.csv \
  --image-dir data/fig \
  --output-dir outputs/run_001 \
  --filename-column image_name \
  --label-column diagnosis \
  --split-column fixed_family_level_split \
  --train-split train \
  --val-split validation

Extra CSV columns are allowed. Columns such as patient_id, family_id, or age are carried through into the prediction CSVs.

filename,label,split,patient_id,family_id
case001.png,F,train,P001,A
case002.png,N,train,P002,B
case003.png,F,validation,P003,C

Image Paths

filename is interpreted as a path relative to --image-dir.

If all images are in the same folder:

filename,label,split
case001.png,F,train
case002.png,N,validation

If images are stored in subfolders, include the subfolder in filename:

filename,label,split
train/case001.png,F,train
validation/case002.png,N,validation

This expects files like:

data/images/train/case001.png
data/images/validation/case002.png

Classes And Positive Class

For binary classification, the order of --classes defines the class indices.

--classes F N

This means:

F is class 0
N is class 1

If you want AUC, sensitivity, specificity, and PR-AUC to use a specific positive class, set it explicitly:

--positive-class N

If --classes is omitted, class names are inferred from the labels in the CSV. For reproducible research or paper figures, it is better to pass --classes explicitly because class order affects the numeric labels and positive class.

Outputs

With --output-dir outputs/experiment_001, the package creates files like:

outputs/experiment_001/
  models/
    best.pt
    epoch_001.pt
    epoch_002.pt
  logs/
    training_history.csv
  predictions/
    validation_predictions.csv
    validation_predictions_metrics.csv
    gradcam_validation.csv
    gradcam_validation_metrics.csv
  gradcam/
    validation/
      case001__pred-N.png

By default, best.pt is the checkpoint from the epoch with the best val_auc.

Common Commands

Use a torchvision model other than EfficientNet-B3:

visionkit train \
  --csv data/meta.csv \
  --image-dir data/images \
  --output-dir outputs/resnet50_run \
  --architecture resnet50 \
  --image-size 224x224 \
  --classes F N \
  --positive-class N

Use a CSV layout similar to the original notebook:

visionkit train \
  --csv train_val/data/meta/fixed/fixed_family_level_split.csv \
  --image-dir train_val/data/fig \
  --output-dir train_val/review_outputs/family_level_split_efficientnetb3_seed123 \
  --filename-column filename \
  --label-column label \
  --split-column fixed_family_level_split \
  --train-split train \
  --val-split validation \
  --classes F N \
  --positive-class N \
  --architecture efficientnet_b3 \
  --image-size 300x300

Python API

from pathlib import Path

from visionkit.config import ExperimentConfig
from visionkit.pipeline import run_training, run_evaluation, run_gradcam

config = ExperimentConfig(
    csv_path=Path("data/meta.csv"),
    image_dir=Path("data/images"),
    output_dir=Path("outputs/experiment_001"),
    class_names=["F", "N"],
    positive_class="N",
    split_column="split",
    train_split="train",
    val_split="validation",
    architecture="efficientnet_b3",
    image_size=(300, 300),
    epochs=30,
)

model, history = run_training(config)

predictions, metrics = run_evaluation(
    config,
    checkpoint_path=config.output_dir / "models" / "best.pt",
    split_name="validation",
)

gradcam_results, gradcam_metrics = run_gradcam(
    config,
    checkpoint_path=config.output_dir / "models" / "best.pt",
    split_name="validation",
    target_class="N",
)

Install

Use the package locally in editable mode:

pip install -e .

If PyTorch is not installed yet, install torch and torchvision first using the command appropriate for your CPU or CUDA environment.

Project details

Release history Release notifications | RSS feed

This version

0.1.2

Jun 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

visionkit_pro-0.1.2.tar.gz (15.9 kB view details)

Uploaded Jun 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

visionkit_pro-0.1.2-py3-none-any.whl (18.5 kB view details)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file visionkit_pro-0.1.2.tar.gz.

File metadata

Download URL: visionkit_pro-0.1.2.tar.gz
Upload date: Jun 23, 2026
Size: 15.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for visionkit_pro-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`c1c950ac768acc0d16674aa38d69078a30b55451521e03eb7ad7dd5636e5ac64`
MD5	`5dbd11b5b73fbb5ac3ae80dff5defc34`
BLAKE2b-256	`b889256b7fdf3d4598ab9ae17d3169b7c0c5dfc91fd4123029589b22243fc447`

See more details on using hashes here.

Provenance

The following attestation bundles were made for visionkit_pro-0.1.2.tar.gz:

Publisher: python-publish.yml on mizomizo1/visionkit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: visionkit_pro-0.1.2.tar.gz
- Subject digest: c1c950ac768acc0d16674aa38d69078a30b55451521e03eb7ad7dd5636e5ac64
- Sigstore transparency entry: 1932409649
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: mizomizo1/visionkit@45b39e5181fff92e4fb908ec5804db44c973b92c
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/mizomizo1
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@45b39e5181fff92e4fb908ec5804db44c973b92c
- Trigger Event: release

File details

Details for the file visionkit_pro-0.1.2-py3-none-any.whl.

File metadata

Download URL: visionkit_pro-0.1.2-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 18.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for visionkit_pro-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b8cb641bffd247654645a5cfd0cd8ab4b24cd80dfb131b340f841f77a9525875`
MD5	`25d6bf5876bb4dbeb14f1e960670ce83`
BLAKE2b-256	`6b5fceb3cc5889b6e9deb743455c4e29eace3e1008c7f5e50e1af47055eb7d22`

See more details on using hashes here.

Provenance

The following attestation bundles were made for visionkit_pro-0.1.2-py3-none-any.whl:

Publisher: python-publish.yml on mizomizo1/visionkit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: visionkit_pro-0.1.2-py3-none-any.whl
- Subject digest: b8cb641bffd247654645a5cfd0cd8ab4b24cd80dfb131b340f841f77a9525875
- Sigstore transparency entry: 1932409688
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: mizomizo1/visionkit@45b39e5181fff92e4fb908ec5804db44c973b92c
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/mizomizo1
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@45b39e5181fff92e4fb908ec5804db44c973b92c
- Trigger Event: release

visionkit-pro 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

visionkit

Quick Start

Required CSV

Image Paths

Classes And Positive Class

Outputs

Common Commands

Python API

Install

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance