YOLO-based detection of bright infrared marker balls in uint16 frames.

Project description

IRYoloTracker

IRYoloTracker detects bright circular infrared marker balls in single-channel 512x512 numpy.uint16 frames using a single-class YOLO model.

The public API accepts grayscale infrared intensity images only. If you use standard YOLO weights, the normalized gray image is copied into three identical channels internally because standard Ultralytics YOLO models expect 3-channel input. This does not add color information. If you train a true one-channel YOLO model, pass model_input_channels=1.

Install

Use the virtual environment in this repository:

.\venv\Scripts\python.exe -m pip install -e ".[dev]"

For GPU inference on Windows, install the CUDA PyTorch wheels in the same venv:

.\venv\Scripts\python.exe -m pip install --force-reinstall torch==2.12.0+cu130 torchvision==0.27.0+cu130 --index-url https://download.pytorch.org/whl/cu130

Python API

The package includes a bootstrap YOLO model trained from high-confidence bright-circle pseudo-labels in infrared_data. You can use it directly:

import numpy as np

from ir_yolo_tracker import IRMarkerTracker

frame: np.ndarray = np.load("frame.npy")

tracker = IRMarkerTracker(confidence_threshold=0.35, device="cpu")

detections = tracker.detect_dicts(frame)
print(detections)

For one-off detection, use the convenience function:

import numpy as np

from ir_yolo_tracker import detect_marker_dicts

frame: np.ndarray = np.load("frame.npy")
detections = detect_marker_dicts(frame, confidence_threshold=0.35, device="cpu")

For repeated frames, create one tracker and reuse it:

from ir_yolo_tracker import IRMarkerTracker, preload_pickle_frames

tracker = IRMarkerTracker(device="cpu")
frames = preload_pickle_frames("infrared_data", progress=True)

for path, frame in frames:
    detections = tracker.detect(frame)
    print(path.name, detections)

Batch-style helpers are also available:

from ir_yolo_tracker import detect_marker_batch, iter_pickle_detections

all_results = detect_marker_batch(frames, device="cpu")

for path, detections in iter_pickle_detections("infrared_data", device="cpu"):
    ...

To draw results on a frame:

import cv2

from ir_yolo_tracker import draw_detections

image = draw_detections(frame, detections, status="IR marker detections")
cv2.imshow("detections", image)
cv2.waitKey(0)

Pass device="cuda" or device=0 to request GPU inference. If device is not set, Ultralytics chooses the device.

The public API includes:

IRMarkerTracker: reusable YOLO detector class.
detect_markers, detect_marker_dicts: one-frame convenience functions.
detect_marker_batch, iter_marker_detections: multi-frame helpers.
load_pickle_frame, iter_pickle_frames, iter_pickle_detections: pickle sequence helpers.
draw_detections: OpenCV visualization helper.
BrightCircleDetector: non-YOLO bright-circle detector for pseudo-labeling/debugging.

For production accuracy, train with human-verified labels and pass your own best.pt:

tracker = IRMarkerTracker("runs/detect/ir_marker_ball/weights/best.pt", device="cpu")

Each detection contains only the marker-ball class:

[
    {
        "bbox_xyxy": [123.4, 205.1, 145.8, 228.0],
        "confidence": 0.91,
        "class_id": 0,
        "class_name": "ir_marker_ball",
        "center_xy": [134.6, 216.55],
    }
]

The input frame must be exactly:

shape: (512, 512)
dtype: numpy.uint16
channels: one intensity channel, not RGB/BGR

Command-Line Inference

.\venv\Scripts\python.exe scripts\detect_npy.py frame.npy

Infrared Video Example

The root-level example example_infrared_video.py plays .pickle frames from infrared_data at 24 FPS and overlays marker-ball boxes and confidences. This file is a repository example, not an installed package entry point. To use it, clone the GitHub project and run it from the project root.

git clone https://github.com/GGN-2015/IRYoloTracker.git
cd IRYoloTracker
.\venv\Scripts\python.exe -m pip install -e ".[dev]"

Run with the bundled bootstrap YOLO model:

.\venv\Scripts\python.exe example_infrared_video.py

Run with custom trained weights:

.\venv\Scripts\python.exe example_infrared_video.py --weights runs\detect\ir_marker_ball\weights\best.pt

Common options:

.\venv\Scripts\python.exe example_infrared_video.py --device cuda
.\venv\Scripts\python.exe example_infrared_video.py --device cpu
.\venv\Scripts\python.exe example_infrared_video.py --fps 24 --scale 1.5
.\venv\Scripts\python.exe example_infrared_video.py --lazy-load
.\venv\Scripts\python.exe example_infrared_video.py --loop
.\venv\Scripts\python.exe example_infrared_video.py --no-rescue-bright-circles

At startup the example preloads all pickle frames into memory and shows a tqdm progress bar. This removes disk IO and pickle decoding from the playback loop. Use --lazy-load to read frames during playback instead.

The example defaults to --conf 0.25 and enables a conservative YOLO+circle rescue pass. YOLO still supplies the main predictions, but the example also adds high-confidence bright circular blobs that do not overlap an existing YOLO box. This helps recover marker balls that the bootstrap YOLO model sees only at very low confidence. Use --no-rescue-bright-circles to view raw YOLO output.

There is no default marker-count cap. Use --max-detections N only when your camera setup has a known physical upper bound and you explicitly want to keep the top-scoring detections.

If no YOLO weights are found, the example falls back to a simple bright-circle preview detector so you can still inspect the infrared frame sequence. In a normal package checkout this should not happen because a bootstrap model is bundled. Pass --require-yolo if you want it to fail unless YOLO weights are present.

Press Space to pause/resume, or press Q/Esc to quit.

Bootstrap Model

The bundled model was trained from pseudo-labels generated by the preview bright-circle detector with confidence >= 0.70:

.\venv\Scripts\python.exe scripts\bootstrap_pseudo_dataset.py --confidence 0.70
.\venv\Scripts\python.exe scripts\train.py --data configs\pseudo_ir_marker_dataset.yaml --model yolo11n.pt --epochs 12 --imgsz 512 --batch 8 --device cpu --project runs\detect --name pseudo_ir_marker_bootstrap

This is useful as a project bootstrap. Replace it with a model trained on human-reviewed marker-ball labels before relying on it for critical measurement.

The video example can also rebuild a lower-threshold YOLO pseudo dataset from infrared_data. It uses the current inference model plus the same conservative bright-circle rescue pass. Low-confidence YOLO boxes below --pseudo-yolo-accept-conf are kept only when bright-circle evidence confirms the same location:

.\venv\Scripts\python.exe example_infrared_video.py --rebuild-yolo-pseudo-dataset --device cuda --pseudo-conf 0.05

By default this writes images and labels to datasets/yolo_low_conf_ir_marker_ball and writes configs/yolo_low_conf_ir_marker_dataset.yaml.

Training Data

Prepare a YOLO detection dataset with one class:

datasets/ir_marker_ball/
  images/train/*.png
  images/val/*.png
  labels/train/*.txt
  labels/val/*.txt

Each label row must use normalized YOLO format:

0 x_center y_center width height

The class id must always be 0, named ir_marker_ball. Do not include labels for non-marker objects.

If your annotations are stored with pixel-coordinate boxes, you can build the dataset from .npy frames:

.\venv\Scripts\python.exe scripts\build_dataset.py --annotations annotations.json --frames-root frames --output datasets\ir_marker_ball

annotations.json can be either a list or an object containing frames:

[
  {
    "image": "frame_0001.npy",
    "boxes": [[120, 210, 145, 235]]
  }
]

Training

The default dataset config is configs/ir_marker_dataset.yaml.

.\venv\Scripts\python.exe scripts\train.py --data configs\ir_marker_dataset.yaml --model yolo11n.pt --epochs 100 --imgsz 512

The detector also passes classes=[0] during inference and filters class 0 again after inference, so the returned output contains marker balls only.

True One-Channel Model

For most workflows, standard YOLO training with grayscale data copied into three equal channels is the simplest path. To use a custom YOLO model whose first layer accepts one channel, create/train that model separately and run:

tracker = IRMarkerTracker(
    "path/to/one_channel_best.pt",
    model_input_channels=1,
)

Tests

.\venv\Scripts\python.exe -m pytest

Project details

Release history Release notifications | RSS feed

0.1.2

Jun 11, 2026

0.1.1

Jun 7, 2026

This version

0.1.0

Jun 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ir_yolo_tracker-0.1.0.tar.gz (4.9 MB view details)

Uploaded Jun 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ir_yolo_tracker-0.1.0-py3-none-any.whl (4.9 MB view details)

Uploaded Jun 7, 2026 Python 3

File details

Details for the file ir_yolo_tracker-0.1.0.tar.gz.

File metadata

Download URL: ir_yolo_tracker-0.1.0.tar.gz
Upload date: Jun 7, 2026
Size: 4.9 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.4.1 CPython/3.11.15 Windows/10

File hashes

Hashes for ir_yolo_tracker-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`8027e7d37553e49d3eb1d78115365255b22cfd155364a2e333350f4fb05e9a2d`
MD5	`da3df4cba96906d28059d356353ad963`
BLAKE2b-256	`21663ba89a771516e7da57bdf350b63e5098c3774e2bb0a52dca63c88bad1494`

See more details on using hashes here.

File details

Details for the file ir_yolo_tracker-0.1.0-py3-none-any.whl.

File metadata

Download URL: ir_yolo_tracker-0.1.0-py3-none-any.whl
Upload date: Jun 7, 2026
Size: 4.9 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.4.1 CPython/3.11.15 Windows/10

File hashes

Hashes for ir_yolo_tracker-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1e2a3ec6412f3506d3e30d6f860a7a556f507a41f0745c8df862a81d6be7c4d4`
MD5	`5d4391dadf11eca81fe7790e21b54583`
BLAKE2b-256	`c0f56fb8b7d18baf11a0e23a21c65d06e1069480bc00042a4cea51a3e67f3b24`

See more details on using hashes here.

ir-yolo-tracker 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

IRYoloTracker

Install

Python API

Command-Line Inference

Infrared Video Example

Bootstrap Model

Training Data

Training

True One-Channel Model

Tests

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes