Skip to main content

Utilities for the ORena SAVE FOCUS challenge: Foreign Object Contextual Understanding for Safe Surgical AI

Project description

orena-focus

Tests PyPI Python License: MIT Data: CC BY-NC-SA 4.0 MICCAI 2026 Dataset


Python utilities for the FOCUS datasets and challengeForeign Object Contextual Understanding for Surgery.

The library provides dataset loaders, preprocessing pipelines, answer-format handling, and an evaluation framework for working with the FOCUS surgical VQA datasets. It can be used independently for research on foreign-object understanding in minimally invasive surgery, and also serves as the official toolkit for the ORena SAVE FOCUS challenge at MICCAI 2026.

Challenge soon open for registration. Submit your results and compete on the leaderboard at or-arena.org.

Retained foreign objects are a life-threatening and preventable surgical complication. FOCUS benchmarks vision-language models on clinically relevant VQA tasks around detecting, counting, and reasoning about foreign objects in endoscopic video.

Tracks

FOCUS offers three participation tracks, each requiring a different type of visual context:

Track Track enum Visual input Description
FRAME Track.FRAME Single frame Answer questions from one extracted video frame. The simplest entry point — no temporal modelling required.
SEGMENT Track.SEGMENT <= 5min clip Answer questions from a multi-second video segment surrounding the relevant event. Requires understanding of motion and temporal context.
PROCEDURE Track.PROCEDURE Up to full video Answer questions that may require reasoning over an entire surgical procedure, including events that happened well before or after the queried moment.

Participants may enter any subset of tracks. Each track is evaluated independently with the same hierarchical capability taxonomy.

Installation

pip install orena-focus

Quick start

from focus import FocusDataset, DatasetSplit, Track

ds = FocusDataset("heico", DatasetSplit.TEST, Track.SEGMENT)

request, reference = ds[0]
print(request.question)        # "How many sponges are visible?"
print(reference.answer)        # "2"
print(reference.format.type)   # "number"

Data preparation

Download, preprocess, and split the dataset in one script — see examples/data_preparation.py for the full walkthrough.

from focus import download
from focus.preprocessing import VideoTimestampOverlayPreprocessor, FrameExtractorPreprocessor

download("heico")

VideoTimestampOverlayPreprocessor().process(dataset="heico")
FrameExtractorPreprocessor(stride=1).process(dataset="heico")

QA annotations are fetched automatically from HuggingFace when you construct a FocusDataset.

Inference & evaluation

See examples/inference.py for an end-to-end example with Qwen3-VL.

from focus import Evaluator, Response

responses = [Response(qID=req.qID, content=my_model(req)) for req, _ in ds]

results_df, summary_df = Evaluator().run(
    requests=ds.requests,
    references=ds.references,
    responses=responses,
)
print(summary_df)

Capability taxonomy

Five capability groups, each composed of leaf capabilities assigned to questions.

SAVE FOCUS capability taxonomy with example questions

# Group Leaf capabilities
1 Object Recognition Identification, Instance Matching, Attributes, Spatial (camera), Spatial (situs)
2 Temporal Grounding Temporal Localization, Duration Estimation
3 Aggregation Object Aggregation, Event Aggregation
4 Event & Procedural Understanding FO Interaction Recognition, FO Usage Purpose, Temporal Ordering
5 Complex Reasoning Functional Reasoning, Causal & Consequence Reasoning, Multi-step Reasoning

Answer formats

Format Accepts Returns
Binary "yes" / "no" bool
Number Non-negative integer strings int
Percentage Numeric percentage strings float
FOClass Registered FO class names str
OpenEnded Free text (≤ 300 chars) str
Matching Regex-validated text str
MultipleChoice One of predefined options str
Time hh:mm:ss timestamps timedelta

Dataset

The QA annotations are publicly available on HuggingFace: orena-dkfz/heico-focus-vqa.

The FOCUS challenge is built on the HeiCo dataset. If you use this data, please cite the original publication:

Maier-Hein, L., et al. (2021). Heidelberg colorectal data set for surgical data science in the sensor operating room. https://doi.org/10.1038/s41597-021-00882-2

The HeiCo data is released under CC BY-NC-SA 4.0 — non-commercial use only, with attribution and share-alike conditions.

License

The code is licensed under the permissive MIT license. The underlying data is licensed independently, see Dataset for the data license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

orena_focus-0.1.1.tar.gz (1.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

orena_focus-0.1.1-py3-none-any.whl (1.8 MB view details)

Uploaded Python 3

File details

Details for the file orena_focus-0.1.1.tar.gz.

File metadata

  • Download URL: orena_focus-0.1.1.tar.gz
  • Upload date:
  • Size: 1.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for orena_focus-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c519ccd2d20c76e1ced696fc647ac64698328d05ec23cd2174c9566b0c3752e2
MD5 64da74e1f458e3bc0c1482226984fccf
BLAKE2b-256 62426f80074d8ba99e431375e96341c424bfe15f23ec11dfe08b0f614855019e

See more details on using hashes here.

Provenance

The following attestation bundles were made for orena_focus-0.1.1.tar.gz:

Publisher: release.yml on IMSY-DKFZ/orena-focus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file orena_focus-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: orena_focus-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 1.8 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for orena_focus-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3bb1cd307d99be54f1cac5b307fd4f2d96a98fd79a9504fefdb01c9185726703
MD5 bf48d9450766c7a2171e1651b608bcb9
BLAKE2b-256 c00d1af55238f9f8e7fd6ec30e046077abac4658ba7f28443bff241031f3cd6e

See more details on using hashes here.

Provenance

The following attestation bundles were made for orena_focus-0.1.1-py3-none-any.whl:

Publisher: release.yml on IMSY-DKFZ/orena-focus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page