Fast almost fully automated image annotation tool for computer vision tasks detection, oriented bounding boxes and segmentation.

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

safouaneelg

These details have not been verified by PyPI

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

VisioFirm

VisioFirm: Fast Almost fully-Automated Image Annotation for Computer Vision

[!IMPORTANT] VisioFirm v1 is now available. VisioFirm has now much more support for computer vision annotation, pushing further the boundaries of efficient, fast, and accurate annotation. Here's what’s New in v1.0.3 ✨

Classification and Preannotation: Predict and pre-suggest image classes using OpenAI CLIP pretrained model, enabling near-automatic labeling.

Video Support & Label Propagation: New VFTracker auto-labeling with frame-to-frame propagation: choose between: (1) SmartPropagator – Leverages SAM2 + pre/post processing for accurate, cumulative tracking. Annotate the first frame, propagate across the sequence. (2) OpenCV Trackers – Full support (CSRT, KCF, Boosting, MIL, TLD, MedianFlow, MOSSE, GOTURN) and (3) Interpolation – Classic propagation between [labeled_start] and [labeled_end].

Ultralytics Model Support: Works with YOLOv12 → YOLOv5, including YOLOv8-world for open-vocab pre-annotation.

Cross-domain annotation: use detection models to pre-generate segmentation masks, or segmentation models to pre-label bounding boxes.

Memory Management Improvements: Optimized GPU usage with better model load/unload behavior for large-scale pre-annotation and tracking.

Backend Migration to FastAPI: Faster performance, async support, and smoother UI interactions.

Python API: Integrate VisioFirm seamlessly into pipelines with the new visiofirm Python API.

VisioFirm is an open-source, AI-powered image annotation tool designed to accelerate labeling for computer vision tasks like classification, object detection, oriented bounding boxes (OBB), segmentation and video annotation. Built for speed and simplicity, it leverages state-of-the-art models for semi-automated pre-annotations, allowing you to focus on refining rather than starting from scratch. Whether you're preparing datasets for YOLO, SAM, or custom models, VisioFirm streamlines your workflow with a intuitive web interface and powerful backend.

Perfect for researchers, data scientists, and ML engineers handling large image datasets—get high-quality annotations in minutes, not hours!

Why VisioFirm?

VisioFirm is majoraly focused on AI-model integration easiness for fast CV tasks annotation.

AI-Driven Pre-Annotation: Automatically detect and segment objects using YOLO, SAM2, and Grounding DINO—saving up to 80% of manual effort.
Multi-Task Support: Handles classification, bounding boxes, oriented bounding boxes, and polygon segmentation and now even videos in one tool.
Browser-Based Editing: Interactive canvas for precise adjustments, with real-time SAM-powered segmentation in the browser.
Offline-Friendly: Models download automatically (or pre-fetch for offline use), with SQLite backend for local projects.
Extensible & Open-Source: Customize with your own ultralytics models or integrate into pipelines—contributions welcome!
SAM2-base webgpu: Insta-drawing of annotations via SAM2 with worker offloading and auto-annotation for faster computing!

Annotation Editing Demo

Features

Semi-Automated Labeling Kickstart annotations with AI models like YOLO (v5–v12) for detection, SAM2 for segmentation, Grounding DINO for zero-shot object grounding, and CLIP for automated classification.
Flexible Annotation Types
- Axis-aligned bounding boxes for standard detection.
- Oriented bounding boxes for rotated objects (e.g., aerial imagery).
- Polygon segmentation for precise boundaries.
- Image classification with automatic label suggestions.
Video Annotation & Label Propagation Annotate videos with frame-to-frame consistency:
- SmartPropagator (SAM2-powered accurate propagation).
- OpenCV trackers (CSRT, KCF, Boosting, MIL, TLD, MedianFlow, MOSSE, GOTURN).
- Interpolation between annotated start/end frames.
Cross-Domain Annotation
- Use detection models to auto-generate segmentation masks.
- Use segmentation models to pre-label bounding boxes.
Ultralytics Model Support Full support for YOLOv12, v11, v10, v9, v8, v5, plus YOLOv8-world for open-vocab pre-annotations (no GPU required).
Interactive Frontend Draw, edit, and refine labels on a responsive canvas.
- Click-to-segment with browser-based SAM2.
- Hotkeys, undo/redo, and zoom for efficient annotation.
Project Management Organize datasets with SQLite-backed projects.
- Multi-class support.
- Import/export with minimal setup.
Export Formats Export annotations to YOLO, COCO, or custom formats for seamless training.
Performance Optimizations
- GPU memory management for efficient model loading/unloading.
- Cluster overlapping detections, simplify contours, and filter by confidence.
- Multi-threaded uploading and optimized image import.
Cloud/SSH Integration Download images from cloud storage or SSH servers, save annotations remotely, and manage large-scale projects.
Backend Migration to FastAPI Faster response times, async support, and smoother UI performance.
VisioFirm Python API Integrate annotation workflows into custom scripts and ML pipelines.

DEMOs

Detection based on pre-trained/zeroshot models:

Annotation Editing Demo

Video Segmentation using Smart Propagator:

https://github.com/user-attachments/assets/c5caa227-a9bb-4ff3-a11a-688067fb58ae

Installation

VisioFirm was tested with Python 3.10+.

[!NOTE] VisioFirm v1 introduces a new database management logic.
To avoid conflicts with older versions, you need to rename/remove the old cache folder before running the new release:

Linux: ~/.cache/visiofirm_cache

macOS: ~/Library/Caches/visiofirm_cache

Windows: %LOCALAPPDATA%\visiofirm_cache

After deleting the folder, restart VisioFirm — it will automatically recreate the cache directory with the new structure.

pip install -U visiofirm

For development or editable install (from a cloned repo):

git clone https://github.com/OschAI/VisioFirm.git
cd VisioFirm
pip install -e .

Quick Start

Launch VisioFirm with a single command—it auto-starts a local web server and opens in your browser.

visiofirm

Create a new project and upload images.
Define classes (e.g., "car", "person").
For easy-to-detect object run AI pre-annotation (select model: YOLO, Grounding DINO).
Refine labels in the interactive editor.
Export your annotated dataset.

The VisioFirm app uses cache directories to store settings locally.

Usage

Pre-Annotation with AI

VisioFirm uses advanced models for initial labels:

YOLO: All ultralytics based YOLO model are now compatible and can be used.
SAM2: Precise segmentation use in image annotation and video propagation
Grounding DINO: Zero-shot detection via text prompts.

Community & Support

Issues: Report bugs or request features here.
Discord: Coming soon—star the repo for updates!
Roadmap: Multi-user support, custom model integration.

License

Apache 2.0 - See LICENSE for details.

This project uses third-party software and models:

Ultralytics YOLO
https://github.com/ultralytics/ultralytics
License: AGPL-3.0
SAM2 (Segment Anything Model v2)
https://github.com/facebookresearch/sam2
Licenses: Apache 2.0 and BSD 3-Clause
GroundingDINO
https://github.com/IDEA-Research/GroundingDINO
License: Apache 2.0

Built by Safouane El Ghazouali for the research community. Star the repo if it helps your workflow! 🚀

Citation

@misc{ghazouali2025visiofirm,
    title={VisioFirm: Cross-Platform AI-assisted Annotation Tool for Computer Vision},
    author={Safouane El Ghazouali and Umberto Michelucci},
    year={2025},
    eprint={2509.04180},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

SOON:

Documentation website
Discord community

Project details

These details have been verified by PyPI

Project links

Homepage

GitHub Statistics

Maintainers

safouaneelg

These details have not been verified by PyPI

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.3.1

Mar 15, 2026

1.3

Mar 14, 2026

1.2.1

Mar 14, 2026

1.2

Feb 25, 2026

1.1.1

Oct 3, 2025

1.1.0

Sep 28, 2025

This version

1.0.3

Sep 24, 2025

1.0.2

Sep 23, 2025

1.0.1

Sep 22, 2025

1.0.0

Sep 19, 2025

0.2.0

Sep 7, 2025

0.1.4

Sep 6, 2025

0.1.3

Sep 5, 2025

0.1.2

Sep 3, 2025

0.1.1

Sep 1, 2025

0.1.0

Sep 1, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

visiofirm-1.0.3.tar.gz (24.4 MB view details)

Uploaded Sep 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

visiofirm-1.0.3-py3-none-any.whl (24.4 MB view details)

Uploaded Sep 24, 2025 Python 3

File details

Details for the file visiofirm-1.0.3.tar.gz.

File metadata

Download URL: visiofirm-1.0.3.tar.gz
Upload date: Sep 24, 2025
Size: 24.4 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for visiofirm-1.0.3.tar.gz
Algorithm	Hash digest
SHA256	`f48d47b189f1273a20d24c778e91934f5662fc197dc5552fd76a4dac240bb18d`
MD5	`0c4ef7d35e5e41454ac505dcd284eb6b`
BLAKE2b-256	`ba58d909eaa770a426b1ffb774b0715792394048dc0098570450433c4fe690ad`

See more details on using hashes here.

Provenance

The following attestation bundles were made for visiofirm-1.0.3.tar.gz:

Publisher: publish.yml on OschAI/VisioFirm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: visiofirm-1.0.3.tar.gz
- Subject digest: f48d47b189f1273a20d24c778e91934f5662fc197dc5552fd76a4dac240bb18d
- Sigstore transparency entry: 554614783
- Sigstore integration time: Sep 24, 2025
Source repository:
- Permalink: OschAI/VisioFirm@5bd41f829efe7799da39473ee5f1a7220b4f0e18
- Branch / Tag: refs/tags/v1.0.3
- Owner: https://github.com/OschAI
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5bd41f829efe7799da39473ee5f1a7220b4f0e18
- Trigger Event: release

File details

Details for the file visiofirm-1.0.3-py3-none-any.whl.

File metadata

Download URL: visiofirm-1.0.3-py3-none-any.whl
Upload date: Sep 24, 2025
Size: 24.4 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for visiofirm-1.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`51b41dd60e03400dba57b3a55d440b3240e0a314bcc839a5854d373b85d9abeb`
MD5	`bd06505a00c94e4fefd9a4e05c4ca918`
BLAKE2b-256	`6d97be54430e7afbe6081fb72b25f1df0624f4de8c768964a8ef669fdeee8008`

See more details on using hashes here.

Provenance

The following attestation bundles were made for visiofirm-1.0.3-py3-none-any.whl:

Publisher: publish.yml on OschAI/VisioFirm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: visiofirm-1.0.3-py3-none-any.whl
- Subject digest: 51b41dd60e03400dba57b3a55d440b3240e0a314bcc839a5854d373b85d9abeb
- Sigstore transparency entry: 554614810
- Sigstore integration time: Sep 24, 2025
Source repository:
- Permalink: OschAI/VisioFirm@5bd41f829efe7799da39473ee5f1a7220b4f0e18
- Branch / Tag: refs/tags/v1.0.3
- Owner: https://github.com/OschAI
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5bd41f829efe7799da39473ee5f1a7220b4f0e18
- Trigger Event: release

visiofirm 1.0.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

VisioFirm: Fast Almost fully-Automated Image Annotation for Computer Vision

Why VisioFirm?

Features

DEMOs

Installation

Quick Start

Usage

Pre-Annotation with AI

Community & Support

License

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance