An open-source tool for creating image segmentation datasets from videos using SAM 2

ViVa-DataCreator: An Open-Source Human-in-the-Loop Data Annotation Engine for Semantic Segmentation

License: MIT

ViVa-DataCreator is an open-source tool for creating semantic segmentation datasets by tracking objects of interest in videos. It uses the Segment Anything Model 2 (SAM 2) for segmentation and You Only Look Once (YOLO) for object detection, and guides users through an 8-step process that produces complete datasets ready for model training.

ViVa-DataCreator: Graphical User Interface for Dataset Creation

This tool focuses on generating semantic segmentation datasets through object tracking, utilizing SAM 2 to enhance segmentation accuracy.

Key Features

  • Video-to-Dataset Conversion: Transform videos into high-quality segmentation datasets with minimal manual effort.
  • SAM 2 Integration: Utilize the latest Segment Anything Model 2 for accurate and interactive segmentation.
  • 8-Step Flexible Process: A comprehensive workflow that guides you through dataset creation, allowing you to move between steps as needed.
  • Interactive Refinement: Manually refine segmentations for precision and quality control.
  • Object Tracking Integration: Use YOLO and DeepSORT to track objects of interest across video frames.
  • Batch Processing: Efficiently handle large videos through configurable batch processing.
  • Customizable Classes: Define and assign custom object classes with unique colors.
  • Safety-Focused: Designed for safe and reliable dataset generation without hardware risks.
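
The batch processing and customizable class features listed above can be pictured with a small standalone sketch. This is not ViVa-DataCreator's API: the function names `frame_batches` and `class_palette` are hypothetical, and the even hue spacing is only one plausible way to give each class a unique color. It uses only the Python standard library.

```python
from colorsys import hsv_to_rgb
from typing import Iterator


def frame_batches(total_frames: int, batch_size: int) -> Iterator[range]:
    """Yield consecutive ranges of frame indices, each at most batch_size long.

    Hypothetical helper: illustrates configurable batching of a long video,
    not the tool's actual implementation.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, total_frames, batch_size):
        yield range(start, min(start + batch_size, total_frames))


def class_palette(class_names: list[str]) -> dict[str, tuple[int, int, int]]:
    """Map each class name to an RGB color by spacing hues evenly.

    Hypothetical helper: assumes class names are unique.
    """
    n = len(class_names)
    palette = {}
    for i, name in enumerate(class_names):
        # Full saturation and value; only the hue varies between classes.
        r, g, b = hsv_to_rgb(i / n, 1.0, 1.0)
        palette[name] = (round(r * 255), round(g * 255), round(b * 255))
    return palette


if __name__ == "__main__":
    for batch in frame_batches(total_frames=10, batch_size=4):
        print(f"frames {batch.start}..{batch.stop - 1}")
    print(class_palette(["road", "car", "person"]))
```

With 10 frames and a batch size of 4, the generator yields three batches, the last one holding the remaining two frames. A smaller batch size lowers peak memory use at the cost of more passes.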

Documentation

For detailed usage instructions, examples, and API documentation, please refer to the ViVa-DataCreator Documentation.

Citation

If you use ViVa-DataCreator in your research, please consider adding the following citations:

ViVa-DataCreator

@software{soriano2025datacreator,
  author = {Miguel Soriano-García and Diego Mercado-Ravell and Israel Becerra and Julio De La Torre-Vanegas},
  title = {ViVa-DataCreator: An Open-Source Human-in-the-Loop Data Annotation Engine for Semantic Segmentation},
  year = {2025},
  url = {https://github.com/viva-safeland/viva_datacreator}
}

ViVa-SAFELAND Simulator

@article{soriano2025viva,
  title={ViVa-SAFELAND: a New Freeware for Safe Validation of Vision-based Navigation in Aerial Vehicles},
  author={Miguel S. Soriano-Garcia and Diego A. Mercado-Ravell},
  journal={arXiv preprint arXiv:2503.14719},
  year={2025}
}

Related Application

@misc{delatorre2025riskaware,
      title={Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments}, 
      author={Julio de la Torre-Vanegas and Miguel Soriano-Garcia and Israel Becerra and Diego Mercado-Ravell},
      year={2025},
      eprint={2505.20423},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2505.20423}, 
}
