Skip to main content

An open-source tool for creating image segmentation datasets from videos using SAM 2

Project description

ViVa-DataCreator: An Open-Source Human-in-the-Loop Data Annotation Engine for Semantic Segmentation

PyPI version License: MIT Python versions

ViVa-DataCreator Logo

ViVa-DataCreator is an open-source tool for creating semantic segmentation datasets by tracking objects of interest from videos. It leverages the Segment Anything Model 2 (SAM2) and You Only Look Once (YOLO) AI models to perform segmentation and object detection, guiding users through an 8-step process to generate complete datasets ready for model training.

ViVa-DataCreator GUI
ViVa-DataCreator: Graphical User Interface for Dataset Creation

This tool focuses on generating semantic segmentation datasets through object tracking, utilizing SAM 2 to enhance segmentation accuracy.

Key Features

  • Video-to-Dataset Conversion: Transform videos into high-quality segmentation datasets with minimal manual effort.
  • SAM 2 Integration: Utilize the latest Segment Anything Model 2 for accurate and interactive segmentation.
  • 8-Step Flexible Process: A comprehensive workflow that guides you through dataset creation, allowing you to move between steps as needed.
  • Interactive Refinement: Manually refine segmentations for precision and quality control.
  • Object Tracking Integration: Utilize YOLO and DeepSort for tracking objects of interest across video frames.
  • Batch Processing: Efficiently handle large videos through configurable batch processing.
  • Customizable Classes: Define and assign custom object classes with unique colors.
  • Safety-Focused: Designed for safe and reliable dataset generation without hardware risks.

Documentation

For detailed usage instructions, examples, and API documentation, please refer to the ViVa-DataCreator Documentation.

Citation

If you use ViVa-DataCreator in your research, please consider adding the following citations:

ViVa-DataCreator

@software{soriano2025datacreator,
  author = {Miguel Soriano-García, Diego Mercado-Ravell, Israel Becerra and Julio De La Torre-Vanegas},
  title = {ViVa-DataCreator: An Open-Source Human-in-the-Loop Data Annotation Engine for Semantic Segmentation},
  year = {2025},
  url = {https://github.com/viva-safeland/viva_datacreator}
}

ViVa-SAFELAND Simulator

@article{soriano2025viva,
  title={ViVa-SAFELAND: a New Freeware for Safe Validation of Vision-based Navigation in Aerial Vehicles},
  author={Miguel S. Soriano-Garcia and Diego A. Mercado-Ravell},
  journal={arXiv preprint arXiv:2503.14719},
  year={2024}
}

Related Application

@misc{delatorre2025riskaware,
      title={Vision-Based Risk Aware Emergency Landing for UAVs in Complex Urban Environments}, 
      author={Julio de la Torre-Vanegas and Miguel Soriano-Garcia and Israel Becerra and Diego Mercado-Ravell},
      year={2025},
      eprint={2505.20423},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2505.20423}, 
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

viva_datacreator-0.0.6.tar.gz (65.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

viva_datacreator-0.0.6-py3-none-any.whl (97.4 kB view details)

Uploaded Python 3

File details

Details for the file viva_datacreator-0.0.6.tar.gz.

File metadata

  • Download URL: viva_datacreator-0.0.6.tar.gz
  • Upload date:
  • Size: 65.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for viva_datacreator-0.0.6.tar.gz
Algorithm Hash digest
SHA256 c99ba987b4768b346c75aef858fc096a2b3b39aec6c9580525ec8e91e10c2d3a
MD5 ecbcb1564e39261cbabe2244e81a7de8
BLAKE2b-256 64fa430425a444d3aa1276be33199cbe124d1862f11335463dff2df4a13a9b72

See more details on using hashes here.

File details

Details for the file viva_datacreator-0.0.6-py3-none-any.whl.

File metadata

File hashes

Hashes for viva_datacreator-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 734c3b09d5655f03768b82c987d451780c2b23c2ab3f40a24a3490e2c6bcee26
MD5 658eaadbb49959dabb545fad033ed35d
BLAKE2b-256 31be9cdba1d54b6d952b6fff11469a2ed8cfeb3763c5d4ac9f5da7108ba5c035

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page