A Python library for every Computer Vision Engineer
Project description
CvPal ๐ผ๏ธ
๐ Introduction
Welcome to CvPal - Your Ultimate Computer Vision Companion! ๐
CvPal is a cutting-edge Python package designed to empower Computer Vision engineers worldwide. Our mission is to streamline image operations, dataset management, and synthetic data generation, allowing you to focus on what truly matters: building and optimizing your machine learning pipelines.
๐ Proudly Sponsored by VisionFullSpace ๐
๐ฏ Features
- ๐ผ๏ธ Synthetic Image Generation: Create custom datasets using text prompts
- ๐ Dataset Merging: Seamlessly combine datasets with multiple attributes
- ๐๏ธ Label Management: Effortlessly remove or replace specific labels from your dataset
- ๐ Label Occurrence Counting: Accurately track label frequencies
- ๐ Dataset Reporting: Generate comprehensive reports on your dataset
- ๐ Easy Integration: Smoothly integrate with existing ML pipelines
- ๐ Comprehensive Documentation: Detailed guides for all features
- ๐ค AI-Powered Dataset Creation: Generate entire datasets with just a prompt
- ๐ผ Pandas Integration: Group images and labels into a pandas DataFrame for advanced manipulation
๐ Dataset Folder Structure
For optimal performance, please structure your dataset folder as follows:
folder/
โโโ train/
โ โโโ images/
โ โโโ labels/
โโโ test/
โ โโโ images/
โ โโโ labels/
โโโ valid/
โโโ images/
โโโ labels/
For TXT format datasets, include a data.yaml
config file:
folder/
โโโ data.yaml
Example data.yaml
:
names:
- Old_Paper
- Rock
- Scissors
nc: 3
roboflow:
license: Private
project: rock-paper-scissors-sxsw
url: https://universe.roboflow.com/roboflow-58fyf/rock-paper-scissors-sxsw/dataset/14
version: 14
workspace: roboflow-58fyf
test: ../test/images
train: Rock-Paper-Scissors-SXSW-14/train/images
val: Rock-Paper-Scissors-SXSW-14/valid/images
๐ Installation
Install CvPal with a simple pip command:
pip install cvpal==1.0.2
๐ง Example Usage
from cvpal.generate import DetectionDataset
from cvpal.preprocessing import ImagesDetection
# Generate synthetic data
detection_dataset = DetectionDataset(model="stable-diffusion")
detection_dataset.generate("a cat looking at the camera", num_images=2, labels=["cat"], output_type="yolo", overwrite=False)
# Preprocess existing dataset
cp = ImagesDetection()
cp.read_data("/content/Rock-Paper-Scissors-SXSW-14", data_type="txt")
cp.report()
๐ Supported Models and Formats
Model Name | Supported Format | Support in Package | Detection | Segmentation |
---|---|---|---|---|
YOLOv5-v11 | TXT & YAML config | โ | โ | โ |
YOLOv3-v4 | Darknet TXT | โ | โ | โ |
EfficientDet | Pytorch JSON annotations | โ | โ | โ |
Detectron 2 | JSON annotations | โ | โ | โ |
Segmentation Models | XML format | โ | โ | โ |
TensorFlow Object Detection | Binary format | โ | โ | โ |
Fine-tune PaliGemma | JSONL annotations | โ | โ | โ |
Apple's CreateML | Proprietary JSON format | โ | โ | โ |
Turi Create tools | Proprietary JSON format | โ | โ | โ |
๐ค Call for Contributions
Join the CvPal community and make a global impact! We welcome contributions of all sizes:
- ๐ Bug fixes and enhancements
- ๐ Documentation improvements
- ๐จ UI/UX enhancements
- ๐งช New feature development
To contribute major changes, please reach out through our mailing list first.
Other ways to contribute:
- ๐ Help triage issues
- ๐ Create tutorials and presentations
- ๐ต๏ธ Review pull requests
๐ License
This project is licensed under the MIT License - see the LICENSE file for details.
Empowering Computer Vision Worldwide
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.