OpenVINO™ Training Extensions: Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™
Project description
Introduction
OpenVINO™ Training Extensions is a low-code transfer learning framework for Computer Vision. The CLI commands of the framework allows users to train, infer, optimize and deploy models easily and quickly even with low expertise in the deep learning field. OpenVINO™ Training Extensions offers diverse combinations of model architectures, learning methods, and task types based on PyTorch and OpenVINO™ toolkit.
OpenVINO™ Training Extensions provides a "model template" for every supported task type, which consolidates necessary information to build a model. Model templates are validated on various datasets and serve one-stop shop for obtaining the best models in general. If you are an experienced user, you can configure your own model based on torchvision, pytorchcv, mmcv and OpenVINO Model Zoo (OMZ).
Furthermore, OpenVINO™ Training Extensions provides automatic configuration of task types and hyperparameters. The framework will identify the most suitable model template based on your dataset, and choose the best hyperparameter configuration. The development team is continuously extending functionalities to make training as simple as possible so that single CLI command can obtain accurate, efficient and robust models ready to be integrated into your project.
Key Features
OpenVINO™ Training Extensions supports the following computer vision tasks:
- Classification, including multi-class, multi-label and hierarchical image classification tasks.
- Object detection including rotated bounding box support
- Semantic segmentation
- Instance segmentation including tiling algorithm support
- Action recognition including action classification and detection
- Anomaly recognition tasks including anomaly classification, detection and segmentation
OpenVINO™ Training Extensions supports the following learning methods:
- Supervised, incremental training, which includes class incremental scenario and contrastive learning for classification and semantic segmentation tasks
- Semi-supervised learning
- Self-supervised learning
OpenVINO™ Training Extensions will provide the following features in coming releases:
- Distributed training to accelerate the training process when you have multiple GPUs
- Half-precision training to save GPUs memory and use larger batch sizes
- Integrated, efficient hyper-parameter optimization module (HPO). Through dataset proxy and built-in hyper-parameter optimizer, you can get much faster hyper-parameter optimization compared to other off-the-shelf tools. The hyperparameter optimization is dynamically scheduled based on your resource budget.
- OpenVINO™ Training Extensions uses Datumaro as the backend to hadle datasets. Thanks to that, OpenVINO™ Training Extensions supports the most common academic field dataset formats for each task. We constantly working to extend supported formats to give more freedom of datasets format choice.
- Auto-configuration functionality. OpenVINO™ Training Extensions analyzes provided dataset and selects the proper task and model template to provide the best accuracy/speed trade-off. It will also make a random auto-split of your dataset if there is no validation set provided.
Getting Started
Installation
Please refer to the installation guide.
Note: Python 3.8 and 3.9 were tested, along with Ubuntu 18.04 and 20.04.
OpenVINO™ Training Extensions CLI Commands
otx find
helps you quickly find the best pre-configured models templates as well as a list of supported backbonesotx build
creates the workspace folder with all necessary components to start training. It can help you configure your own model with any supported backbone and even prepare a custom split for your datasetotx train
actually starts training on your datasetotx eval
runs evaluation of your trained model in PyTorch or OpenVINO™ IR formatotx optimize
runs an optimization algorithm to quantize and prune your deep learning model with help of NNCF and POT tools.otx export
starts exporting your model to the OpenVINO™ IR formatotx deploy
outputs the exported model together with the self-contained python package, a demo application to port and infer it outside of this repository.otx demo
allows one to apply a trained model on the custom data or the online footage from a web camera and see how it will work in a real-life scenario.otx explain
runs explain algorithm on the provided data and outputs images with the saliency maps to show how your model makes predictions.
You can find more details with examples in the CLI command intro.
Updates
v1.4.0 (3Q23)
- Support encrypted dataset training (https://github.com/openvinotoolkit/training_extensions/pull/2209)
- Add custom max iou assigner to prevent CPU OOM when large annotations are used (https://github.com/openvinotoolkit/training_extensions/pull/2228)
- Auto train type detection for Semi-SL, Self-SL and Incremental: "--train-type" now is optional (https://github.com/openvinotoolkit/training_extensions/pull/2195)
- Add per-class XAI saliency maps for Mask R-CNN model (https://github.com/openvinotoolkit/training_extensions/pull/2227)
- Add new object detector Deformable DETR (https://github.com/openvinotoolkit/training_extensions/pull/2249)
- Add new object detector DINO (https://github.com/openvinotoolkit/training_extensions/pull/2266)
- Add new visual prompting task (https://github.com/openvinotoolkit/training_extensions/pull/2203, https://github.com/openvinotoolkit/training_extensions/pull/2274, https://github.com/openvinotoolkit/training_extensions/pull/2311, https://github.com/openvinotoolkit/training_extensions/pull/2354, https://github.com/openvinotoolkit/training_extensions/pull/2318)
- Add new object detector ResNeXt101-ATSS (https://github.com/openvinotoolkit/training_extensions/pull/2309)
Release History
Please refer to the CHANGELOG.md
Branches
- develop
- Mainly maintained branch for developing new features for the future release
- misc
- Previously developed models can be found on this branch
License
OpenVINO™ Toolkit is licensed under Apache License Version 2.0. By contributing to the project, you agree to the license and copyright terms therein and release your contribution under these terms.
Issues / Discussions
Please use Issues tab for your bug reporting, feature requesting, or any questions.
Known limitations
misc branch contains training, evaluation, and export scripts for models based on TensorFlow and PyTorch. These scripts are not ready for production. They are exploratory and have not been validated.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Hashes for otx-1.4.3-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8e56ada6f41659effaa790a042d53084b65c6a5fabcc00ae18da5a61926302b1 |
|
MD5 | 813935dda45f0bf6015272f09b1a6945 |
|
BLAKE2b-256 | 6fd4c003764e3ab14cce7b2253eef0a75feff762911671790bca6f273da311d5 |
Hashes for otx-1.4.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 23a770e4970ce42ff690ebd729b380bcea677a18c2b5f1301ea05806361a6e6f |
|
MD5 | e850b76d4d292497955ce210d47a8cc8 |
|
BLAKE2b-256 | 64d3110208b7eb19ecd1b3adcdfa7e40fde2863f69bcc92a6739114acae29353 |
Hashes for otx-1.4.3-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1caefe7b26ef2438d6dd52cb6cd5eb4c14d03f0982d01bbbd9d5508ce1c4c84a |
|
MD5 | baee8e6f4d36fae53dd7aa36a0e3dd9d |
|
BLAKE2b-256 | 811c852071ca939fb88b613d41ed05212600e5d66657317a2d1858564df58ea4 |
Hashes for otx-1.4.3-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d8977593b6c53764e12af5250d2d0f00e9e5fb0c43fd306fade9f26741a58b2f |
|
MD5 | 849c1934d01558d26b5d0d077ac2bdca |
|
BLAKE2b-256 | ef45e47d6988169fb5ed6480b9462723b6517bb560a2893420203c7e6cb9f3d8 |