visionai-data-format
A converter and validation tool for the VisionAI format.
VisionAI is Dataverse's standardized annotation format for labeling objects and sequences in the context of Autonomous Driving Systems (ADS). It provides a consistent and effective way to describe and categorize real-world driving environments.
This tool provides a validator for the VisionAI format schema. Currently, the library supports:
- Validating the created VisionAI data format
- Validating VisionAI data attributes against given Ontology information
Getting started
(WIP)
Install the package
pip install visionai-data-format
Prerequisites: You must have Python 3.7 or above to use this package.
Example
The following sections provide usage examples.
Validate VisionAI schema
To validate the VisionAI data structure, follow the example below:
from visionai_data_format.schemas.visionai_schema import VisionAIModel
# your custom visionai data
custom_visionai_data = {
"visionai": {
"frame_intervals": [
{
"frame_start": 0,
"frame_end": 0
}
],
"frames": {
"000000000000": {
"objects": {
"893ac389-7782-4bc3-8f61-09a8e48c819f": {
"object_data": {
"bbox": [
{
"name": "bbox_shape",
"stream":"camera1",
"val": [761.565,225.46,98.33000000000004, 164.92000000000002]
}
],
"cuboid": [
{
"name": "cuboid_shape",
"stream": "lidar1",
"val": [
8.727633224700037,-1.8557590122690717,-0.6544039394148177, 0.0,
0.0,-1.5807963267948966,1.2,0.48,1.89
]
}
]
}
}
},
"frame_properties": {
"streams": {
"camera1": {
"uri": "https://helenmlopsstorageqatest.blob.core.windows.net/vainewformat/kitti/kitti_small/data/000000000000/data/camera1/000000000000.png"
},
"lidar1": {
"uri": "https://helenmlopsstorageqatest.blob.core.windows.net/vainewformat/kitti/kitti_small/data/000000000000/data/lidar1/000000000000.pcd"
}
}
}
}
},
"objects": {
"893ac389-7782-4bc3-8f61-09a8e48c819f": {
"frame_intervals": [
{
"frame_start": 0,
"frame_end": 0
}
],
"name": "pedestrian",
"object_data_pointers": {
"bbox_shape": {
"frame_intervals": [
{
"frame_start": 0,
"frame_end": 0
}
],
"type": "bbox"
},
"cuboid_shape": {
"frame_intervals": [
{
"frame_start": 0,
"frame_end": 0
}
],
"type": "cuboid"
}
},
"type": "pedestrian"
}
},
"coordinate_systems": {
"lidar1": {
"type": "sensor_cs",
"parent": "",
"children": [
"camera1"
]
},
"camera1": {
"type": "sensor_cs",
"parent": "lidar1",
"children": [],
"pose_wrt_parent": {
"matrix4x4": [
-0.00159609942076306,
-0.005270645688933059,
0.999984790046273,
0.3321936949138632,
-0.9999162467477257,
0.012848695454066989,
-0.0015282672486530082,
-0.022106263278130818,
-0.012840436309973332,
-0.9999035522454274,
-0.0052907123281999745,
-0.06171977032225582,
0.0,
0.0,
0.0,
1.0
]
}
}
},
"streams": {
"camera1": {
"type": "camera",
"uri": "https://helenmlopsstorageqatest.blob.core.windows.net/vainewformat/kitti/kitti_small/data/000000000000/data/camera1/000000000000.png",
"description": "Frontal camera",
"stream_properties": {
"intrinsics_pinhole": {
"camera_matrix_3x4": [
-1.1285209781809271,
-706.9900823216068,
-181.46849639413674,
0.2499212908887926,
-3.726606344908137,
9.084661126711246,
-1.8645282480709864,
-0.31027342289053916,
707.0385458128643,
-1.0805602883730354,
603.7910589125847,
45.42556655376811
],
"height_px": 370,
"width_px": 1224
}
}
},
"lidar1": {
"type": "lidar",
"uri": "https://helenmlopsstorageqatest.blob.core.windows.net/vainewformat/kitti/kitti_small/data/000000000000/data/lidar1/000000000000.pcd",
"description": "Central lidar"
}
},
"metadata": {
"schema_version": "1.0.0"
}
}
}
# Validate the custom data.
# If the data structure doesn't meet the VisionAI requirements, it raises a validation (BaseModel) error message;
# otherwise, it returns a dictionary of the validated VisionAI data.
validated_visionai = VisionAIModel(**custom_visionai_data).dict()
First, we declare our custom VisionAI data, then call VisionAIModel(**custom_visionai_data).dict() to validate it against the VisionAI schema. It will raise an error if any required field is missing or a value doesn't match the defined data type (a BaseModel error message). Otherwise, it returns a dictionary of the validated VisionAI data.
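The error path can also be handled explicitly. The snippet below is a minimal sketch that assumes the schema classes are pydantic models (which the BaseModel error message suggests) and reuses custom_visionai_data from the example above:
from pydantic import ValidationError

try:
    validated_visionai = VisionAIModel(**custom_visionai_data).dict()
except ValidationError as exc:
    # Each entry reports the offending field path and the reason validation failed.
    for error in exc.errors():
        print(error["loc"], error["msg"])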
Validate VisionAI data with given Ontology
Before uploading a dataset to the Dataverse platform, we can validate a VisionAI annotation against an Ontology schema. The Ontology schema corresponds to the predefined Project Ontology data in Dataverse.
An Ontology contains four main elements, contexts, objects, streams, and tags, similar to the VisionAI schema. The difference is that an Ontology is the union of all categories and attributes against which the VisionAI data is compared.
- contexts needs to be filled only if the project ontology is of the classification type (see the sketch right after this list).
- objects needs to be filled for project ontologies other than classification, such as bounding_box or semantic_segmentation.
- streams is always required, since it holds the project's sensor-related information.
- tags needs to be filled for a semantic_segmentation project ontology.
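For instance, a classification-type project ontology would fill contexts (and streams) while leaving objects empty and tags unset. The dictionary below is only an illustrative sketch of that rule; the attribute names are made up and it is not an excerpt from Dataverse:
classification_ontology_sketch = {
    "contexts": {
        "*tagging": {
            "attributes": {
                "weather": {"type": "text", "value": []},
                "scene": {"type": "vec", "value": ["urban", "highway", "rural"]}
            }
        }
    },
    # objects left empty because this is a classification project (the exact empty value may differ).
    "objects": {},
    "streams": {
        "camera1": {"type": "camera"}
    },
    "tags": None
}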
The following is an example of an Ontology schema and how to validate VisionAI data against it:
from visionai_data_format.schemas.ontology import Ontology
custom_ontology = {
"objects": {
"pedestrian": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
},
"activity": {
"type": "text",
"value": []
}
}
},
"truck": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
},
"color": {
"type": "text",
"value": []
},
"new": {
"type": "boolean",
"value": []
},
"year": {
"type": "num",
"value": []
},
"status": {
"type": "vec",
"value": [
"stop",
"run",
"small",
"large"
]
}
}
},
"car": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
},
"color": {
"type": "text",
"value": []
},
"new": {
"type": "boolean",
"value": []
},
"year": {
"type": "num",
"value": []
},
"status": {
"type": "vec",
"value": [
"stop",
"run",
"small",
"large"
]
}
}
},
"cyclist": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
}
}
},
"dontcare": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
}
}
},
"misc": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
},
"color": {
"type": "text",
"value": []
},
"info": {
"type": "vec",
"value": [
"toyota",
"new"
]
}
}
},
"van": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
}
}
},
"tram": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
}
}
},
"person_sitting": {
"attributes": {
"bbox_shape": {
"type": "bbox",
"value": None
},
"cuboid_shape": {
"type": "cuboid",
"value": None
}
}
}
},
"contexts":{
"*tagging": {
"attributes":{
"profession": {
"type": "text",
"value": []
},
"roadname": {
"type": "text",
"value": []
},
"name": {
"type": "text",
"value": []
},
"unknown_object": {
"type": "vec",
"value": [
"sky",
"leaves",
"wheel_vehicle",
"fire",
"water"
]
},
"static_status": {
"type": "boolean",
"value": [
"true",
"false"
]
},
"year": {
"type": "num",
"value": []
},
"weather": {
"type": "text",
"value": []
}
}
}
},
"streams": {
"camera1": {
"type": "camera"
},
"lidar1": {
"type": "lidar"
}
},
"tags": None
}
# Validate your custom ontology
validated_ontology = Ontology(**custom_ontology).dict()
# Validate the VisionAI data against our ontology; custom_visionai_data is the data from the example above
errors = VisionAIModel(**custom_visionai_data).validate_with_ontology(ontology=validated_ontology)
# Show the errors
# If any error occurred, it returns a list of exception messages;
# otherwise, it returns an empty list.
# Example of errors:
# > [visionai_data_format.exceptions.visionai.VisionAIException("frame stream sensors {'lidar2'} doesn't match with visionai streams sensor {'camera1', 'lidar1'}.")]
print(errors)
First, create a new Ontology that contains the project ontology. Then, call validate_with_ontology(ontology=validated_ontology) to check whether the current VisionAI data meets the Ontology information. It returns a list of error messages if any error occurred; otherwise, it returns an empty list.
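In practice, the returned list can be used to gate the upload step. A minimal sketch (the print statements are placeholders for whatever upload or reporting logic you use):
errors = VisionAIModel(**custom_visionai_data).validate_with_ontology(ontology=validated_ontology)
if errors:
    # Each entry is an exception message describing one mismatch against the ontology.
    for error in errors:
        print(f"ontology check failed: {error}")
else:
    print("VisionAI data matches the ontology; safe to proceed with the upload")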
Tools
Convert BDD+ format data to VisionAI format
(Only box2D and camera sensor data are supported for now)
python visionai_data_format/convert_dataset.py -input_format bddp -output_format vision_ai -image_annotation_type 2d_bounding_box -input_annotation_path ./bdd_test.json -source_data_root ./data_root -output_dest_folder ~/visionai_output_dir -uri_root http://storage_test -n_frame 5 -sequence_idx_start 0 -camera_sensor_name camera1 -annotation_name groundtruth -img_extension .jpg --copy_sensor_data
Arguments:
- -input_format: input format (use bddp for BDD+)
- -output_format: output format (vision_ai)
- -image_annotation_type: label annotation type for images (2d_bounding_box for box2D)
- -input_annotation_path: source annotation path (BDD+ format JSON file)
- -source_data_root: source data root for sensor data and calibration data (images are found and copied from this root)
- -output_dest_folder: output root folder (VisionAI local root folder)
- -uri_root: URI root of the target VAI storage to upload to, e.g. https://azuresorate/vai_dataset
- -n_frame: number of frames to convert (-1 means all; default: -1)
- -sequence_idx_start: sequence start id (default: 0)
- -camera_sensor_name: camera sensor name (default: ""; specify it if camera data needs to be converted)
- -lidar_sensor_name: lidar sensor name (default: ""; specify it if lidar data needs to be converted)
- -annotation_name: annotation folder name (default: "groundtruth")
- -img_extension: image file extension (default: ".jpg")
- --copy_sensor_data: enable copying of image/lidar data
Convert VisionAI format data to BDD+ format
(Only box2D is supported for now)
The script below converts VisionAI annotation data to a BDD+ JSON file:
python visionai_data_format/vai_to_bdd.py -vai_src_folder /path_for_visionai_root_folder -bdd_dest_file /dest_path/bdd.json -company_code 99 -storage_name storge1 -container_name dataset1 -annotation_name groundtruth
Arguments:
- -vai_src_folder: VAI root folder that contains the VAI format JSON files
- -bdd_dest_file: destination path for the BDD+ format file
- -company_code: company code
- -storage_name: storage name
- -container_name: container name (dataset name)
- -annotation_name: annotation folder name (default: "groundtruth")
Convert KITTI format data to VisionAI format
(Only KITTI with one camera and one lidar sensor is supported)
Important:
- the image type is not restricted and can be ".jpg" or ".png", but it will be converted to ".jpg" in the VisionAI format
- only the P2 projection matrix calibration information is supported
Currently, only a KITTI dataset with the following folder structure is supported:
.kitti_folder
├── calib
│ ├── 000000.txt
│ ├── 000001.txt
│ ├── 000002.txt
│ ├── 000003.txt
│ └── 000004.txt
├── data
│ ├── 000000.png
│ ├── 000001.png
│ ├── 000002.png
│ ├── 000003.png
│ └── 000004.png
├── labels
│ ├── 000000.txt
│ ├── 000001.txt
│ ├── 000002.txt
│ ├── 000003.txt
│ └── 000004.txt
└── pcd
├── 000000.pcd
├── 000001.pcd
├── 000002.pcd
├── 000003.pcd
└── 000004.pcd
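Before running the converter, it can help to sanity-check that your dataset matches the layout above. The helper below is a small optional sketch (not part of this package) that verifies the four sub-folders exist and hold the same number of files:
from pathlib import Path

def check_kitti_layout(root: str) -> None:
    root_path = Path(root)
    counts = {}
    for subdir in ("calib", "data", "labels", "pcd"):
        folder = root_path / subdir
        if not folder.is_dir():
            raise FileNotFoundError(f"missing expected folder: {folder}")
        counts[subdir] = sum(1 for item in folder.iterdir() if item.is_file())
    if len(set(counts.values())) != 1:
        raise ValueError(f"file counts differ between folders: {counts}")

check_kitti_layout("./data_root")  # same path as -source_data_root in the command below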
Command:
python visionai_data_format/convert_dataset.py -input_format kitti -output_format vision_ai -image_annotation_type 2d_bounding_box -source_data_root ./data_root -output_dest_folder ~/visionai_output_dir -uri_root http://storage_test -n_frame 5 -sequence_idx_start 0 -camera_sensor_name camera1 -lidar_sensor_name lidar1 -annotation_name groundtruth -img_extension .jpg --copy_sensor_data
Arguments:
- -input_format: input format (use kitti for KITTI)
- -output_format: output format (vision_ai)
- -image_annotation_type: label annotation type for images (2d_bounding_box for box2D)
- -source_data_root: source data root for sensor data and calibration data (images are found and copied from this root)
- -output_dest_folder: output root folder (VisionAI local root folder)
- -uri_root: URI root of the target VAI storage to upload to, e.g. https://azuresorate/vai_dataset
- -n_frame: number of frames to convert (-1 means all; default: -1)
- -sequence_idx_start: sequence start id (default: 0)
- -camera_sensor_name: camera sensor name (default: ""; specify it if camera data needs to be converted)
- -lidar_sensor_name: lidar sensor name (default: ""; specify it if lidar data needs to be converted)
- -annotation_name: annotation folder name (default: "groundtruth")
- -img_extension: image file extension (default: ".jpg")
- --copy_sensor_data: enable copying of image/lidar data
Convert COCO format data to VisionAI format
python visionai_data_format/convert_dataset.py -input_format coco -output_format vision_ai -image_annotation_type 2d_bounding_box -input_annotation_path ./coco_instance.json -source_data_root ./coco_images/ -output_dest_folder ~/visionai_output_dir -uri_root http://storage_test -n_frame 5 -sequence_idx_start 0 -camera_sensor_name camera1 -annotation_name groundtruth -img_extension .jpg --copy_sensor_data
Arguments:
- -input_format: input format (use coco for the COCO format)
- -output_format: output format (vision_ai)
- -image_annotation_type: label annotation type for images (2d_bounding_box for box2D)
- -source_data_root: image data folder
- -output_dest_folder: output root folder (VisionAI local root folder)
- -uri_root: URI root of the target VAI storage to upload to, e.g. https://azuresorate/vai_dataset
- -n_frame: number of frames to convert (-1 means all; default: -1)
- -sequence_idx_start: sequence start id (default: 0)
- -camera_sensor_name: camera sensor name (default: ""; specify it if camera data needs to be converted)
- -annotation_name: annotation folder name (default: "groundtruth")
- -img_extension: image file extension (default: ".jpg")
- --copy_sensor_data: enable copying of image/lidar data
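If you prefer driving the conversion from Python instead of the shell, the documented command can simply be wrapped with subprocess. This is a convenience sketch reusing the flags from the COCO command above, not an API exposed by the package:
import subprocess
from pathlib import Path

cmd = [
    "python", "visionai_data_format/convert_dataset.py",
    "-input_format", "coco",
    "-output_format", "vision_ai",
    "-image_annotation_type", "2d_bounding_box",
    "-input_annotation_path", "./coco_instance.json",
    "-source_data_root", "./coco_images/",
    "-output_dest_folder", str(Path("~/visionai_output_dir").expanduser()),
    "-uri_root", "http://storage_test",
    "-camera_sensor_name", "camera1",
    "-annotation_name", "groundtruth",
    "-img_extension", ".jpg",
    "--copy_sensor_data",
]
# check=True raises CalledProcessError if the converter exits with a non-zero status.
subprocess.run(cmd, check=True)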
Troubleshooting
(WIP)
Next steps
(WIP)
Contributing
(WIP)
Links to language repos
(WIP)