Skip to main content

Convert Open Images Dataset v6 to PASCAL VOC format.

Project description


Convert bounding box datasets of Open Images Dataset v6 to VOC XML format.


pip3 install oidv6-to-voc


Once installed, you should be able to run it directly:

oidv6-to-voc -h

If your shell cannot find the command, try running it with:

python3 -m oidv6_to_voc -h

CLI options

To start converting, you need at least a part of the images, the class names metadata and at least one of the boxes annotation CSV file:

CSV files you need

oidv6-to-voc <annotation-file(s).csv>
             -d <class-names-file.csv> 
             --imgd <directory/to/your/images>
             --outd <your/output/diretory>

About the Dataset

The Open Images V6 Dataset contains 600 classes with 1900000+ images. The images are hosted on AWS, and the CSV files can be downloaded here.

To download it in full, you'll need 500+ GB of disk space. For downloading a part of the dataset only, I would recommend the DmitryRyumin/OIDv6 tool.


This repo is forked from AtriSaxena/OIDv4_to_VOC.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for oidv6-to-voc, version 0.1.5
Filename, size File type Python version Upload date Hashes
Filename, size oidv6_to_voc-0.1.5-py3-none-any.whl (5.1 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size oidv6-to-voc-0.1.5.tar.gz (3.8 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page