Helper for dealing with MS-COCO annotations

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

COCO-Assistant

CircleCI

Helper for dealing with MS-COCO annotations.

Overview

The MS COCO annotation format along with the pycocotools library is quite popular among the computer vision community. Yet I for one found it difficult to play around with the annotations. Deleting a specific category, combining multiple mini datasets to generate a larger dataset, viewing distribution of classes in the annotation file are things I would like to do without writing a separate script for each. The COCO Assistant is designed (or being designed) to assist with this problem. Please note that currently, the Assistant can only help out with object detection datasets. Any contributions and/or suggestions are welcome.

Requirements

Your data directory should look as follows:

Example:
.
├── images
│   ├── train
│   ├── val
|   ├── test
|   
├── annotations
│   ├── train.json
│   ├── val.json
│   ├── test.json

Installation

1. Installation: pip

pip install coco-assistant

2. Installation: From Source

# Clone the repository
git clone https://github.com/ashnair1/COCO-Assistant.git
# Build and install the library
make

Usage

Usage is similar to how you would use pycocotools

from coco_assistant import COCO_Assistant

# Specify image and annotation directories
img_dir = os.path.join(os.getcwd(), 'images')
ann_dir = os.path.join(os.getcwd(), 'annotations')

# Create COCO_Assistant object
cas = COCO_Assistant(img_dir, ann_dir)

Package features

1. Merge datasets

The combine function allows you to merge multiple datasets.

In[1]: cas = COCO_Assistant(img_dir, ann_dir)                                                                                                                                                              
loading annotations into memory...
Done (t=0.09s)
creating index...
index created!
loading annotations into memory...
Done (t=0.06s)
creating index...
index created!

In[2]: cas.combine()                                                                                                                                                                                       
Merging image dirs
100%|█████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00, 18.33it/s]
Merging annotations
100%|█████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00, 14.72it/s]

The merged dataset (images and annotation) can be found in ./results/combination

2. Remove categories

Removes a specific category from an annotation file.

In[1]: cas = COCO_Assistant(img_dir, ann_dir)                                                                                                                                                              
loading annotations into memory...
Done (t=0.09s)
creating index...
index created!
loading annotations into memory...
Done (t=0.06s)
creating index...
index created!

# In interactive mode
In[2]: cas.remove_cat(interactive=True)
['tiny.json', 'tiny2.json']
Who needs a cat removal?
tiny.json

Categories present:
['building', 'vehicles']

Enter categories you wish to remove as a list:
['building']
Removing specified categories...

# In non-interactive mode
In[3]: cas.remove_cat(interactive=False, jc="tiny.json", rcats=['building'])
Removing specified categories...

The modified annotation can be found in ./results/removal

3. Generate annotation statistics

Generate countplot of instances per category that occur in the annotation files. cas.ann_stats(stat="area",arearng=[10,144,512,1e5],save=False)
Generate pie-chart that shows distribution of objects according to their size (as specified in areaRng). cas.ann_stats(stat="cat", show_count=False, save=False)

4. Visualise annotations

Couldn't pycocotools visualise annotations (via showAnns) as well? Sure it could, but I required a way to freely view all the annotations of a particular dataset so here we are.

In[1]: cas.visualise()
Choose directory:
['tiny', 'tiny2']
tiny

5. Generate segmentation masks

The cas.get_segmasks() function allows you to create segmentation masks from your MS COCO object detection datasets. Similar to the Pascal VOC dataset, the mask values are their classes and a colour palette is applied to enable visualisation. The generated masks are stroed in the ./results folder. Samples are shown below.

	Detection	Segmentation
SpaceNet
iSAID

Todo

Converter for converting COCO annotations to YOLO format.
Write tests for untested functions :)

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.4.0

Aug 23, 2021

0.3.5

May 6, 2021

0.3.4

Mar 14, 2021

0.3.1

May 18, 2020

0.3.0

Apr 19, 2020

This version

0.2.0

Nov 28, 2019

0.1.0

Oct 8, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

coco_assistant-0.2.0.tar.gz (5.7 MB view hashes)

Uploaded Nov 28, 2019 Source

Built Distribution

coco_assistant-0.2.0-py3-none-any.whl (47.3 kB view hashes)

Uploaded Nov 28, 2019 Python 3

Hashes for coco_assistant-0.2.0.tar.gz

Hashes for coco_assistant-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`396f7b76c022e4118f3ca63fe7b35dfc7e8fb15e4a082ebe423422e49e38613d`
MD5	`05e1904eacff35bcf125d963e338a275`
BLAKE2b-256	`6e1ed568c2f911bf184f50f34371ffbfbf9aff8ee10c2302532646c1e5931b6e`

Hashes for coco_assistant-0.2.0-py3-none-any.whl

Hashes for coco_assistant-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0df10d36d6fce9b1ea11feba2b10123ee1b4fd1ab4c14fe78947ad31b7784e25`
MD5	`0425cacad8cb0c4e2ae778a92c5c45df`
BLAKE2b-256	`acfaaf9cf203167f8396f28942e568453f78b27dd0f2f8ff6075f8c73c275a36`