
simple-cocotools

A simple, modern alternative to pycocotools.

About

Why not just use pycocotools?

  • Code is more readable and hackable.
  • Metrics are more transparent and understandable.
  • Evaluation is fast.
  • The only dependencies are numpy and scipy. No Cython extensions.
  • Code is more modern (type annotations, linting, etc.).

Install

From PyPI

pip install simple-cocotools

From Repo

pip install "simple-cocotools @ git+ssh://git@github.com/fkodom/simple-cocotools.git"

For Contributors

# Clone this repository
gh repo clone fkodom/simple-cocotools
cd simple-cocotools
# Install all dev dependencies (tests etc.)
pip install -e .[all]
# Setup pre-commit hooks
pre-commit install

Usage

simple-cocotools expects target annotations in the same format as model predictions (the format used by all torchvision detection models). You may already have code to convert annotations into this format, since it's required to train many detection models. If not, use 'AnnotationsToDetectionFormat' from this repo as an example of how to do that.
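
In that format, each target (like each prediction) is a dictionary of tensors: "boxes" with shape (num_objects, 4) in (x1, y1, x2, y2) order, integer "labels", and optionally "masks". As a rough sketch of the conversion (the helper below is illustrative, not the repo's 'AnnotationsToDetectionFormat'):

import torch

def coco_to_detection_format(annotations: list) -> dict:
    """Convert COCO-style annotation dicts for one image into the
    torchvision detection target format. Illustrative only."""
    boxes, labels = [], []
    for ann in annotations:
        # COCO stores boxes as [x, y, width, height]; torchvision
        # detection models expect [x1, y1, x2, y2].
        x, y, w, h = ann["bbox"]
        boxes.append([x, y, x + w, y + h])
        labels.append(ann["category_id"])
    return {
        "boxes": torch.as_tensor(boxes, dtype=torch.float32).reshape(-1, 4),
        "labels": torch.as_tensor(labels, dtype=torch.int64),
    }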

A minimal example:

from torchvision.models.detection import maskrcnn_resnet50_fpn
from simple_cocotools import CocoEvaluator

evaluator = CocoEvaluator()
model = maskrcnn_resnet50_fpn(pretrained=True).eval()

for images, targets in data_loader:
    predictions = model(images)
    evaluator.update(predictions, targets)

metrics = evaluator.summarize()

metrics will be a dictionary with the format:

{
    "box": {
        "mAP": 0.40,
        "mAR": 0.41,
        "class_AP": {
            "cat": 0.39,
            "dog": 0.42,
            ...
        },
        "class_AR": {
            # Same as 'class_AP' above.
        }
    },
    "mask": {
        # Same as 'box' above.
    }
}
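
Individual metrics can be read straight out of that dictionary, e.g.:

print(f"box mAP: {metrics['box']['mAP']:.2f}")
print(f"cat AP:  {metrics['box']['class_AP']['cat']:.2f}")

(Class names like "cat" come from your dataset's labelmap.)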

For a more complete example, see scripts/mask_rcnn_example.py.

Benchmarks

I benchmarked against several torchvision detection models, which have mAP scores reported on the PyTorch website.

Using a default score threshold of 0.5:

| Model        | Backbone          | box mAP (official) | box mAP | box mAR | mask mAP (official) | mask mAP | mask mAR |
|--------------|-------------------|--------------------|---------|---------|---------------------|----------|----------|
| Mask R-CNN   | ResNet50          | 37.9               | 36.9    | 43.2    | 34.6                | 34.1     | 40.0     |
| Faster R-CNN | ResNet50          | 37.0               | 36.3    | 42.0    | -                   | -        | -        |
| Faster R-CNN | MobileNetV3-Large | 32.8               | 39.9    | 35.0    | -                   | -        | -        |

Notice that the box mAP for MobileNetV3-Large is artificially high, since it has a much lower mAR at that score threshold. After tuning the score threshold so that mAP and mAR are more balanced:

| Model        | Backbone          | Threshold | box mAP | box mAR | mask mAP | mask mAR |
|--------------|-------------------|-----------|---------|---------|----------|----------|
| Mask R-CNN   | ResNet50          | 0.6       | 41.1    | 41.3    | 38.2     | 38.5     |
| Faster R-CNN | ResNet50          | 0.6       | 40.8    | 40.4    | -        | -        |
| Faster R-CNN | MobileNetV3-Large | 0.425     | 36.2    | 36.2    | -        | -        |

These scores are more reflective of model performance, in my opinion. Mask R-CNN slightly outperforms Faster R-CNN, and there is a noticeable (but not horrible) gap between the ResNet50 and MobileNetV3-Large backbones. The PyTorch docs don't mention what score thresholds were used for each model benchmark. ¯\_(ツ)_/¯
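
For reference, tuning the threshold just means dropping low-confidence detections before passing them to the evaluator. A minimal sketch, assuming torchvision-style prediction dicts with a "scores" field (filter_by_score is a hypothetical helper, not part of this library):

def filter_by_score(prediction: dict, threshold: float) -> dict:
    # Keep only detections whose confidence meets the threshold. Boolean
    # indexing applies along the detection axis of every field.
    keep = prediction["scores"] >= threshold
    return {key: value[keep] for key, value in prediction.items()}

for images, targets in data_loader:
    predictions = [filter_by_score(p, threshold=0.6) for p in model(images)]
    evaluator.update(predictions, targets)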

Ignoring the time spent getting predictions from the model, evaluation is very fast. On a Google Cloud n1-standard-4 VM (4 vCPUs, 16 GB RAM):

  • Bbox: ~400 samples/second
  • Bbox + mask: ~100 samples/second

Note: Speeds are dependent on the number of detections per image, and therefore dependent on the model and score threshold.

Keypoints Usage

Keypoint mAP and mAR normally use pre-computed "sigmas" to determine the "correctness" of each keypoint prediction. Unfortunately, those sigmas are tailored specifically to human pose (as in the COCO dataset), and are not applicable to other keypoint datasets.

NOTE: Sigmas are actually computed using the predictions of a specific model trained on COCO. To make this applicable to other datasets, you would need to train a model on that dataset, and then use the sigmas from that model. The logic is somewhat circular -- you need to train a model to get the sigmas, but you need the sigmas to compute mAP / mAR.

There's no way around this, unless a large body of pretrained models is already available for the dataset you're using. For most real-world problems, that is not the case. So, the open-source keypoints mAP / mAR metrics are not generally extensible to other datasets.

simple-cocotools does not use sigmas, and instead computes the average distance between each keypoint prediction and ground truth. This is a much simpler approach, and is more applicable to other datasets. It's roughly how the sigmas for COCO were originally computed. The downside is that it's not directly comparable to the official COCO keypoints mAP / mAR.

Some keypoints are more ambiguous than others. For example, "left hip" is much more ambiguous than "left eye" -- the exact location of "left eye" should be obvious, while "left hip" is hidden by the torso and clothing. The average distance for "left hip" will be much larger than "left eye", even if the predictions are correct. (This is how sigmas were used in the official COCO keypoints mAP / mAR.) For that reason, keypoint distances should be interpreted with some knowledge about the specific dataset at hand.
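
Conceptually, the distance metric boils down to averaging Euclidean distances over matched detections. A simplified sketch (not the library's exact implementation):

import numpy as np

def mean_keypoint_distance(pred: np.ndarray, true: np.ndarray) -> np.ndarray:
    """Per-keypoint mean Euclidean distance over matched detections.

    'pred' and 'true' have shape (num_detections, num_keypoints, 2),
    holding (x, y) coordinates for already-matched pairs.
    """
    # Distance for each keypoint of each matched detection, then
    # average over detections to get one distance per keypoint.
    return np.linalg.norm(pred - true, axis=-1).mean(axis=0)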

For keypoint models, metrics will be a dictionary with the format:

{
    "box": {
        "mAP": 0.40,
        "mAR": 0.41,
        "class_AP": {
            "cat": 0.39,
            "dog": 0.42,
            ...
        },
        "class_AR": {
            # Same as 'class_AP' above.
        }
    },
    "keypoints": {
        "distance": 0.10,
        "class_distance": {
            "cat": {
                "distance": 0.11,
                "keypoint_distance": {
                    "left_eye": 0.12,
                    "right_eye": 0.13,
                    ...
                }
            },
            ...
        }
    }
}

How It Works

TODO: Blog post on how simple-cocotools works.

  1. Match the predictions and ground truth labels together, maximizing the IoU between pairs with the same object class. SciPy's linear_sum_assignment method does most of the heavy lifting here (see the sketch after this list).
  2. For each IoU threshold, determine the number of "correct" predictions from the assignments above. Pairs with IoU < threshold are incorrect.
  3. For each image, count the number of total predictions, correct predictions, and ground truth labels for each object class and IoU threshold.
  4. Compute AP/AR for each class from the prediction counts above. Then compute mAP and mAR by averaging over all object classes.
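
A condensed sketch of steps 1-2, assuming a precomputed IoU matrix for one object class (simplified relative to the actual source):

import numpy as np
from scipy.optimize import linear_sum_assignment

def count_correct(iou_matrix: np.ndarray, threshold: float) -> int:
    """Count "correct" predictions given a (num_pred, num_true) IoU
    matrix for a single object class."""
    # Assign predictions to labels, maximizing the total IoU.
    pred_idx, true_idx = linear_sum_assignment(iou_matrix, maximize=True)
    # Assigned pairs with IoU < threshold are incorrect.
    return int((iou_matrix[pred_idx, true_idx] >= threshold).sum())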
