
A vision library for performing sliced inference on large images/small objects

Project description

SAHI: Slicing Aided Hyper Inference

A lightweight vision library for performing large scale object detection & instance segmentation


Overview

Object detection and instance segmentation are among the most important application areas in computer vision. However, detecting small objects and running inference on large images remain major challenges in practical use. SAHI helps developers overcome these real-world problems.

Getting Started

Blogpost

Check the official SAHI blog post.

Installation
  • Install sahi using pip:
pip install sahi
  • On Windows, Shapely needs to be installed via Conda:
conda install -c conda-forge shapely
  • Install your desired versions of PyTorch and torchvision:
pip install torch torchvision
  • Install your desired detection framework (such as mmdet or yolov5):
pip install mmdet mmcv
pip install yolov5

Usage

From Python:
  • Sliced inference:
from sahi import get_sliced_prediction

# image: an image path or numpy array; detection_model: an initialized sahi detection model
result = get_sliced_prediction(
    image,
    detection_model,
    slice_height=256,
    slice_width=256,
    overlap_height_ratio=0.2,
    overlap_width_ratio=0.2,
)

Check the YOLOv5 + SAHI demo notebook and the MMDetection + SAHI demo notebook on Colab.

  • Slice an image:
from sahi.slicing import slice_image

slice_image_result = slice_image(
    image=image_path,
    output_file_name=output_file_name,
    output_dir=output_dir,
    slice_height=256,
    slice_width=256,
    overlap_height_ratio=0.2,
    overlap_width_ratio=0.2,
)
  • Slice a coco formatted dataset:
from sahi.slicing import slice_coco

coco_dict, coco_path = slice_coco(
    coco_annotation_file_path=coco_annotation_file_path,
    image_dir=image_dir,
    slice_height=256,
    slice_width=256,
    overlap_height_ratio=0.2,
    overlap_width_ratio=0.2,
)

Refer to slicing notebook for detailed usage.
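To build intuition for what the slicing parameters do, here is a minimal, self-contained sketch of how slice windows can be derived from a slice size and overlap ratios. This is an illustration of the general technique, not sahi's internal implementation:

```python
def compute_slice_boxes(image_width, image_height, slice_width=256, slice_height=256,
                        overlap_width_ratio=0.2, overlap_height_ratio=0.2):
    """Return [x_min, y_min, x_max, y_max] windows that cover the full image,
    stepping by (slice size - overlap) and shifting edge windows inward so
    every window keeps the full slice size."""
    x_overlap = int(slice_width * overlap_width_ratio)
    y_overlap = int(slice_height * overlap_height_ratio)
    boxes = []
    y_min = 0
    while y_min < image_height:
        y_max = y_min + slice_height
        if y_max > image_height:
            # clamp the last row to the image border
            y_min, y_max = max(image_height - slice_height, 0), image_height
        x_min = 0
        while x_min < image_width:
            x_max = x_min + slice_width
            if x_max > image_width:
                # clamp the last column to the image border
                x_min, x_max = max(image_width - slice_width, 0), image_width
            boxes.append([x_min, y_min, x_max, y_max])
            if x_max >= image_width:
                break
            x_min += slice_width - x_overlap
        if y_max >= image_height:
            break
        y_min += slice_height - y_overlap
    return boxes

# a 512x512 image with 256x256 slices and 0.2 overlap yields a 3x3 grid of windows
boxes = compute_slice_boxes(512, 512)
```

Each window is later run through the detector independently, and the per-slice detections are shifted back into full-image coordinates before postprocessing.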

From CLI:
python scripts/predict.py --source image/file/or/folder --model_path path/to/model --config_path path/to/config

This will perform sliced inference with default parameters and export the prediction visuals to the runs/predict/exp folder.

You can specify sliced inference parameters as:

python scripts/predict.py --slice_width 256 --slice_height 256 --overlap_height_ratio 0.1 --overlap_width_ratio 0.1 --conf_thresh 0.25 --source image/file/or/folder --model_path path/to/model --config_path path/to/config
  • Specify the postprocess type to apply over sliced predictions as --postprocess_type UNIONMERGE or --postprocess_type NMS

  • Specify postprocess match metric as --match_metric IOS for intersection over smaller area or --match_metric IOU for intersection over union

  • Specify postprocess match threshold as --match_thresh 0.5

  • Add --class_agnostic argument to ignore category ids of the predictions during postprocess (merging/nms)

  • If you want to export prediction pickles and cropped predictions, add the --pickle and --crop arguments. To change the crop export format, set it as --visual_export_format JPG.

  • If you don't want to export prediction visuals, add the --novisual argument.

  • By default, the script applies both standard and sliced prediction (multi-stage inference). If you don't want to perform sliced prediction, add the --no_sliced_pred argument. If you don't want to perform standard prediction, add the --no_standard_pred argument.

  • If you want to perform prediction using a COCO annotation file, provide the COCO json path via --coco_file path/to/coco/file and the COCO image folder via --source path/to/coco/image/folder; predictions will be exported as a COCO json file to runs/predict/exp/results.json. Then you can use the coco_error_analysis.py script to calculate COCO evaluation results.
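The two match metrics can be illustrated with a small, dependency-free sketch. These helper functions are illustrative, not sahi's internals:

```python
def box_area(box):
    # box = [x_min, y_min, x_max, y_max]
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])

def intersection_area(a, b):
    # overlap rectangle of two boxes; zero if they do not intersect
    x_min, y_min = max(a[0], b[0]), max(a[1], b[1])
    x_max, y_max = min(a[2], b[2]), min(a[3], b[3])
    return max(0.0, x_max - x_min) * max(0.0, y_max - y_min)

def iou(a, b):
    """Intersection over union (--match_metric IOU)."""
    inter = intersection_area(a, b)
    union = box_area(a) + box_area(b) - inter
    return inter / union if union else 0.0

def ios(a, b):
    """Intersection over the smaller box's area (--match_metric IOS)."""
    inter = intersection_area(a, b)
    smaller = min(box_area(a), box_area(b))
    return inter / smaller if smaller else 0.0
```

Because the smaller-box area never exceeds the union, IOS is always greater than or equal to IoU for the same pair, so the same --match_thresh merges more aggressively under IOS. This is useful when a small sliced detection sits entirely inside a larger full-image detection.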

Find detailed info on script usage (predict, coco2yolov5, coco_error_analysis) at SCRIPTS.md.

FiftyOne Utilities

Explore COCO dataset via FiftyOne app:

For the supported version: pip install "fiftyone>=0.11.1" (quoted so the shell does not treat >= as a redirect)

from sahi.utils.fiftyone import launch_fiftyone_app

# launch fiftyone app:
session = launch_fiftyone_app(coco_image_dir, coco_json_path)

# close fiftyone app:
session.close()
Convert predictions to FiftyOne detection:
from sahi import get_sliced_prediction

# perform sliced prediction
result = get_sliced_prediction(
    image,
    detection_model,
    slice_height=256,
    slice_width=256,
    overlap_height_ratio=0.2,
    overlap_width_ratio=0.2,
)

# convert detections into fiftyone detection format
fiftyone_detections = result.to_fiftyone_detections()

COCO Utilities

COCO dataset creation:
  • import required classes:
from sahi.utils.coco import Coco, CocoCategory, CocoImage, CocoAnnotation
  • init Coco object:
coco = Coco()
  • add categories starting from id 0:
coco.add_category(CocoCategory(id=0, name='human'))
coco.add_category(CocoCategory(id=1, name='vehicle'))
  • create a coco image:
coco_image = CocoImage(file_name="image1.jpg", height=1080, width=1920)
  • add annotations to coco image:
coco_image.add_annotation(
  CocoAnnotation(
    bbox=[x_min, y_min, width, height],
    category_id=0,
    category_name='human'
  )
)
coco_image.add_annotation(
  CocoAnnotation(
    bbox=[x_min, y_min, width, height],
    category_id=1,
    category_name='vehicle'
  )
)
  • add coco image to Coco object:
coco.add_image(coco_image)
  • after adding all images, convert coco object to coco json:
coco_json = coco.json
  • you can export it as json file:
from sahi.utils.file import save_json

save_json(coco_json, "coco_dataset.json")
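For reference, the objects assembled above serialize to a plain COCO-format dict like the one below. The field names follow the standard COCO layout; the bbox values are made up for illustration (COCO bboxes are [x_min, y_min, width, height]):

```python
import json

# A minimal COCO-format dict mirroring the steps above, with no sahi dependency.
coco_dict = {
    "images": [
        {"id": 1, "file_name": "image1.jpg", "height": 1080, "width": 1920},
    ],
    "categories": [
        {"id": 0, "name": "human"},
        {"id": 1, "name": "vehicle"},
    ],
    "annotations": [
        {"id": 1, "image_id": 1, "category_id": 0,
         "bbox": [50, 100, 80, 200], "area": 80 * 200, "iscrowd": 0},
        {"id": 2, "image_id": 1, "category_id": 1,
         "bbox": [400, 300, 300, 150], "area": 300 * 150, "iscrowd": 0},
    ],
}

coco_json_str = json.dumps(coco_dict, indent=2)  # what ends up in coco_dataset.json
```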
Convert COCO dataset to ultralytics/yolov5 format:
from sahi.utils.coco import Coco

# init Coco object
coco = Coco.from_coco_dict_or_path("coco.json", image_dir="coco_images/")

# export the converted YOLOv5-formatted dataset into output_dir with an 85% train / 15% val split
coco.export_as_yolov5(
  output_dir="output/folder/dir",
  train_split_rate=0.85
)
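Conceptually, train_split_rate behaves like a simple shuffled split of the image list. The sketch below is an assumption about that behavior for illustration, not sahi's exact code:

```python
import random

def train_val_split(images, train_split_rate=0.85, seed=0):
    """Shuffle a list of image records, then cut it into train/val subsets."""
    images = list(images)
    random.Random(seed).shuffle(images)  # seeded for reproducibility
    cut = int(len(images) * train_split_rate)
    return images[:cut], images[cut:]
```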
Get dataset stats:
from sahi.utils.coco import Coco

# init Coco object
coco = Coco.from_coco_dict_or_path("coco.json")

# get dataset stats
coco.stats
{
  'num_images': 6471,
  'num_annotations': 343204,
  'num_categories': 2,
  'num_negative_images': 0,
  'num_images_per_category': {'human': 5684, 'vehicle': 6323},
  'num_annotations_per_category': {'human': 106396, 'vehicle': 236808},
  'min_num_annotations_in_image': 1,
  'max_num_annotations_in_image': 902,
  'avg_num_annotations_in_image': 53.037243084530985,
  'min_annotation_area': 3,
  'max_annotation_area': 328640,
  'avg_annotation_area': 2448.405738278109,
  'min_annotation_area_per_category': {'human': 3, 'vehicle': 3},
  'max_annotation_area_per_category': {'human': 72670, 'vehicle': 328640},
}
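Several of these stats follow directly from counting entries in the raw COCO dict. A hedged sketch of a few of them (not sahi's implementation):

```python
def basic_coco_stats(coco_dict):
    """Recompute a few of the stats shown above from a raw COCO-format dict."""
    num_images = len(coco_dict["images"])
    num_annotations = len(coco_dict["annotations"])
    # negative images are images with no annotations at all
    annotated_ids = {ann["image_id"] for ann in coco_dict["annotations"]}
    return {
        "num_images": num_images,
        "num_annotations": num_annotations,
        "num_negative_images": num_images - len(annotated_ids),
        "avg_num_annotations_in_image": (
            num_annotations / num_images if num_images else 0.0
        ),
    }
```

Applied to the dataset above, 343204 annotations over 6471 images gives exactly the 53.037... average shown.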

Find detailed info on COCO utilities (yolov5 conversion, slicing, subsampling, filtering, merging, splitting) at COCO.md.

MOT Challenge Utilities

MOT Challenge formatted ground truth dataset creation:
  • import required classes:
from sahi.utils.mot import MotAnnotation, MotFrame, MotVideo
  • init video:
mot_video = MotVideo(name="sequence_name")
  • init first frame:
mot_frame = MotFrame()
  • add annotations to frame:
mot_frame.add_annotation(
  MotAnnotation(bbox=[x_min, y_min, width, height])
)

mot_frame.add_annotation(
  MotAnnotation(bbox=[x_min, y_min, width, height])
)
  • add frame to video:
mot_video.add_frame(mot_frame)
  • export in MOT challenge format:
mot_video.export(export_dir="mot_gt", type="gt")
  • your MOT challenge formatted ground truth files are ready under mot_gt/sequence_name/ folder.
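Under the hood, a MOT Challenge gt.txt stores one comma-separated row per annotation, laid out as frame, track id, bb_left, bb_top, bb_width, bb_height, conf, class, visibility. A minimal formatter for that row layout might look like the following sketch (this is illustrative, not sahi's exporter):

```python
def mot_gt_line(frame_id, track_id, bbox, conf=1, class_id=1, visibility=1.0):
    """Format one MOT Challenge gt.txt row.

    bbox is [x_min, y_min, width, height], matching the MotAnnotation bbox above.
    """
    x_min, y_min, width, height = bbox
    return f"{frame_id},{track_id},{x_min},{y_min},{width},{height},{conf},{class_id},{visibility}"
```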

Find detailed info on MOT utilities (ground truth dataset creation, exporting tracker metrics in mot challenge format) at MOT.md.

Contributing

The sahi library currently supports all YOLOv5 models and MMDetection models, and it is easy to add support for new frameworks.

All you need to do is create a new class in model.py that implements the DetectionModel class. You can take the MMDetection wrapper or the YOLOv5 wrapper as a reference.
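The wrapper pattern looks roughly like the sketch below. The class and method names here are hypothetical stand-ins, not sahi's actual DetectionModel interface; consult model.py for the real contract:

```python
class DetectionModelBase:
    """Hypothetical base class illustrating the framework-wrapper pattern."""

    def __init__(self, model_path, confidence_threshold=0.25):
        self.model_path = model_path
        self.confidence_threshold = confidence_threshold
        self.model = None
        self.load_model()

    def load_model(self):
        # subclasses load their framework-specific model here
        raise NotImplementedError

    def predict(self, image):
        # subclasses run inference and return a list of detection dicts
        raise NotImplementedError


class DummyDetectionModel(DetectionModelBase):
    """A stand-in framework wrapper returning fixed detections, filtered by confidence."""

    def load_model(self):
        # a real wrapper would load weights from self.model_path instead
        self.model = lambda image: [
            {"bbox": [0, 0, 10, 10], "score": 0.9, "category_id": 0},
            {"bbox": [5, 5, 20, 20], "score": 0.1, "category_id": 1},
        ]

    def predict(self, image):
        return [d for d in self.model(image) if d["score"] >= self.confidence_threshold]
```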

Before opening a PR:

  • Install required development packages:
pip install -U -e .[dev]
  • Reformat with black and isort:
black . --config pyproject.toml
isort .

Contributors

Project details


Release history

This version

0.7.3

Download files

Source Distribution: sahi-0.7.3.tar.gz (56.2 kB)

Built Distribution: sahi-0.7.3-py3-none-any.whl (59.2 kB)

File details

Details for the file sahi-0.7.3.tar.gz.

File metadata

  • Download URL: sahi-0.7.3.tar.gz
  • Size: 56.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.3 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.0 CPython/3.9.6

File hashes

Hashes for sahi-0.7.3.tar.gz:
  • SHA256: 9ca4a5f2fe0823d1808815f33944280cad3683c2b6e417bc8352e7f25f899814
  • MD5: 3d73b18337902265fee6042830674544
  • BLAKE2b-256: 7711f9f7f388af2e9ef26e31249638cb739c41293e32e9796e0e1a9bcc840084


File details

Details for the file sahi-0.7.3-py3-none-any.whl.

File metadata

  • Download URL: sahi-0.7.3-py3-none-any.whl
  • Size: 59.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.3 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.0 CPython/3.9.6

File hashes

Hashes for sahi-0.7.3-py3-none-any.whl:
  • SHA256: ec3394419d7255ea8443f462610bdb324d9457faad1322d05f5e2a70a030fb38
  • MD5: cf0021b0acc225382002a5f85fb5cc10
  • BLAKE2b-256: 777771fcd05a30558949c1c17c9277635923ba444097678388edddaa24ece505

