Skip to main content

A more realtime adaptation of Deep SORT

Project description

Deep SORT

Introduction

A more realtime adaptation of Deep SORT.

Adapted from the official repo of Simple Online and Realtime Tracking with a Deep Association Metric (Deep SORT)

See their paper for more technical information.

Dependencies

requirements.txt gives the default packages required (installs torch/torchvision to use the default mobilenet embedder), modify accordingly.

Main dependencies are:

  • Python3
  • NumPy, pip install numpy
  • SciPy, pip install scipy
  • cv2, pip install opencv-python
  • (optional) Embedder requires Pytorch & Torchvision or Tensorflow 2+
    • pip install torch torchvision
    • pip install tensorflow
  • (optional) To use CLIP embedder, pip install git+https://github.com/openai/CLIP.git

Install

  • from PyPI via pip3 install deep-sort-realtime
  • or, clone this repo & install deep-sort-realtime as a python package using pip or as an editable package if you like (-e flag)
cd deep_sort_realtime && pip3 install .
  • or, download .whl file in this repo's releases

Run

Example usage:

from deep_sort_realtime.deepsort_tracker import DeepSort
tracker = DeepSort(max_age=5)
bbs = object_detector.detect(frame) 
tracks = tracker.update_tracks(bbs, frame=frame) # bbs expected to be a list of detections, each in tuples of ( [left,top,w,h], confidence, detection_class )
for track in tracks:
    if not track.is_confirmed():
        continue
    track_id = track.track_id
    ltrb = track.to_ltrb()
  • To add project-specific logic into the Track class, you can make a subclass (of Track) and pass it in (override_track_class argument) when instantiating DeepSort.

  • Example with your own embedder/ReID model:

from deep_sort_realtime.deepsort_tracker import DeepSort
tracker = DeepSort(max_age=5)
bbs = object_detector.detect(frame) # your own object detection
object_chips = chipper(frame, bbs) # your own logic to crop frame based on bbox values
embeds = embedder(object_chips) # your own embedder to take in the cropped object chips, and output feature vectors
tracks = tracker.update_tracks(bbs, embeds=embeds) # bbs expected to be a list of detections, each in tuples of ( [left,top,w,h], confidence, detection_class ), also, no need to give frame as your chips has already been embedded
for track in tracks:
    if not track.is_confirmed():
        continue
    track_id = track.track_id
    ltrb = track.to_ltrb()

Getting bounding box of original detection

The original Track.to_* methods for retrieving bounding box values returns only the Kalman predicted values. However, in some applications, it is better to return the bb values of the original detections the track was associated to at the current round.

Here we added an orig argument to all the Track.to_* methods. If orig is flagged as True and this track is associated to a detection this update round, then the bounding box values returned by the method will be that associated to the original detection. Otherwise, it will still return the Kalman predicted values.

orig_strict argument in all the Track.to_* methods is only active when orig is True. Flagging orig_strict=True will mean it will output None when there's no original detection associated to this track at current frame, otherwise normally it will return Kalman predicted values.

Storing supplementary info of original detection

Supplementary info can be pass into the track from the detection. Detection class now has an others argument to store this and pass it to the associate track during update. Can be retrieved through Track.get_det_supplementary method. Can be passed in through others argument of DeepSort.update_tracks, expects to be a list with same length as raw_detections. Examples of when you will this includes passing in corresponding instance segmentation masks, to be consumed when iterating through the tracks output.

Polygon support

Other than horizontal bounding boxes, detections can now be given as polygons. We do not track polygon points per se, but merely convert the polygon to its bounding rectangle for tracking. That said, if embedding is enabled, the embedder works on the crop around the bounding rectangle, with area not covered by the polygon masked away.

When instantiating a DeepSort object (as in deepsort_tracker.py), polygon argument should be flagged to True. See DeepSort.update_tracks docstring for details on the polygon format. In polygon mode, the original polygon coordinates are passed to the associated track through the supplementary info.

Differences from original repo

  • Remove "academic style" offline processing style and implemented it to take in real-time detections and output accordingly.

  • Provides both options of using an in-built appearance feature embedder or to provide embeddings during update

  • Added pytorch mobilenetv2 as appearance embedder (tensorflow embedder is also available now too).

  • Added CLIP network from OpenAI as embedder (pytorch).

  • Skip nms completely in preprocessing detections if nms_max_overlap == 1.0 (which is the default), in the original repo, nms will still be done even if threshold is set to 1.0 (probably because it was not optimised for speed).

  • Now able to override the Track class with a custom Track class (that inherits from Track class) for custom track logic

  • Takes in today's date now, which provides date for track naming and facilities track id reset every day, preventing overflow and overly large track ids when system runs for a long time.

    from datetime import datetime
    today = datetime.now().date()
    
  • Now supports polygon detections. We do not track polygon points per se, but merely convert the polygon to its bounding rectangle for tracking. That said, if embedding is enabled, the embedder works on the crop around the bounding rectangle, with area not covered by the polygon masked away. Read more here.

  • The original Track.to_* methods for retrieving bounding box values returns only the Kalman predicted values. In some applications, it is better to return the bb values of the original detections the track was associated to at the current round. Added a orig argument which can be flagged True to get that. Read more here.

  • Added get_det_supplementary method to Track class, in order to pass detection related info through the track. Read more here.

  • Other minor adjustments/optimisation of code.

Highlevel overview of source files in deep_sort (from original repo)

In package deep_sort is the main tracking code:

  • detection.py: Detection base class.
  • kalman_filter.py: A Kalman filter implementation and concrete parametrization for image space filtering.
  • linear_assignment.py: This module contains code for min cost matching and the matching cascade.
  • iou_matching.py: This module contains the IOU matching metric.
  • nn_matching.py: A module for a nearest neighbor matching metric.
  • track.py: The track class contains single-target track data such as Kalman state, number of hits, misses, hit streak, associated feature vectors, etc.
  • tracker.py: This is the multi-target tracker class.

Test

python3 -m unittest

Appearance Embedding Network

Pytorch Embedder (default)

Default embedder is a pytorch MobilenetV2 (trained on Imagenet).

For convenience (I know it's not exactly best practice) & since the weights file is quite small, it is pushed in this github repo and will be installed to your Python environment when you install deep_sort_realtime.

CLIP

CLIP is added as another option of embedder due to its proven flexibility and generalisability. Download the CLIP model weights you want at deep_sort_realtime/embedder/weights/download_clip_wts.sh and store the weights at that directory as well, or you can provide your own CLIP weights through embedder_wts argument of the DeepSort object.

Tensorflow Embedder

Available now at deep_sort_realtime/embedder/embedder_tf.py, as alternative to (the default) pytorch embedder. Tested on Tensorflow 2.3.1. You need to make your own code change to use it.

The tf MobilenetV2 weights (pretrained on imagenet) are not available in this github repo (unlike the torch one). Download from this link or run download script. You may drop it into deep_sort_realtime/embedder/weights/ before pip installing.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deep-sort-realtime-1.3.tar.gz (8.4 MB view details)

Uploaded Source

Built Distribution

deep_sort_realtime-1.3-py3-none-any.whl (8.4 MB view details)

Uploaded Python 3

File details

Details for the file deep-sort-realtime-1.3.tar.gz.

File metadata

  • Download URL: deep-sort-realtime-1.3.tar.gz
  • Upload date:
  • Size: 8.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for deep-sort-realtime-1.3.tar.gz
Algorithm Hash digest
SHA256 a20a3424fc6ba0681bac393a33bd1ace9b04e0425e7502449ea2c0a948d7a0eb
MD5 bcd96d4d0000ee38022d3037358d46af
BLAKE2b-256 a53761d7fc143465be471abaaf4a2ae588701ef6f475bdaddb81661a7fa697dd

See more details on using hashes here.

File details

Details for the file deep_sort_realtime-1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for deep_sort_realtime-1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 847b21254ee4ba93a63f73d20c368d9fad988845fe64cff2d44a7c1ad4a5a809
MD5 88092362b2489e2bfdc35a42ab5b95dd
BLAKE2b-256 960410fded329d1cd266519888668d2b4de419e771ffa9fb3ac09eaf0d992c7e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page