Visia's fork of MMDetection - Object Detection Toolbox and Benchmark

Note: This is a fork of the MMDetection library, customized for internal use at Visia. The original project can be found at https://github.com/open-mmlab/mmdetection

Maintained by the Visia ML Engineering team

Introduction

MMDetection is an open-source object detection toolbox based on PyTorch. It is part of the OpenMMLab project.

The master branch works with PyTorch 1.5+.

Major features
  • Modular Design

    We decompose the detection framework into different components and one can easily construct a customized object detection framework by combining different modules.

  • Support for multiple frameworks out of the box

    The toolbox directly supports popular and contemporary detection frameworks, e.g. Faster R-CNN, Mask R-CNN, RetinaNet, etc.

  • High efficiency

    All basic bbox and mask operations run on GPUs. The training speed is faster than or comparable to other codebases, including Detectron2, maskrcnn-benchmark and SimpleDet.

  • State of the art

    The toolbox stems from the codebase developed by the MMDet team, who won the COCO Detection Challenge in 2018, and we keep pushing it forward.

Apart from MMDetection, we also released mmcv, a library for computer vision research that this toolbox depends on heavily.
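
To make the Modular Design feature above more concrete, here is a minimal sketch of how a detector could be assembled from interchangeable components that are selected by name in a config. This is an illustration of the general pattern only, not MMDetection's or visdet's actual API: `Registry`, `BACKBONES`, `HEADS`, `ToyBackbone`, `ToyHead` and `build_detector` are all hypothetical names, and the "features" are plain lists of numbers rather than tensors.

```python
# Illustrative only: a tiny component registry, NOT the toolbox's real API.
# Every name below is hypothetical and exists just for this sketch.
from typing import Any, Dict, List


class Registry:
    """Maps a string name to a class so components can be chosen in a config."""

    def __init__(self, name: str) -> None:
        self.name = name
        self._modules: Dict[str, type] = {}

    def register(self, cls: type) -> type:
        # Used as a class decorator; stores the class under its own name.
        self._modules[cls.__name__] = cls
        return cls

    def build(self, cfg: Dict[str, Any]) -> Any:
        # Copy first so the caller's config dict is not mutated.
        args = dict(cfg)
        cls = self._modules[args.pop("type")]
        return cls(**args)


BACKBONES = Registry("backbone")
HEADS = Registry("head")


@BACKBONES.register
class ToyBackbone:
    def __init__(self, depth: int) -> None:
        self.depth = depth

    def __call__(self, image: List[float]) -> List[float]:
        # Stand-in for feature extraction: scale every value by `depth`.
        return [x * self.depth for x in image]


@HEADS.register
class ToyHead:
    def __init__(self, threshold: float) -> None:
        self.threshold = threshold

    def __call__(self, features: List[float]) -> List[int]:
        # Stand-in for detection: keep indices whose feature passes the threshold.
        return [i for i, f in enumerate(features) if f >= self.threshold]


def build_detector(cfg: Dict[str, Dict[str, Any]]):
    """Combine independently built components into one callable pipeline."""
    backbone = BACKBONES.build(cfg["backbone"])
    head = HEADS.build(cfg["head"])
    return lambda image: head(backbone(image))


config = {
    "backbone": {"type": "ToyBackbone", "depth": 2},
    "head": {"type": "ToyHead", "threshold": 3.0},
}
detector = build_detector(config)
detections = detector([1.0, 2.0, 0.5])
print(detections)
```

Swapping a component then only requires registering another class and changing the `type` string in the config; the rest of the pipeline stays untouched.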

What's New

💎 Stable version

2.28.2 was released on 27/2/2023:

  • Fixed known documentation, configuration, and linking errors

Please refer to changelog.md for details and release history.

For compatibility changes between different versions of MMDetection, please refer to compatibility.md.

🌟 Preview of 3.x version

Highlight

We are excited to announce our latest work on real-time object recognition tasks, RTMDet, a family of fully convolutional single-stage detectors. RTMDet not only achieves the best parameter-accuracy trade-off on object detection from tiny to extra-large model sizes but also obtains new state-of-the-art performance on instance segmentation and rotated object detection tasks. Details can be found in the technical report. Pre-trained models are here.


Task                      Dataset  AP                                        FPS (TRT FP16 BS1 3090)
Object Detection          COCO     52.8                                      322
Instance Segmentation     COCO     44.6                                      188
Rotated Object Detection  DOTA     78.9 (single-scale) / 81.3 (multi-scale)  121
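
For readers who think in latency rather than throughput, the FPS column can be converted to an approximate per-image time. The sketch below assumes that, at batch size 1 (the "BS1" in the column header), latency is simply the reciprocal of FPS; it ignores any pipelining or pre/post-processing overhead that the reported numbers may or may not include. The helper name `latency_ms` is introduced here for illustration.

```python
def latency_ms(fps: float) -> float:
    """Approximate per-image latency in milliseconds for a given FPS at batch size 1."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


# FPS values copied from the table above (TensorRT, FP16, batch size 1, RTX 3090).
reported_fps = {
    "Object Detection": 322,
    "Instance Segmentation": 188,
    "Rotated Object Detection": 121,
}

latencies = {task: latency_ms(fps) for task, fps in reported_fps.items()}
for task, ms in latencies.items():
    print(f"{task}: about {ms:.2f} ms per image")
```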

A brand new version, MMDetection v3.0.0rc6, was released on 27/2/2023.

Find more new features in the 3.x branch. Issues and PRs are welcome!

Installation

Install visdet using uv:

uv pip install visdet

Or with pip:

pip install visdet

For detailed installation instructions including development setup, please refer to Installation.
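
After installing, it can be useful to confirm which version of the distribution ended up in the active environment. The sketch below uses only the standard library (`importlib.metadata`, Python 3.8+) and queries the distribution name `visdet` used in the install commands above. It deliberately does not import the package, since the import name is not stated here. `installed_version` is a helper name introduced for this example.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("visdet")
if version is None:
    print("visdet is not installed in this environment")
else:
    print(f"visdet {version} is installed")
```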

Getting Started

Please see get_started.md for the basic usage of MMDetection. We also provide a Colab tutorial, an instance segmentation Colab tutorial, and other tutorials.

Overview of Benchmark and Model Zoo

Results and models are available in the model zoo.

Architectures
Object Detection, Instance Segmentation, Panoptic Segmentation, Other

Other:
  • Contrastive Learning
  • Distillation
  • Receptive Field Search

Components
Backbones, Necks, Loss, Common

    Some other methods are also supported in projects using MMDetection.

    Alternatives

    There are several other object detection frameworks available. Popular alternatives include:

    Framework    Repository
    RT-DETR      lyuwenyu/RT-DETR
    D-FINE       Peterande/D-FINE
    Ultralytics  ultralytics/ultralytics
    Detrex       IDEA-Research/detrex
    Detectron2   facebookresearch/detectron2

    FAQ

    Please refer to FAQ for frequently asked questions.

    Contributing

    We appreciate all contributions to improve MMDetection. Ongoing projects can be found in our GitHub Projects, and community users are welcome to participate in them. Please refer to CONTRIBUTING.md for the contributing guidelines.

    Acknowledgement

    MMDetection is an open-source project with contributions from researchers and engineers at various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as the users who give valuable feedback. We hope that the toolbox and benchmark can serve the growing research community by providing a flexible toolkit for reimplementing existing methods and developing new detectors.

    Citation

    If you use this toolbox or benchmark in your research, please cite this project.

    @article{mmdetection,
      title   = {{MMDetection}: Open MMLab Detection Toolbox and Benchmark},
      author  = {Chen, Kai and Wang, Jiaqi and Pang, Jiangmiao and Cao, Yuhang and
                 Xiong, Yu and Li, Xiaoxiao and Sun, Shuyang and Feng, Wansen and
                 Liu, Ziwei and Xu, Jiarui and Zhang, Zheng and Cheng, Dazhi and
                 Zhu, Chenchen and Cheng, Tianheng and Zhao, Qijie and Li, Buyu and
                 Lu, Xin and Zhu, Rui and Wu, Yue and Dai, Jifeng and Wang, Jingdong
                 and Shi, Jianping and Ouyang, Wanli and Loy, Chen Change and Lin, Dahua},
      journal = {arXiv preprint arXiv:1906.07155},
      year    = {2019}
    }
    

    License

    This project is released under the Apache 2.0 license.

    Projects in OpenMMLab

    • MMEngine: OpenMMLab foundational library for training deep learning models.
    • MMCV: OpenMMLab foundational library for computer vision.
    • MMEval: A unified evaluation library for multiple machine learning libraries.
    • MIM: MIM installs OpenMMLab packages.
    • MMClassification: OpenMMLab image classification toolbox and benchmark.
    • MMDetection: OpenMMLab detection toolbox and benchmark.
    • MMDetection3D: OpenMMLab's next-generation platform for general 3D object detection.
    • MMRotate: OpenMMLab rotated object detection toolbox and benchmark.
    • MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
    • MMOCR: OpenMMLab text detection, recognition, and understanding toolbox.
    • MMPose: OpenMMLab pose estimation toolbox and benchmark.
    • MMHuman3D: OpenMMLab 3D human parametric model toolbox and benchmark.
    • MMSelfSup: OpenMMLab self-supervised learning toolbox and benchmark.
    • MMRazor: OpenMMLab model compression toolbox and benchmark.
    • MMFewShot: OpenMMLab fewshot learning toolbox and benchmark.
    • MMAction2: OpenMMLab's next-generation action understanding toolbox and benchmark.
    • MMTracking: OpenMMLab video perception toolbox and benchmark.
    • MMFlow: OpenMMLab optical flow toolbox and benchmark.
    • MMEditing: OpenMMLab image and video editing toolbox.
    • MMGeneration: OpenMMLab image and video generative models toolbox.
    • MMDeploy: OpenMMLab model deployment framework.

    Download files

    Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

    Source Distribution

    visdet-2.28.4.tar.gz (32.2 MB view details)

    Uploaded Source

    Built Distribution

    If you're not sure about the file name format, learn more about wheel file names.

    visdet-2.28.4-py3-none-any.whl (289.1 kB view details)

    Uploaded Python 3

    File details

    Details for the file visdet-2.28.4.tar.gz.

    File metadata

    • Download URL: visdet-2.28.4.tar.gz
    • Upload date:
    • Size: 32.2 MB
    • Tags: Source
    • Uploaded using Trusted Publishing? No
    • Uploaded via: twine/6.1.0 CPython/3.13.7

    File hashes

    Hashes for visdet-2.28.4.tar.gz
    Algorithm Hash digest
    SHA256 8ff7e7e0f453575704e2a67b06beef70de843772fff525452cfb837833112ff3
    MD5 3f955ca5a180fddef7e41102a3b8f000
    BLAKE2b-256 d117ba7fa0c01c6b20b51c7348ba62e9ff02268b8c99e864ff9c2cc467763cd0


    File details

    Details for the file visdet-2.28.4-py3-none-any.whl.

    File metadata

    • Download URL: visdet-2.28.4-py3-none-any.whl
    • Upload date:
    • Size: 289.1 kB
    • Tags: Python 3
    • Uploaded using Trusted Publishing? No
    • Uploaded via: twine/6.1.0 CPython/3.13.7

    File hashes

    Hashes for visdet-2.28.4-py3-none-any.whl
    Algorithm Hash digest
    SHA256 625f3ea44aeceb25c9c7427beb0c0f7a45010738573077308bde4be93ef0460e
    MD5 112a6b852b02fe36e8d3b9146927c9f9
    BLAKE2b-256 eac4a72bfa91fbf613557daa69b8e21a0c023e8205c09543b6963da6b7a392ea
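
The SHA256 digests listed above can be checked against a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib`; the helper names `sha256_of_file` and `matches_published_digest` are introduced here, and the local file path in the commented usage is an assumption about where the file was saved.

```python
import hashlib
from pathlib import Path
from typing import Union


def sha256_of_file(path: Union[str, Path], chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large archives are never fully in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path: Union[str, Path], expected_hex: str) -> bool:
    """Compare a file's SHA256 with a published digest, ignoring case and padding."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage (assumes the wheel was downloaded into the current directory):
# ok = matches_published_digest(
#     "visdet-2.28.4-py3-none-any.whl",
#     "625f3ea44aeceb25c9c7427beb0c0f7a45010738573077308bde4be93ef0460e",
# )
# print("digest matches" if ok else "digest mismatch, do not install")
```

pip can also enforce digests itself through its hash-checking mode (`--require-hashes` with a requirements file), which avoids a manual step.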

