OpenMMLab Pose Estimation Toolbox and Benchmark.
Introduction
MMPose is an open-source toolbox for pose estimation based on PyTorch. It is a part of the OpenMMLab project.
The master branch works with PyTorch 1.3+. Please note that MMPose will require PyTorch 1.5+ starting in Oct. 2021; compatibility with earlier versions of PyTorch will no longer be guaranteed.
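As a rough illustration of the version requirement above, the following sketch checks a PyTorch version string against the 1.5 minimum. The helper names `parse_version` and `meets_minimum` are hypothetical and not part of MMPose; the check compares integer tuples so that, for example, 1.10 is correctly treated as newer than 1.5.

```python
import re

# Minimum (major, minor) PyTorch version stated above for releases after Oct. 2021.
MIN_PYTORCH = (1, 5)


def parse_version(version: str) -> tuple:
    """Extract (major, minor) from strings such as '1.8.1+cu111'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version: str, minimum: tuple = MIN_PYTORCH) -> bool:
    """Return True if `version` is at least `minimum` (hypothetical helper)."""
    return parse_version(version) >= minimum
```

In an environment where PyTorch is installed, one could call `meets_minimum(torch.__version__)` after `import torch`.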
Major Features
- Support diverse tasks
  We support a wide range of mainstream pose analysis tasks from the current research community, including 2D multi-person human pose estimation, 2D hand pose estimation, 2D face landmark detection, 133-keypoint whole-body human pose estimation, 3D human mesh recovery, fashion landmark detection, and animal pose estimation. See demo.md for more information.
- Higher efficiency and higher accuracy
  MMPose implements multiple state-of-the-art (SOTA) deep learning models, covering both top-down and bottom-up approaches. It trains faster and reaches higher accuracy than other popular codebases, such as HRNet. See benchmark.md for more information.
- Support for various datasets
  The toolbox directly supports multiple popular and representative datasets, including COCO, AIC, MPII, MPII-TRB, and OCHuman. See data_preparation.md for more information.
- Well designed, tested and documented
  We decompose MMPose into separate components, so a customized pose estimation framework can be constructed by combining different modules. We provide detailed documentation and an API reference, as well as unit tests.
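The idea of building a framework by combining modules can be sketched with a toy registry. This is only an illustration of the general pattern under our own assumptions: `Registry`, `ToyBackbone`, `ToyHeatmapHead`, and `ToyPoseEstimator` are hypothetical names invented for this example and are not MMPose classes.

```python
class Registry:
    """Toy name-to-class registry (hypothetical, for illustration only)."""

    def __init__(self, name):
        self.name = name
        self._modules = {}

    def register(self, cls):
        # Used as a decorator: store the class under its own name.
        self._modules[cls.__name__] = cls
        return cls

    def build(self, cfg):
        # Copy the config so the caller's dict is not modified.
        cfg = dict(cfg)
        cls = self._modules[cfg.pop("type")]
        return cls(**cfg)


BACKBONES = Registry("backbone")
HEADS = Registry("head")


@BACKBONES.register
class ToyBackbone:
    def __init__(self, depth):
        self.depth = depth


@HEADS.register
class ToyHeatmapHead:
    def __init__(self, num_keypoints):
        self.num_keypoints = num_keypoints


class ToyPoseEstimator:
    """Combines one backbone and one head, each built from a config dict."""

    def __init__(self, backbone, head):
        self.backbone = BACKBONES.build(backbone)
        self.head = HEADS.build(head)


# Swapping a component only requires changing its config entry.
config = dict(
    backbone=dict(type="ToyBackbone", depth=50),
    head=dict(type="ToyHeatmapHead", num_keypoints=17),
)
model = ToyPoseEstimator(**config)
```

Keeping each component behind a config entry means a different backbone or head can be tried without touching the estimator code.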
Model Zoo
Supported algorithms:
- DeepPose (CVPR'2014)
- CPM (CVPR'2016)
- Hourglass (ECCV'2016)
- MSPN (ArXiv'2019)
- RSN (ECCV'2020)
- SimpleBaseline2D (ECCV'2018)
- HRNet (CVPR'2019)
- HRNetv2 (TPAMI'2019)
- LiteHRNet (CVPR'2021)
- SCNet (CVPR'2020)
- Associative Embedding (NeurIPS'2017)
- HigherHRNet (CVPR'2020)
- HMR (CVPR'2018)
- SimpleBaseline3D (ICCV'2017)
- InterNet (ECCV'2020)
- VideoPose3D (CVPR'2019)
- ViPNAS (CVPR'2021)
Supported techniques:
- Wingloss (CVPR'2018)
- DarkPose (CVPR'2020)
- UDP (CVPR'2020)
- FP16 (ArXiv'2017)
- Albumentations (Information'2020)
Supported datasets:
- COCO [homepage] (ECCV'2014)
- COCO-WholeBody [homepage] (ECCV'2020)
- MPII [homepage] (CVPR'2014)
- MPII-TRB [homepage] (ICCV'2019)
- AI Challenger [homepage] (ArXiv'2017)
- OCHuman [homepage] (CVPR'2019)
- CrowdPose [homepage] (CVPR'2019)
- PoseTrack18 [homepage] (CVPR'2018)
- MHP [homepage] (ACM MM'2018)
- sub-JHMDB [homepage] (ICCV'2013)
- Human3.6M [homepage] (TPAMI'2014)
- 300W [homepage] (IMAVIS'2016)
- WFLW [homepage] (CVPR'2018)
- AFLW [homepage] (ICCVW'2011)
- COFW [homepage] (ICCV'2013)
- OneHand10K [homepage] (TCSVT'2019)
- FreiHand [homepage] (ICCV'2019)
- RHD [homepage] (ICCV'2017)
- CMU Panoptic HandDB [homepage] (CVPR'2017)
- InterHand2.6M [homepage] (ECCV'2020)
- DeepFashion [homepage] (CVPR'2016)
- Animal-Pose [homepage] (ICCV'2019)
- Horse-10 [homepage] (WACV'2021)
- MacaquePose [homepage] (bioRxiv'2020)
- Vinegar Fly [homepage] (Nature Methods'2019)
- Desert Locust [homepage] (Elife'2019)
- Grévy’s Zebra [homepage] (Elife'2019)
- ATRW [homepage] (ACM MM'2020)
Supported backbones:
- AlexNet (NeurIPS'2012)
- VGG (ICLR'2015)
- ResNet (CVPR'2016)
- ResNetV1D (CVPR'2019)
- ResNeSt (ArXiv'2020)
- ResNext (CVPR'2017)
- SEResNet (CVPR'2018)
- ShufflenetV1 (CVPR'2018)
- ShufflenetV2 (ECCV'2018)
- MobilenetV2 (CVPR'2018)
Results and models are available in the README.md of each method's config directory. A summary can be found in the model zoo page. We will keep up with the latest progress of the community, and support more popular algorithms and frameworks.
If you have any feature requests, please feel free to leave a comment in Issues.
Benchmark
On the standard COCO keypoint detection benchmark, MMPose trains faster (lower seconds per iteration) and reaches higher accuracy (mAP) than the HRNet codebase for every configuration listed below.
Model | Input size | MMPose (s/iter) | HRNet (s/iter) | MMPose (mAP) | HRNet (mAP) |
---|---|---|---|---|---|
resnet_50 | 256x192 | 0.28 | 0.64 | 0.718 | 0.704 |
resnet_50 | 384x288 | 0.81 | 1.24 | 0.731 | 0.722 |
resnet_101 | 256x192 | 0.36 | 0.84 | 0.726 | 0.714 |
resnet_101 | 384x288 | 0.79 | 1.53 | 0.748 | 0.736 |
resnet_152 | 256x192 | 0.49 | 1.00 | 0.735 | 0.720 |
resnet_152 | 384x288 | 0.96 | 1.65 | 0.750 | 0.743 |
hrnet_w32 | 256x192 | 0.54 | 1.31 | 0.746 | 0.744 |
hrnet_w32 | 384x288 | 0.76 | 2.00 | 0.760 | 0.758 |
hrnet_w48 | 256x192 | 0.66 | 1.55 | 0.756 | 0.751 |
hrnet_w48 | 384x288 | 1.23 | 2.20 | 0.767 | 0.763 |
More details about the benchmark are available in benchmark.md.
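To make the comparison easier to read, the table can be reduced to a per-row speedup (HRNet s/iter divided by MMPose s/iter) and mAP difference. The sketch below copies the numbers from the table above; the `summarize` helper and its field names are our own and purely illustrative.

```python
# (model, input size, MMPose s/iter, HRNet s/iter, MMPose mAP, HRNet mAP)
BENCHMARK = [
    ("resnet_50", "256x192", 0.28, 0.64, 0.718, 0.704),
    ("resnet_50", "384x288", 0.81, 1.24, 0.731, 0.722),
    ("resnet_101", "256x192", 0.36, 0.84, 0.726, 0.714),
    ("resnet_101", "384x288", 0.79, 1.53, 0.748, 0.736),
    ("resnet_152", "256x192", 0.49, 1.00, 0.735, 0.720),
    ("resnet_152", "384x288", 0.96, 1.65, 0.750, 0.743),
    ("hrnet_w32", "256x192", 0.54, 1.31, 0.746, 0.744),
    ("hrnet_w32", "384x288", 0.76, 2.00, 0.760, 0.758),
    ("hrnet_w48", "256x192", 0.66, 1.55, 0.756, 0.751),
    ("hrnet_w48", "384x288", 1.23, 2.20, 0.767, 0.763),
]


def summarize(rows):
    """Return the training speedup and mAP gain of MMPose for each row."""
    summary = []
    for model, size, t_mmpose, t_hrnet, ap_mmpose, ap_hrnet in rows:
        summary.append({
            "model": model,
            "input_size": size,
            "speedup": t_hrnet / t_mmpose,    # > 1 means MMPose is faster
            "map_gain": ap_mmpose - ap_hrnet,  # > 0 means MMPose is more accurate
        })
    return summary


if __name__ == "__main__":
    for row in summarize(BENCHMARK):
        print(f"{row['model']:<11} {row['input_size']:<8} "
              f"speedup x{row['speedup']:.2f}  mAP {row['map_gain']:+.3f}")
```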
Installation
Please refer to install.md for installation.
Data Preparation
Please refer to data_preparation.md for a general overview of data preparation.
Get Started
Please see getting_started.md for the basic usage of MMPose. There are also tutorials:
- learn about configs
- finetune model
- add new dataset
- customize data pipelines
- add new modules
- export a model to ONNX
- customize runtime settings
FAQ
Please refer to the FAQ for frequently asked questions.
License
This project is released under the Apache 2.0 license.
Citation
If you find this project useful in your research, please consider citing:
@misc{mmpose2020,
title={OpenMMLab Pose Estimation Toolbox and Benchmark},
author={MMPose Contributors},
howpublished = {\url{https://github.com/open-mmlab/mmpose}},
year={2020}
}
Contributing
We appreciate all contributions to improve MMPose. Please refer to CONTRIBUTING.md for the contributing guidelines.
Acknowledgement
MMPose is an open-source project with contributions from researchers and engineers at various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as the users who give valuable feedback. We hope the toolbox and benchmark can serve the growing research community by providing a flexible toolkit for reimplementing existing methods and developing new models.
Projects in OpenMMLab
- MMCV: OpenMMLab foundational library for computer vision.
- MIM: MIM Installs OpenMMLab Packages.
- MMClassification: OpenMMLab image classification toolbox and benchmark.
- MMDetection: OpenMMLab detection toolbox and benchmark.
- MMDetection3D: OpenMMLab next-generation platform for general 3D object detection.
- MMSegmentation: OpenMMLab semantic segmentation toolbox and benchmark.
- MMAction2: OpenMMLab next-generation action understanding toolbox and benchmark.
- MMTracking: OpenMMLab video perception toolbox and benchmark.
- MMPose: OpenMMLab pose estimation toolbox and benchmark.
- MMEditing: OpenMMLab image and video editing toolbox.
- MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding.
- MMGeneration: OpenMMLab next-generation toolbox for generative models.
Download files
File details
Details for the file mmpose-0.18.0.tar.gz.
File metadata
- Download URL: mmpose-0.18.0.tar.gz
- Upload date:
- Size: 360.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.7.11
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | 1fee936b8daca5da106434cef5076c7bee2829c32f4976b012ec9336ef1b5e06 |
MD5 | a60c0364e0bb94b4cde8ddbe8d98df1a |
BLAKE2b-256 | ac91abd818342ecc702ce74fbe2a050e78ed48665b5e13709d8bcf580246a58c |
File details
Details for the file mmpose-0.18.0-py2.py3-none-any.whl.
File metadata
- Download URL: mmpose-0.18.0-py2.py3-none-any.whl
- Upload date:
- Size: 1.1 MB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.7.11
File hashes
Algorithm | Hash digest |
---|---|
SHA256 | c91d5216b0c535b9f9464b149164e8892bb83daef749d5f9a717ac32690ca446 |
MD5 | 764a0ace5ccd70a73dfb21d23646acbf |
BLAKE2b-256 | bee3220d32bdfb7995d40939a6f920decae3d300dac353b446d1c750cca646ba |
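A downloaded file can be checked against the SHA256 digests listed above using Python's standard `hashlib` module. The following is a minimal sketch; the helper names `sha256_of_file` and `matches_expected` are our own, and the local file paths are assumed to match the file names shown in the file details.

```python
import hashlib

# SHA256 digests as listed above for the 0.18.0 release files.
EXPECTED_SHA256 = {
    "mmpose-0.18.0.tar.gz":
        "1fee936b8daca5da106434cef5076c7bee2829c32f4976b012ec9336ef1b5e06",
    "mmpose-0.18.0-py2.py3-none-any.whl":
        "c91d5216b0c535b9f9464b149164e8892bb83daef749d5f9a717ac32690ca446",
}


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash a file in chunks so large downloads are not read into memory at once."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.lower()
```

For example, `matches_expected("mmpose-0.18.0.tar.gz", EXPECTED_SHA256["mmpose-0.18.0.tar.gz"])` should return `True` for an intact copy of the source distribution.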