Project description
Computer vision models on MXNet/Gluon

This is a collection of image classification, segmentation, detection, and pose estimation models. Many of them are pretrained on
ImageNet-1K, CIFAR-10/100,
SVHN, CUB-200-2011,
Pascal VOC2012, ADE20K,
Cityscapes, and COCO datasets and loaded automatically
during use. All pretrained models require the same ordinary normalization. Scripts for training/evaluating/converting
models are in the imgclsmob
repo.
List of implemented models
Installation
To use the models in your project, simply install the gluoncv2
package with mxnet
:
pip install gluoncv2 mxnet>=1.2.1
To enable different hardware supports such as GPUs, check out MXNet variants.
For example, you can install with CUDA-9.2 supported MXNet:
pip install gluoncv2 mxnet-cu92>=1.2.1
Usage
Example of using a pretrained ResNet-18 model:
from gluoncv2.model_provider import get_model as glcv2_get_model
import mxnet as mx
net = glcv2_get_model("resnet18", pretrained=True)
x = mx.nd.zeros((1, 3, 224, 224), ctx=mx.cpu())
y = net(x)
Pretrained models
ImageNet-1K
Some remarks:
- Top1/Top5 are the standard 1-crop Top-1/Top-5 errors (in percents) on the validation subset of the ImageNet-1K dataset.
- FLOPs/2 is the number of FLOPs divided by two to be similar to the number of MACs.
- ResNet/PreResNet with b-suffix is a version of the networks with the stride in the second convolution of the
bottleneck block. Respectively a network without b-suffix has the stride in the first convolution.
- ResNet/PreResNet models do not use biases in convolutions at all.
- CondenseNet models are only so-called converted versions.
- ShuffleNetV2 and ShuffleNetV2b are different implementations of the same architecture.
- ResNet(A) is an average downsampled ResNet intended for use as an feature extractor in some pose estimation networks.
- ResNet(D) is a dilated ResNet intended for use as an feature extractor in some segmentation networks.
- Models with *-suffix use non-standard preprocessing (see the training log).
Model |
Top1 |
Top5 |
Params |
FLOPs/2 |
Remarks |
AlexNet |
38.07 |
16.10 |
62,378,344 |
1,132.33M |
Training (log) |
AlexNet-b |
39.30 |
17.05 |
61,100,840 |
714.83M |
Training (log) |
ZFNet |
39.21 |
16.78 |
62,357,608 |
1,170.33M |
Training (log) |
ZFNet-b |
35.81 |
14.59 |
107,627,624 |
2,479.13M |
Training (log) |
VGG-11 |
29.59 |
10.16 |
132,863,336 |
7,615.87M |
Training (log) |
VGG-13 |
28.37 |
9.50 |
133,047,848 |
11,317.65M |
Training (log) |
VGG-16 |
26.61 |
8.32 |
138,357,544 |
15,480.10M |
Training (log) |
VGG-19 |
25.58 |
7.67 |
143,667,240 |
19,642.55M |
Training (log) |
BN-VGG-11 |
28.56 |
9.34 |
132,866,088 |
7,630.21M |
Training (log) |
BN-VGG-13 |
27.68 |
8.87 |
133,050,792 |
11,341.62M |
Training (log) |
BN-VGG-16 |
25.50 |
7.57 |
138,361,768 |
15,506.38M |
Training (log) |
BN-VGG-19 |
23.91 |
6.89 |
143,672,744 |
19,671.15M |
Training (log) |
BN-VGG-11b |
29.24 |
9.75 |
132,868,840 |
7,630.72M |
Training (log) |
BN-VGG-13b |
28.23 |
9.12 |
133,053,736 |
11,342.14M |
Training (log) |
BN-VGG-16b |
25.83 |
7.75 |
138,365,992 |
15,507.20M |
Training (log) |
BN-VGG-19b |
24.79 |
7.35 |
143,678,248 |
19,672.26M |
Training (log) |
BN-Inception |
25.12 |
7.54 |
11,295,240 |
2,048.06M |
Training (log) |
ResNet-10 |
32.54 |
12.53 |
5,418,792 |
894.04M |
Training (log) |
ResNet-12 |
31.68 |
12.03 |
5,492,776 |
1,126.25M |
Training (log) |
ResNet-14 |
30.38 |
10.86 |
5,788,200 |
1,357.94M |
Training (log) |
ResNet-BC-14b |
29.22 |
10.33 |
10,064,936 |
1,479.12M |
Training (log) |
ResNet-16 |
28.53 |
9.78 |
6,968,872 |
1,589.34M |
Training (log) |
ResNet-18 x0.25 |
39.31 |
17.40 |
3,937,400 |
270.94M |
Training (log) |
ResNet-18 x0.5 |
33.41 |
12.84 |
5,804,296 |
608.70M |
Training (log) |
ResNet-18 x0.75 |
30.00 |
10.66 |
8,476,056 |
1,129.45M |
Training (log) |
ResNet-18 |
26.79 |
8.67 |
11,689,512 |
1,820.41M |
Training (log) |
ResNet-26 |
25.96 |
8.23 |
17,960,232 |
2,746.79M |
Training (log) |
ResNet-BC-26b |
24.86 |
7.58 |
15,995,176 |
2,356.67M |
Training (log) |
ResNet-34 |
24.53 |
7.43 |
21,797,672 |
3,672.68M |
Training (log) |
ResNet-BC-38b |
23.50 |
6.72 |
21,925,416 |
3,234.21M |
Training (log) |
ResNet-50 |
22.15 |
6.04 |
25,557,032 |
3,877.95M |
Training (log) |
ResNet-50b |
22.06 |
6.11 |
25,557,032 |
4,110.48M |
Training (log) |
ResNet-101 |
20.52 |
5.16 |
44,549,160 |
7,597.95M |
Training (log) |
ResNet-101b |
20.26 |
5.12 |
44,549,160 |
7,830.48M |
Training (log) |
ResNet-152 |
19.20 |
4.44 |
60,192,808 |
11,321.85M |
Training (log) |
ResNet-152b |
18.84 |
4.29 |
60,192,808 |
11,554.38M |
Training (log) |
PreResNet-10 |
34.65 |
14.01 |
5,417,128 |
894.19M |
Training (log) |
PreResNet-12 |
33.57 |
13.21 |
5,491,112 |
1,126.40M |
Training (log) |
PreResNet-14 |
32.29 |
12.18 |
5,786,536 |
1,358.09M |
Training (log) |
PreResNet-BC-14b |
30.67 |
11.51 |
10,057,384 |
1,476.62M |
Training (log) |
PreResNet-16 |
30.21 |
10.81 |
6,967,208 |
1,589.49M |
Training (log) |
PreResNet-18 x0.25 |
39.62 |
17.78 |
3,935,960 |
270.93M |
Training (log) |
PreResNet-18 x0.5 |
33.67 |
13.19 |
5,802,440 |
608.73M |
Training (log) |
PreResNet-18 x0.75 |
29.96 |
10.68 |
8,473,784 |
1,129.51M |
Training (log) |
PreResNet-18 |
28.16 |
9.51 |
11,687,848 |
1,820.56M |
Training (log) |
PreResNet-26 |
26.03 |
8.34 |
17,958,568 |
2,746.94M |
Training (log) |
PreResNet-BC-26b |
25.21 |
7.86 |
15,987,624 |
2,354.16M |
Training (log) |
PreResNet-34 |
24.55 |
7.51 |
21,796,008 |
3,672.83M |
Training (log) |
PreResNet-BC-38b |
22.67 |
6.33 |
21,917,864 |
3,231.70M |
Training (log) |
PreResNet-50 |
22.27 |
6.20 |
25,549,480 |
3,875.44M |
Training (log) |
PreResNet-50b |
22.36 |
6.32 |
25,549,480 |
4,107.97M |
Training (log) |
PreResNet-101 |
20.60 |
5.34 |
44,541,608 |
7,595.44M |
Training (log) |
PreResNet-101b |
20.85 |
5.40 |
44,541,608 |
7,827.97M |
Training (log) |
PreResNet-152 |
19.17 |
4.46 |
60,185,256 |
11,319.34M |
Training (log) |
PreResNet-152b |
19.01 |
4.38 |
60,185,256 |
11,551.87M |
Training (log) |
PreResNet-200b |
18.96 |
4.46 |
64,666,280 |
15,068.63M |
Training (log) |
PreResNet-269b |
20.17 |
5.01 |
102,065,832 |
20,101.11M |
Training (log) |
ResNeXt-14 (16x4d) |
31.66 |
12.23 |
7,127,336 |
1,045.77M |
Training (log) |
ResNeXt-14 (32x2d) |
32.16 |
12.47 |
7,029,416 |
1,031.32M |
Training (log) |
ResNeXt-14 (32x4d) |
29.95 |
11.10 |
9,411,880 |
1,603.46M |
Training (log) |
ResNeXt-26 (32x2d) |
26.34 |
8.50 |
9,924,136 |
1,461.06M |
Training (log) |
ResNeXt-26 (32x4d) |
23.93 |
7.21 |
15,389,480 |
2,488.07M |
Training (log) |
ResNeXt-50 (32x4d) |
20.84 |
5.45 |
25,028,904 |
4,255.86M |
Training (log) |
ResNeXt-101 (32x4d) |
18.46 |
4.18 |
44,177,704 |
8,003.45M |
Training (log) |
ResNeXt-101 (64x4d) |
18.80 |
4.39 |
83,455,272 |
15,500.27M |
Training (log) |
SE-ResNet-10 |
31.36 |
11.69 |
5,463,332 |
894.08M |
Training (log) |
SE-ResNet-12 |
31.64 |
11.76 |
5,537,896 |
1,126.29M |
Training (log) |
SE-ResNet-14 |
30.34 |
10.95 |
5,835,504 |
1,357.99M |
Training (log) |
SE-ResNet-16 |
28.41 |
9.72 |
7,024,640 |
1,589.40M |
Training (log) |
SE-ResNet-18 |
27.95 |
9.20 |
11,778,592 |
1,820.51M |
Training (log) |
SE-ResNet-26 |
25.42 |
8.03 |
18,093,852 |
2,746.93M |
Training (log) |
SE-ResNet-BC-26b |
23.44 |
6.82 |
17,395,976 |
2,358.07M |
Training (log) |
SE-ResNet-BC-38b |
21.44 |
5.75 |
24,026,616 |
3,236.32M |
Training (log) |
SE-ResNet-50 |
21.07 |
5.60 |
28,088,024 |
3,880.49M |
Training (log) |
SE-ResNet-50b |
20.58 |
5.33 |
28,088,024 |
4,113.02M |
Training (log) |
SE-ResNet-101 |
19.00 |
4.41 |
49,326,872 |
7,602.76M |
Training (log) |
SE-ResNet-101b |
19.46 |
4.62 |
49,326,872 |
7,835.29M |
Training (log) |
SE-ResNet-152 |
18.59 |
4.30 |
66,821,848 |
11,328.52M |
Training (log) |
SE-PreResNet-10 |
32.37 |
12.21 |
5,461,668 |
894.23M |
Training (log) |
SE-PreResNet-12 |
31.58 |
11.80 |
5,536,232 |
1,126.44M |
Training (log) |
SE-PreResNet-16 |
28.39 |
9.56 |
7,022,976 |
1,589.55M |
Training (log) |
SE-PreResNet-18 |
27.16 |
8.81 |
11,776,928 |
1,820.66M |
Training (log) |
SE-PreResNet-26 |
25.95 |
8.04 |
18,092,188 |
2,747.08M |
Training (log) |
SE-PreResNet-BC-26b |
22.95 |
6.36 |
17,388,424 |
2,355.57M |
Training (log) |
SE-PreResNet-BC-38b |
21.42 |
5.63 |
24,019,064 |
3,233.81M |
Training (log) |
SE-PreResNet-50b |
20.67 |
5.32 |
28,080,472 |
4,110.51M |
Training (log) |
SE-ResNeXt-50 (32x4d) |
18.74 |
4.33 |
27,559,896 |
4,258.40M |
Training (log) |
SE-ResNeXt-101 (32x4d) |
19.06 |
4.44 |
48,955,416 |
8,008.26M |
Training (log) |
SE-ResNeXt-101 (64x4d) |
18.43 |
4.08 |
88,232,984 |
15,505.08M |
Training (log) |
SENet-16 |
25.34 |
8.06 |
31,366,168 |
5,080.55M |
Training (log) |
SENet-28 |
21.68 |
5.91 |
36,453,768 |
5,731.20M |
Training (log) |
SENet-154 |
18.84 |
4.40 |
115,088,984 |
20,745.78M |
Training (log) |
ResNeSt(A)-BC-14 |
22.27 |
6.34 |
10,611,688 |
2,766.86M |
Training (log) |
ResNeSt(A)-18 |
23.43 |
6.89 |
12,763,784 |
2,587.11M |
Training (log) |
ResNeSt(A)-BC-26 |
19.57 |
4.70 |
17,069,448 |
3,645.87M |
Training (log) |
ResNeSt(A)-50 |
18.92 |
4.38 |
27,483,240 |
5,402.09M |
Training (log) |
ResNeSt(A)-101 |
17.74 |
3.99 |
48,275,016 |
10,246.42M |
From dmlc/gluon-cv (log) |
ResNeSt(A)-152 |
18.72 |
4.51 |
65,316,040 |
13,974.33M |
Training (log) |
ResNeSt(A)-200 |
16.78 |
3.40 |
70,201,544 |
22,854.22M |
From dmlc/gluon-cv (log) |
ResNeSt(A)-269 |
16.38 |
3.36 |
110,929,480 |
46,005.88M |
From dmlc/gluon-cv (log) |
IBN-ResNet-50 |
21.46 |
5.59 |
25,557,032 |
4,110.48M |
Training (log) |
IBN-ResNet-101 |
19.69 |
4.89 |
44,549,160 |
7,830.48M |
Training (log) |
IBN(b)-ResNet-50 |
21.70 |
5.79 |
25,558,568 |
4,112.89M |
Training (log) |
IBN-ResNeXt-101 (32x4d) |
19.76 |
4.87 |
44,177,704 |
8,003.45M |
Training (log) |
IBN-DenseNet-121 |
23.33 |
6.46 |
7,978,856 |
2,872.13M |
Training (log) |
IBN-DenseNet-169 |
22.14 |
6.08 |
14,149,480 |
3,403.89M |
Training (log) |
AirNet50-1x64d (r=2) |
20.45 |
5.23 |
27,425,864 |
4,772.11M |
Training (log) |
AirNet50-1x64d (r=16) |
21.11 |
5.44 |
25,714,952 |
4,399.97M |
Training (log) |
AirNeXt50-32x4d (r=2) |
19.84 |
5.04 |
27,604,296 |
5,339.58M |
Training (log) |
BAM-ResNet-50 |
20.59 |
5.38 |
25,915,099 |
4,196.09M |
Training (log) |
CBAM-ResNet-50 |
19.94 |
4.88 |
28,089,624 |
4,116.97M |
Training (log) |
SCNet-50 |
20.53 |
5.11 |
25,564,584 |
3,951.01M |
Training (log) |
SCNet-101 |
19.21 |
4.46 |
44,565,416 |
7,204.19M |
Training (log) |
SCNet(A)-50 |
19.01 |
4.59 |
25,583,816 |
4,715.79M |
Training (log) |
RegNetX-200MF |
29.91 |
10.38 |
2,684,792 |
203.32M |
Training (log) |
RegNetX-400MF |
26.25 |
8.55 |
5,157,512 |
403.44M |
Training (log) |
RegNetX-600MF |
24.71 |
7.56 |
6,196,040 |
608.36M |
Training (log) |
RegNetX-800MF |
24.09 |
7.24 |
7,259,656 |
809.47M |
Training (log) |
RegNetX-1.6GF |
22.12 |
6.13 |
9,190,136 |
1,618.97M |
Training (log) |
RegNetX-3.2GF |
21.28 |
5.68 |
15,296,552 |
3,199.52M |
Training (log) |
RegNetX-4.0GF |
19.51 |
4.69 |
22,118,248 |
3,986.26M |
Training (log) |
RegNetX-6.4GF |
19.22 |
4.58 |
26,209,256 |
6,490.97M |
Training (log) |
RegNetX-8.0GF |
19.62 |
4.66 |
39,572,648 |
8,017.90M |
Training (log) |
RegNetX-12GF |
19.99 |
5.18 |
46,106,056 |
12,124.16M |
Training (log) |
RegNetX-16GF |
19.12 |
4.56 |
54,278,536 |
15,986.59M |
Training (log) |
RegNetX-32GF |
17.83 |
3.94 |
107,811,560 |
31,790.18M |
Training (log) |
RegNetY-200MF |
28.50 |
9.53 |
3,162,996 |
203.80M |
Training (log) |
RegNetY-400MF |
24.85 |
7.47 |
4,344,144 |
409.95M |
Training (log) |
RegNetY-600MF |
23.59 |
6.97 |
6,055,160 |
609.91M |
Training (log) |
RegNetY-800MF |
22.54 |
6.45 |
6,263,168 |
808.07M |
Training (log) |
RegNetY-1.6GF |
21.23 |
5.68 |
11,202,430 |
1,628.43M |
Training (log) |
RegNetY-3.2GF |
18.32 |
4.13 |
19,436,338 |
3,197.70M |
From rwightman/pyt...models (log) |
RegNetY-4.0GF |
19.56 |
4.67 |
20,646,656 |
3,997.63M |
Training (log) |
RegNetY-6.4GF |
18.95 |
4.45 |
30,583,252 |
6,386.79M |
Training (log) |
RegNetY-8.0GF |
18.78 |
4.36 |
39,180,068 |
7,994.33M |
Training (log) |
RegNetY-12GF |
18.51 |
4.31 |
51,822,544 |
12,129.89M |
Training (log) |
RegNetY-16GF |
18.63 |
4.30 |
83,590,140 |
15,941.65M |
Training (log) |
RegNetY-32GF |
17.83 |
3.73 |
145,046,770 |
32,313.76M |
Training (log) |
PyramidNet-101 (a=360) |
20.41 |
5.20 |
42,455,070 |
8,743.54M |
Training (log) |
DiracNetV2-18 |
30.61 |
11.17 |
11,511,784 |
1,796.62M |
From szagoruyko/diracnets (log) |
DiracNetV2-34 |
27.93 |
9.46 |
21,616,232 |
3,646.93M |
From szagoruyko/diracnets (log) |
CRU-Net-56 |
20.64 |
5.36 |
25,609,384 |
5,660.66M |
Training (log) |
DenseNet-121 |
23.25 |
6.85 |
7,978,856 |
2,872.13M |
Training (log) |
DenseNet-161 |
21.82 |
5.92 |
28,681,000 |
7,793.16M |
Training (log) |
DenseNet-169 |
22.10 |
6.05 |
14,149,480 |
3,403.89M |
Training (log) |
DenseNet-201 |
21.56 |
5.90 |
20,013,928 |
4,347.15M |
Training (log) |
CondenseNet-74 (C=G=4) |
26.82 |
8.64 |
4,773,944 |
546.06M |
From ShichenLiu/CondenseNet (log) |
CondenseNet-74 (C=G=8) |
29.76 |
10.49 |
2,935,416 |
291.52M |
From ShichenLiu/CondenseNet (log) |
PeleeNet |
29.38 |
9.79 |
2,802,248 |
514.87M |
Training (log) |
WRN-50-2 |
22.02 |
6.06 |
68,849,128 |
11,405.42M |
Training (log) |
DRN-C-26 |
24.32 |
7.11 |
21,126,584 |
16,993.90M |
Training (log) |
DRN-C-42 |
22.25 |
6.14 |
31,234,744 |
25,093.75M |
Training (log) |
DRN-C-58 |
20.47 |
5.15 |
40,542,008 |
32,489.94M |
Training (log) |
DRN-D-22 |
24.67 |
7.44 |
16,393,752 |
13,051.33M |
Training (log) |
DRN-D-38 |
22.83 |
6.24 |
26,501,912 |
21,151.19M |
Training (log) |
DRN-D-54 |
20.29 |
4.97 |
35,809,176 |
28,547.38M |
Training (log) |
DRN-D-105 |
21.31 |
5.81 |
54,801,304 |
43,442.43M |
From fyu/drn (log) |
DPN-68 |
22.87 |
6.58 |
12,611,602 |
2,351.84M |
Training (log) |
DPN-98 |
18.29 |
4.22 |
61,570,728 |
11,716.51M |
Training (log) |
DPN-131 |
19.42 |
4.77 |
79,254,504 |
16,076.15M |
Training (log) |
DarkNet Tiny |
40.31 |
17.46 |
1,042,104 |
500.85M |
Training (log) |
DarkNet Ref |
38.00 |
16.68 |
7,319,416 |
367.59M |
Training (log) |
DarkNet-53 |
21.27 |
5.50 |
41,609,928 |
7,133.86M |
Training (log) |
i-RevNet-301 |
26.97 |
8.97 |
125,120,356 |
14,453.87M |
From jhjacobsen/pytorch-i-revnet (log) |
BagNet-9 |
48.77 |
25.36 |
15,688,744 |
16,049.19M |
Training (log) |
BagNet-17 |
36.51 |
15.23 |
16,213,032 |
15,768.77M |
Training (log) |
BagNet-33 |
29.49 |
10.41 |
18,310,184 |
16,371.52M |
Training (log) |
DLA-34 |
24.31 |
7.05 |
15,742,104 |
3,071.37M |
Training (log) |
DLA-46-C |
33.84 |
12.86 |
1,301,400 |
585.45M |
Training (log) |
DLA-X-46-C |
32.96 |
12.25 |
1,068,440 |
546.72M |
Training (log) |
DLA-60 |
21.27 |
5.54 |
22,036,632 |
4,255.49M |
Training (log) |
DLA-X-60 |
20.70 |
5.53 |
17,352,344 |
3,543.68M |
Training (log) |
DLA-X-60-C |
30.67 |
10.74 |
1,319,832 |
596.06M |
Training (log) |
DLA-102 |
20.58 |
5.17 |
33,268,888 |
7,190.95M |
Training (log) |
DLA-X-102 |
19.59 |
4.70 |
26,309,272 |
5,884.94M |
Training (log) |
DLA-X2-102 |
18.66 |
4.23 |
41,282,200 |
9,340.61M |
Training (log) |
DLA-169 |
19.28 |
4.60 |
53,389,720 |
11,593.20M |
Training (log) |
FishNet-150 |
19.15 |
4.66 |
24,959,400 |
6,435.02M |
Training (log) |
ESPNetv2 x0.5 |
43.38 |
20.82 |
1,241,332 |
35.36M |
Training (log) |
ESPNetv2 x1.0 |
35.33 |
14.27 |
1,670,072 |
98.09M |
From sacmehta/ESPNetv2 (log) |
ESPNetv2 x1.25 |
33.14 |
12.73 |
1,965,440 |
138.18M |
From sacmehta/ESPNetv2 (log) |
ESPNetv2 x1.5 |
31.05 |
11.35 |
2,314,856 |
185.77M |
Training (log) |
ESPNetv2 x2.0 |
28.91 |
9.94 |
3,498,136 |
306.93M |
From sacmehta/ESPNetv2 (log) |
DiCENet x0.2 |
55.15 |
30.67 |
1,130,704 |
18.70M |
From sacmehta/EdgeNets (log) |
DiCENet x0.5 |
47.15 |
23.08 |
1,214,120 |
30.39M |
Training (log) |
DiCENet x0.75 |
38.25 |
16.47 |
1,495,676 |
55.64M |
From sacmehta/EdgeNets (log) |
DiCENet x1.0 |
35.02 |
14.11 |
1,805,604 |
81.96M |
Training (log) |
DiCENet x1.25 |
33.11 |
12.51 |
2,162,888 |
111.60M |
Training (log) |
DiCENet x1.5 |
31.00 |
11.44 |
2,652,200 |
151.48M |
Training (log) |
DiCENet x1.75 |
30.08 |
10.81 |
3,264,932 |
200.87M |
Training (log) |
DiCENet x2.0 |
29.93 |
10.58 |
3,979,044 |
257.49M |
From sacmehta/EdgeNets (log) |
HRNet-W18 Small V1 |
26.20 |
8.73 |
13,187,464 |
1,614.87M |
Training (log) |
HRNet-W18 Small V2 |
21.71 |
6.02 |
15,597,464 |
2,618.54M |
Training (log) |
HRNetV2-W18 |
20.15 |
5.00 |
21,299,004 |
4,322.66M |
Training (log) |
HRNetV2-W30 |
20.30 |
5.08 |
37,712,220 |
8,156.14M |
Training (log) |
HRNetV2-W32 |
19.94 |
4.96 |
41,232,680 |
8,973.31M |
Training (log) |
HRNetV2-W40 |
19.65 |
4.81 |
57,557,160 |
12,751.34M |
Training (log) |
HRNetV2-W44 |
19.67 |
4.86 |
67,064,984 |
14,945.95M |
Training (log) |
HRNetV2-W48 |
19.46 |
4.84 |
77,469,864 |
17,344.29M |
Training (log) |
HRNetV2-W64 |
19.50 |
4.78 |
128,059,944 |
28,974.95M |
Training (log) |
VoVNet-27-slim |
29.28 |
9.80 |
3,525,736 |
2,187.25M |
Training (log) |
VoVNet-39 |
21.54 |
5.48 |
22,600,296 |
7,086.16M |
Training (log) |
VoVNet-57 |
20.14 |
5.10 |
36,640,296 |
8,943.09M |
Training (log) |
SelecSLS-42b |
21.72 |
5.96 |
32,458,248 |
2,980.62M |
Training (log) |
SelecSLS-60 |
20.20 |
5.11 |
30,670,768 |
3,591.78M |
Training (log) |
SelecSLS-60b |
20.62 |
5.37 |
32,774,064 |
3,629.14M |
Training (log) |
HarDNet-39DS |
26.52 |
8.64 |
3,488,228 |
437.52M |
Training (log) |
HarDNet-68DS |
24.25 |
7.38 |
4,180,602 |
788.86M |
Training (log) |
HarDNet-68 |
24.03 |
7.02 |
17,565,348 |
4,256.32M |
Training (log) |
HarDNet-85 |
21.87 |
5.72 |
36,670,212 |
9,088.58M |
Training (log) |
SqueezeNet v1.0 |
38.73 |
17.34 |
1,248,424 |
823.67M |
Training (log) |
SqueezeNet v1.1 |
39.09 |
17.39 |
1,235,496 |
352.02M |
Training (log) |
SqueezeResNet v1.0 |
39.32 |
17.67 |
1,248,424 |
823.67M |
Training (log) |
SqueezeResNet v1.1 |
39.83 |
17.84 |
1,235,496 |
352.02M |
Training (log) |
1.0-SqNxt-23 |
42.25 |
18.66 |
724,056 |
287.28M |
Training (log) |
1.0-SqNxt-23v5 |
40.43 |
17.43 |
921,816 |
285.82M |
Training (log) |
1.5-SqNxt-23 |
34.46 |
13.21 |
1,511,824 |
552.39M |
Training (log) |
1.5-SqNxt-23v5 |
33.48 |
12.68 |
1,953,616 |
550.97M |
Training (log) |
2.0-SqNxt-23 |
30.24 |
10.63 |
2,583,752 |
898.48M |
Training (log) |
2.0-SqNxt-23v5 |
29.27 |
10.24 |
3,366,344 |
897.60M |
Training (log) |
ShuffleNet x0.25 (g=1) |
62.00 |
36.77 |
209,746 |
12.35M |
Training (log) |
ShuffleNet x0.25 (g=3) |
61.34 |
36.17 |
305,902 |
13.09M |
Training (log) |
ShuffleNet x0.5 (g=1) |
46.22 |
22.38 |
534,484 |
41.16M |
Training (log) |
ShuffleNet x0.5 (g=3) |
43.83 |
20.60 |
718,324 |
41.70M |
Training (log) |
ShuffleNet x0.75 (g=1) |
39.25 |
16.75 |
975,214 |
86.42M |
Training (log) |
ShuffleNet x0.75 (g=3) |
37.81 |
16.09 |
1,238,266 |
85.82M |
Training (log) |
ShuffleNet x1.0 (g=1) |
34.41 |
13.50 |
1,531,936 |
148.13M |
Training (log) |
ShuffleNet x1.0 (g=2) |
33.98 |
13.32 |
1,733,848 |
147.60M |
Training (log) |
ShuffleNet x1.0 (g=3) |
33.96 |
13.29 |
1,865,728 |
145.46M |
Training (log) |
ShuffleNet x1.0 (g=4) |
33.84 |
13.10 |
1,968,344 |
143.33M |
Training (log) |
ShuffleNet x1.0 (g=8) |
33.65 |
13.19 |
2,434,768 |
150.76M |
Training (log) |
ShuffleNetV2 x0.5 |
40.61 |
18.30 |
1,366,792 |
43.31M |
Training (log) |
ShuffleNetV2 x1.0 |
30.94 |
11.23 |
2,278,604 |
149.72M |
Training (log) |
ShuffleNetV2 x1.5 |
27.17 |
9.13 |
4,406,098 |
320.77M |
Training (log) |
ShuffleNetV2 x2.0 |
25.80 |
8.23 |
7,601,686 |
595.84M |
Training (log) |
ShuffleNetV2b x0.5 |
39.81 |
17.82 |
1,366,792 |
43.31M |
Training (log) |
ShuffleNetV2b x1.0 |
30.39 |
11.01 |
2,279,760 |
150.62M |
Training (log) |
ShuffleNetV2b x1.5 |
26.90 |
8.79 |
4,410,194 |
323.98M |
Training (log) |
ShuffleNetV2b x2.0 |
25.20 |
8.10 |
7,611,290 |
603.37M |
Training (log) |
108-MENet-8x1 (g=3) |
43.62 |
20.30 |
654,516 |
42.68M |
Training (log) |
128-MENet-8x1 (g=4) |
42.10 |
19.13 |
750,796 |
45.98M |
Training (log) |
128-MENet-8x1 (g=4) |
42.10 |
19.13 |
750,796 |
45.98M |
Training (log) |
160-MENet-8x1 (g=8) |
43.47 |
20.28 |
850,120 |
45.63M |
Training (log) |
256-MENet-12x1 (g=4) |
32.23 |
12.16 |
1,888,240 |
150.65M |
Training (log) |
348-MENet-12x1 (g=3) |
27.85 |
9.36 |
3,368,128 |
312.00M |
Training (log) |
352-MENet-12x1 (g=8) |
31.30 |
11.67 |
2,272,872 |
157.35M |
Training (log) |
456-MENet-24x1 (g=3) |
25.02 |
7.80 |
5,304,784 |
567.90M |
Training (log) |
MobileNet x0.25 |
45.78 |
22.18 |
470,072 |
44.09M |
Training (log) |
MobileNet x0.5 |
33.94 |
13.30 |
1,331,592 |
155.42M |
Training (log) |
MobileNet x0.75 |
29.85 |
10.51 |
2,585,560 |
333.99M |
Training (log) |
MobileNet x1.0 |
26.43 |
8.65 |
4,231,976 |
579.80M |
Training (log) |
MobileNetb x0.25 |
45.25 |
21.65 |
467,592 |
42.88M |
Training (log) |
MobileNetb x0.5 |
32.89 |
12.71 |
1,326,632 |
153.00M |
Training (log) |
MobileNetb x0.75 |
29.08 |
10.20 |
2,578,120 |
330.37M |
Training (log) |
MobileNetb x1.0 |
25.06 |
7.88 |
4,222,056 |
574.97M |
Training (log) |
FD-MobileNet x0.25 |
55.44 |
30.53 |
383,160 |
12.95M |
Training (log) |
FD-MobileNet x0.5 |
42.62 |
19.69 |
993,928 |
41.84M |
Training (log) |
FD-MobileNet x0.75 |
37.91 |
16.01 |
1,833,304 |
86.68M |
Training (log) |
FD-MobileNet x1.0 |
33.80 |
13.12 |
2,901,288 |
147.46M |
Training (log) |
MobileNetV2 x0.25 |
48.08 |
24.12 |
1,516,392 |
34.24M |
Training (log) |
MobileNetV2 x0.5 |
35.63 |
14.42 |
1,964,736 |
100.13M |
Training (log) |
MobileNetV2 x0.75 |
29.78 |
10.44 |
2,627,592 |
198.50M |
Training (log) |
MobileNetV2 x1.0 |
26.77 |
8.64 |
3,504,960 |
329.36M |
Training (log) |
MobileNetV2b x0.25 |
46.72 |
23.38 |
1,516,312 |
33.18M |
Training (log) |
MobileNetV2b x0.5 |
34.26 |
13.73 |
1,964,448 |
96.42M |
Training (log) |
MobileNetV2b x0.75 |
30.19 |
10.64 |
2,626,968 |
190.52M |
Training (log) |
MobileNetV2b x1.0 |
27.16 |
8.84 |
3,503,872 |
315.51M |
Training (log) |
MobileNetV3 L/224/1.0 |
24.36 |
7.29 |
5,481,752 |
226.80M |
Training (log) |
IGCV3 x0.25 |
53.43 |
28.30 |
1,534,020 |
41.29M |
Training (log) |
IGCV3 x0.5 |
39.41 |
17.03 |
1,985,528 |
111.12M |
Training (log) |
IGCV3 x0.75 |
30.71 |
10.96 |
2,638,084 |
210.95M |
Training (log) |
IGCV3 x1.0 |
27.73 |
9.00 |
3,491,688 |
340.79M |
Training (log) |
MnasNet-B1 |
24.67 |
7.23 |
4,383,312 |
326.30M |
Training (log) |
MnasNet-A1 |
24.04 |
7.05 |
3,887,038 |
325.77M |
Training (log) |
DARTS |
24.91 |
7.56 |
4,718,752 |
539.86M |
Training (log) |
ProxylessNAS CPU |
24.78 |
7.50 |
4,361,648 |
459.96M |
Training (log) |
ProxylessNAS GPU |
24.67 |
7.24 |
7,119,848 |
476.08M |
Training (log) |
ProxylessNAS Mobile |
25.31 |
7.80 |
4,080,512 |
332.46M |
Training (log) |
ProxylessNAS Mob-14 |
22.96 |
6.51 |
6,857,568 |
597.10M |
Training (log) |
FBNet-Cb |
24.86 |
7.61 |
5,572,200 |
399.26M |
Training (log) |
Xception |
20.43 |
5.11 |
22,855,952 |
8,403.63M |
Training (log) |
InceptionV3 |
20.51 |
5.33 |
23,834,568 |
5,743.06M |
Training (log) |
InceptionV4 |
19.83 |
4.88 |
42,679,816 |
12,304.93M |
Training (log) |
InceptionResNetV1 |
19.56 |
4.81 |
23,995,624 |
6,329.60M |
Training (log) |
InceptionResNetV2 |
19.48 |
4.70 |
55,843,464 |
13,188.64M |
Training (log) |
PolyNet |
19.09 |
4.53 |
95,366,600 |
34,821.34M |
From Cadene/pretrained...pytorch (log) |
NASNet-A 4@1056 |
25.16 |
7.90 |
5,289,978 |
584.90M |
Training (log) |
NASNet-A 6@4032 |
18.17 |
4.24 |
88,753,150 |
23,976.44M |
From Cadene/pretrained...pytorch (log) |
PNASNet-5-Large |
17.90 |
4.28 |
86,057,668 |
25,140.77M |
From Cadene/pretrained...pytorch (log) |
SPNASNet |
25.10 |
7.76 |
4,421,616 |
346.73M |
Training (log) |
EfficientNet-B0 |
24.50 |
7.22 |
5,288,548 |
413.13M |
Training (log) |
EfficientNet-B1 |
22.89 |
6.26 |
7,794,184 |
730.44M |
Training (log) |
EfficientNet-B0b |
22.96 |
6.70 |
5,288,548 |
413.13M |
From rwightman/pyt...models (log) |
EfficientNet-B1b |
20.98 |
5.65 |
7,794,184 |
730.44M |
From rwightman/pyt...models (log) |
EfficientNet-B2b |
19.94 |
5.16 |
9,109,994 |
1,049.29M |
From rwightman/pyt...models (log) |
EfficientNet-B3b |
18.60 |
4.31 |
12,233,232 |
1,923.98M |
From rwightman/pyt...models (log) |
EfficientNet-B4b |
17.25 |
3.76 |
19,341,616 |
4,597.56M |
From rwightman/pyt...models (log) |
EfficientNet-B5b |
16.39 |
3.34 |
30,389,784 |
10,674.67M |
From rwightman/pyt...models (log) |
EfficientNet-B6b |
15.96 |
3.12 |
43,040,704 |
19,761.35M |
From rwightman/pyt...models (log) |
EfficientNet-B7b |
15.70 |
3.11 |
66,347,960 |
38,949.07M |
From rwightman/pyt...models (log) |
EfficientNet-B0c* |
22.52 |
6.46 |
5,288,548 |
413.13M |
From rwightman/pyt...models (log) |
EfficientNet-B1c* |
20.50 |
5.55 |
7,794,184 |
730.44M |
From rwightman/pyt...models (log) |
EfficientNet-B2c* |
19.60 |
4.89 |
9,109,994 |
1,049.29M |
From rwightman/pyt...models (log) |
EfficientNet-B3c* |
18.19 |
4.34 |
12,233,232 |
1,923.98M |
From rwightman/pyt...models (log) |
EfficientNet-B4c* |
16.74 |
3.59 |
19,341,616 |
4,597.56M |
From rwightman/pyt...models (log) |
EfficientNet-B5c* |
15.79 |
3.02 |
30,389,784 |
10,674.67M |
From rwightman/pyt...models (log) |
EfficientNet-B6c* |
15.29 |
2.85 |
43,040,704 |
19,761.35M |
From rwightman/pyt...models (log) |
EfficientNet-B7c* |
14.87 |
2.77 |
66,347,960 |
38,949.07M |
From rwightman/pyt...models (log) |
EfficientNet-B8c* |
14.61 |
2.70 |
87,413,142 |
64,446.06M |
From rwightman/pyt...models (log) |
EfficientNet-Edge-Small-b* |
22.48 |
6.29 |
5,438,392 |
2,378.09M |
From rwightman/pyt...models (log) |
EfficientNet-Edge-Medium-b* |
21.08 |
5.53 |
6,899,496 |
3,700.08M |
From rwightman/pyt...models (log) |
EfficientNet-Edge-Large-b* |
19.50 |
4.77 |
10,589,712 |
9,747.58M |
From rwightman/pyt...models (log) |
MixNet-S |
23.83 |
7.03 |
4,134,606 |
260.26M |
Training (log) |
MixNet-M |
22.37 |
6.31 |
5,014,382 |
366.05M |
Training (log) |
MixNet-L |
21.48 |
5.57 |
7,329,252 |
590.45M |
Training (log) |
ResNet(A)-10 |
30.89 |
11.59 |
5,438,024 |
1,135.85M |
Training (log) |
ResNet(A)-BC-14 |
27.75 |
9.56 |
10,084,168 |
1,721.52M |
Training (log) |
ResNet(A)-18 |
25.38 |
8.02 |
11,708,744 |
2,062.22M |
Training (log) |
ResNet(A)-50b |
20.78 |
5.34 |
25,576,264 |
4,352.88M |
Training (log) |
ResNet(A)-101b |
18.98 |
4.42 |
44,568,392 |
8,072.88M |
Training (log) |
ResNet(A)-152b |
18.58 |
4.24 |
60,212,040 |
11,796.78M |
Training (log) |
ResNet(D)-50b |
20.79 |
5.49 |
25,680,808 |
20,496.80M |
From dmlc/gluon-cv (log) |
ResNet(D)-101b |
19.49 |
4.61 |
44,672,936 |
35,391.85M |
From dmlc/gluon-cv (log) |
ResNet(D)-152b |
19.39 |
4.67 |
60,316,584 |
47,661.38M |
From dmlc/gluon-cv (log) |
CIFAR-10
Some remarks:
- Testing subset is used for validation purpose.
Features
means feature extractor output size.
Model |
Error, % |
Features |
Params |
FLOPs/2 |
Remarks |
NIN |
7.43 |
192 |
966,986 |
222.97M |
Training (log) |
ResNet-20 |
5.97 |
64 |
272,474 |
41.29M |
Training (log) |
ResNet-56 |
4.52 |
64 |
855,770 |
127.06M |
Training (log) |
ResNet-110 |
3.69 |
64 |
1,730,714 |
255.70M |
Training (log) |
ResNet-164(BN) |
3.68 |
256 |
1,704,154 |
255.31M |
Training (log) |
ResNet-272(BN) |
3.33 |
256 |
2,816,986 |
420.61M |
Training (log) |
ResNet-542(BN) |
3.43 |
256 |
5,599,066 |
833.87M |
Training (log) |
ResNet-1001 |
3.28 |
256 |
10,328,602 |
1,536.40M |
Training (log) |
ResNet-1202 |
3.53 |
64 |
19,424,026 |
2,857.17M |
Training (log) |
PreResNet-20 |
6.51 |
64 |
272,282 |
41.27M |
Training (log) |
PreResNet-56 |
4.49 |
64 |
855,578 |
127.03M |
Training (log) |
PreResNet-110 |
3.86 |
64 |
1,730,522 |
255.68M |
Training (log) |
PreResNet-164(BN) |
3.64 |
256 |
1,703,258 |
255.08M |
Training (log) |
PreResNet-272(BN) |
3.25 |
256 |
2,816,090 |
420.38M |
Training (log) |
PreResNet-542(BN) |
3.14 |
256 |
5,598,170 |
833.64M |
Training (log) |
PreResNet-1001 |
2.65 |
256 |
10,327,706 |
1,536.18M |
Training (log) |
PreResNet-1202 |
3.39 |
64 |
19,423,834 |
2,857.14M |
Training (log) |
ResNeXt-20 (1x64d) |
4.33 |
1024 |
3,446,602 |
538.36M |
Training (log) |
ResNeXt-20 (2x32d) |
4.53 |
1024 |
2,672,458 |
425.15M |
Training (log) |
ResNeXt-20 (4x16d) |
4.70 |
1024 |
2,285,386 |
368.55M |
Training (log) |
ResNeXt-20 (8x8d) |
4.66 |
1024 |
2,091,850 |
340.25M |
Training (log) |
ResNeXt-20 (16x4d) |
4.04 |
1024 |
1,995,082 |
326.10M |
Training (log) |
ResNeXt-20 (32x2d) |
4.61 |
1024 |
1,946,698 |
319.03M |
Training (log) |
ResNeXt-20 (64x1d) |
4.93 |
1024 |
1,922,506 |
315.49M |
Training (log) |
ResNeXt-20 (2x64d) |
4.03 |
1024 |
6,198,602 |
987.98M |
Training (log) |
ResNeXt-20 (4x32d) |
3.73 |
1024 |
4,650,314 |
761.57M |
Training (log) |
ResNeXt-20 (8x16d) |
4.04 |
1024 |
3,876,170 |
648.37M |
Training (log) |
ResNeXt-20 (16x8d) |
3.94 |
1024 |
3,489,098 |
591.77M |
Training (log) |
ResNeXt-20 (32x4d) |
4.20 |
1024 |
3,295,562 |
563.47M |
Training (log) |
ResNeXt-20 (64x2d) |
4.38 |
1024 |
3,198,794 |
549.32M |
Training (log) |
ResNeXt-56 (1x64d) |
2.87 |
1024 |
9,317,194 |
1,399.33M |
Training (log) |
ResNeXt-56 (2x32d) |
3.01 |
1024 |
6,994,762 |
1,059.72M |
Training (log) |
ResNeXt-56 (4x16d) |
3.11 |
1024 |
5,833,546 |
889.91M |
Training (log) |
ResNeXt-56 (8x8d) |
3.07 |
1024 |
5,252,938 |
805.01M |
Training (log) |
ResNeXt-56 (16x4d) |
3.12 |
1024 |
4,962,634 |
762.56M |
Training (log) |
ResNeXt-56 (32x2d) |
3.14 |
1024 |
4,817,482 |
741.34M |
Training (log) |
ResNeXt-56 (64x1d) |
3.41 |
1024 |
4,744,906 |
730.72M |
Training (log) |
ResNeXt-29 (32x4d) |
3.15 |
1024 |
4,775,754 |
780.55M |
Training (log) |
ResNeXt-29 (16x64d) |
2.41 |
1024 |
68,155,210 |
10,709.34M |
Training (log) |
ResNeXt-272 (1x64d) |
2.55 |
1024 |
44,540,746 |
6,565.15M |
Training (log) |
ResNeXt-272 (2x32d) |
2.74 |
1024 |
32,928,586 |
4,867.11M |
Training (log) |
SE-ResNet-20 |
6.01 |
64 |
274,847 |
41.30M |
Training (log) |
SE-ResNet-56 |
4.13 |
64 |
862,889 |
127.07M |
Training (log) |
SE-ResNet-110 |
3.63 |
64 |
1,744,952 |
255.72M |
Training (log) |
SE-ResNet-164(BN) |
3.39 |
256 |
1,906,258 |
255.52M |
Training (log) |
SE-ResNet-272(BN) |
3.39 |
256 |
3,153,826 |
420.96M |
Training (log) |
SE-ResNet-542(BN) |
3.47 |
256 |
6,272,746 |
834.57M |
Training (log) |
SE-PreResNet-20 |
6.18 |
64 |
274,559 |
41.30M |
Training (log) |
SE-PreResNet-56 |
4.51 |
64 |
862,601 |
127.07M |
Training (log) |
SE-PreResNet-110 |
4.54 |
64 |
1,744,664 |
255.72M |
Training (log) |
SE-PreResNet-164(BN) |
3.73 |
256 |
1,904,882 |
255.29M |
Training (log) |
SE-PreResNet-272(BN) |
3.39 |
256 |
3,152,450 |
420.73M |
Training (log) |
SE-PreResNet-542(BN) |
3.08 |
256 |
6,271,370 |
834.34M |
Training (log) |
PyramidNet-110 (a=48) |
3.72 |
64 |
1,772,706 |
408.37M |
Training (log) |
PyramidNet-110 (a=84) |
2.98 |
100 |
3,904,446 |
778.15M |
Training (log) |
PyramidNet-110 (a=270) |
2.51 |
286 |
28,485,477 |
4,730.60M |
Training (log) |
PyramidNet-164 (a=270, BN) |
2.42 |
1144 |
27,216,021 |
4,608.81M |
Training (log) |
PyramidNet-200 (a=240, BN) |
2.44 |
1024 |
26,752,702 |
4,563.40M |
Training (log) |
PyramidNet-236 (a=220, BN) |
2.47 |
944 |
26,969,046 |
4,631.32M |
Training (log) |
PyramidNet-272 (a=200, BN) |
2.39 |
864 |
26,210,842 |
4,541.36M |
Training (log) |
DenseNet-40 (k=12) |
5.61 |
258 |
599,050 |
210.80M |
Training (log) |
DenseNet-BC-40 (k=12) |
6.43 |
132 |
176,122 |
74.89M |
Training (log) |
DenseNet-BC-40 (k=24) |
4.52 |
264 |
690,346 |
293.09M |
Training (log) |
DenseNet-BC-40 (k=36) |
4.04 |
396 |
1,542,682 |
654.60M |
Training (log) |
DenseNet-100 (k=12) |
3.66 |
678 |
4,068,490 |
1,353.55M |
Training (log) |
DenseNet-100 (k=24) |
3.13 |
1356 |
16,114,138 |
5,354.19M |
Training (log) |
DenseNet-BC-100 (k=12) |
4.16 |
342 |
769,162 |
298.45M |
Training (log) |
DenseNet-BC-190 (k=40) |
2.52 |
2190 |
25,624,430 |
9,400.45M |
Training (log) |
DenseNet-BC-250 (k=24) |
2.67 |
1734 |
15,324,406 |
5,519.54M |
Training (log) |
X-DenseNet-BC-40-2 (k=24) |
5.31 |
264 |
690,346 |
293.09M |
Training (log) |
X-DenseNet-BC-40-2 (k=36) |
4.37 |
396 |
1,542,682 |
654.60M |
Training (log) |
WRN-16-10 |
2.93 |
640 |
17,116,634 |
2,414.04M |
Training (log) |
WRN-28-10 |
2.39 |
640 |
36,479,194 |
5,246.98M |
Training (log) |
WRN-40-8 |
2.37 |
512 |
35,748,314 |
5,176.90M |
Training (log) |
WRN-20-10-1bit |
3.26 |
640 |
26,737,140 |
4,019.14M |
Training (log) |
WRN-20-10-32bit |
3.14 |
640 |
26,737,140 |
4,019.14M |
Training (log) |
RoR-3-56 |
5.43 |
64 |
762,746 |
113.43M |
Training (log) |
RoR-3-110 |
4.35 |
64 |
1,637,690 |
242.07M |
Training (log) |
RoR-3-164 |
3.93 |
64 |
2,512,634 |
370.72M |
Training (log) |
RiR |
3.28 |
384 |
9,492,980 |
1,281.08M |
Training (log) |
Shake-Shake-ResNet-20-2x16d |
5.15 |
64 |
541,082 |
81.78M |
Training (log) |
Shake-Shake-ResNet-26-2x32d |
3.17 |
64 |
2,923,162 |
428.89M |
Training (log) |
DIA-ResNet-20 |
6.22 |
64 |
286,866 |
41.34M |
Training (log) |
DIA-ResNet-56 |
5.05 |
64 |
870,162 |
127.18M |
Training (log) |
DIA-ResNet-110 |
4.10 |
64 |
1,745,106 |
255.94M |
Training (log) |
DIA-ResNet-164(BN) |
3.50 |
256 |
1,923,002 |
259.18M |
Training (log) |
DIA-PreResNet-20 |
6.42 |
64 |
286,674 |
41.31M |
Training (log) |
DIA-PreResNet-56 |
4.83 |
64 |
869,970 |
127.15M |
Training (log) |
DIA-PreResNet-110 |
4.25 |
64 |
1,744,914 |
255.92M |
Training (log) |
DIA-PreResNet-164(BN) |
3.56 |
256 |
1,922,106 |
258.95M |
Training (log) |
CIFAR-100
Some remarks:
- Testing subset is used for validation purpose.
Model |
Error, % |
Params |
FLOPs/2 |
Remarks |
NIN |
28.39 |
984,356 |
224.08M |
Training (log) |
ResNet-20 |
29.64 |
278,324 |
41.30M |
Training (log) |
ResNet-56 |
24.88 |
861,620 |
127.06M |
Training (log) |
ResNet-110 |
22.80 |
1,736,564 |
255.71M |
Training (log) |
ResNet-164(BN) |
20.44 |
1,727,284 |
255.33M |
Training (log) |
ResNet-272(BN) |
20.07 |
2,840,116 |
420.63M |
Training (log) |
ResNet-542(BN) |
19.32 |
5,622,196 |
833.89M |
Training (log) |
ResNet-1001 |
19.79 |
10,351,732 |
1,536.43M |
Training (log) |
ResNet-1202 |
21.56 |
19,429,876 |
2,857.17M |
Training (log) |
PreResNet-20 |
30.22 |
278,132 |
41.28M |
Training (log) |
PreResNet-56 |
25.05 |
861,428 |
127.04M |
Training (log) |
PreResNet-110 |
22.67 |
1,736,372 |
255.68M |
Training (log) |
PreResNet-164(BN) |
20.18 |
1,726,388 |
255.10M |
Training (log) |
PreResNet-272(BN) |
19.63 |
2,839,220 |
420.40M |
Training (log) |
PreResNet-542(BN) |
18.71 |
5,621,300 |
833.66M |
Training (log) |
PreResNet-1001 |
18.41 |
10,350,836 |
1,536.20M |
Training (log) |
ResNeXt-20 (1x64d) |
21.97 |
3,538,852 |
538.45M |
Training (log) |
ResNeXt-20 (2x32d) |
22.55 |
2,764,708 |
425.25M |
Training (log) |
ResNeXt-20 (4x16d) |
23.04 |
2,377,636 |
368.65M |
Training (log) |
ResNeXt-20 (8x8d) |
22.82 |
2,184,100 |
340.34M |
Training (log) |
ResNeXt-20 (16x4d) |
22.82 |
2,087,332 |
326.19M |
Training (log) |
ResNeXt-20 (32x2d) |
21.73 |
2,038,948 |
319.12M |
Training (log) |
ResNeXt-20 (64x1d) |
23.53 |
2,014,756 |
315.58M |
Training (log) |
ResNeXt-20 (2x64d) |
20.60 |
6,290,852 |
988.07M |
Training (log) |
ResNeXt-20 (4x32d) |
21.31 |
4,742,564 |
761.66M |
Training (log) |
ResNeXt-20 (8x16d) |
21.72 |
3,968,420 |
648.46M |
Training (log) |
ResNeXt-20 (16x8d) |
21.73 |
3,581,348 |
591.86M |
Training (log) |
ResNeXt-20 (32x4d) |
22.13 |
3,387,812 |
563.56M |
Training (log) |
ResNeXt-20 (64x2d) |
22.35 |
3,291,044 |
549.41M |
Training (log) |
ResNeXt-56 (1x64d) |
18.25 |
9,409,444 |
1,399.42M |
Training (log) |
ResNeXt-56 (2x32d) |
17.86 |
7,087,012 |
1,059.81M |
Training (log) |
ResNeXt-56 (4x16d) |
18.09 |
5,925,796 |
890.01M |
Training (log) |
ResNeXt-56 (8x8d) |
18.06 |
5,345,188 |
805.10M |
Training (log) |
ResNeXt-56 (16x4d) |
18.24 |
5,054,884 |
762.65M |
Training (log) |
ResNeXt-56 (32x2d) |
18.60 |
4,909,732 |
741.43M |
Training (log) |
ResNeXt-56 (64x1d) |
18.16 |
4,837,156 |
730.81M |
Training (log) |
ResNeXt-29 (32x4d) |
19.50 |
4,868,004 |
780.64M |
Training (log) |
ResNeXt-29 (16x64d) |
16.93 |
68,247,460 |
10,709.43M |
Training (log) |
ResNeXt-272 (1x64d) |
19.11 |
44,632,996 |
6,565.25M |
Training (log) |
ResNeXt-272 (2x32d) |
18.34 |
33,020,836 |
4,867.20M |
Training (log) |
SE-ResNet-20 |
28.54 |
280,697 |
41.30M |
Training (log) |
SE-ResNet-56 |
22.94 |
868,739 |
127.07M |
Training (log) |
SE-ResNet-110 |
20.86 |
1,750,802 |
255.72M |
Training (log) |
SE-ResNet-164(BN) |
19.95 |
1,929,388 |
255.54M |
Training (log) |
SE-ResNet-272(BN) |
19.07 |
3,176,956 |
420.98M |
Training (log) |
SE-ResNet-542(BN) |
18.87 |
6,295,876 |
834.59M |
Training (log) |
SE-PreResNet-20 |
28.31 |
280,409 |
41.31M |
Training (log) |
SE-PreResNet-56 |
23.05 |
868,451 |
127.08M |
Training (log) |
SE-PreResNet-110 |
22.61 |
1,750,514 |
255.73M |
Training (log) |
SE-PreResNet-164(BN) |
20.05 |
1,928,012 |
255.31M |
Training (log) |
SE-PreResNet-272(BN) |
19.13 |
3,175,580 |
420.75M |
Training (log) |
SE-PreResNet-542(BN) |
19.45 |
6,294,500 |
834.36M |
Training (log) |
PyramidNet-110 (a=48) |
20.95 |
1,778,556 |
408.38M |
Training (log) |
PyramidNet-110 (a=84) |
18.87 |
3,913,536 |
778.16M |
Training (log) |
PyramidNet-110 (a=270) |
17.10 |
28,511,307 |
4,730.62M |
Training (log) |
PyramidNet-164 (a=270, BN) |
16.70 |
27,319,071 |
4,608.91M |
Training (log) |
PyramidNet-200 (a=240, BN) |
16.09 |
26,844,952 |
4,563.49M |
Training (log) |
PyramidNet-236 (a=220, BN) |
16.34 |
27,054,096 |
4,631.41M |
Training (log) |
PyramidNet-272 (a=200, BN) |
16.19 |
26,288,692 |
4,541.43M |
Training (log) |
DenseNet-40 (k=12) |
24.90 |
622,360 |
210.82M |
Training (log) |
DenseNet-BC-40 (k=12) |
28.41 |
188,092 |
74.90M |
Training (log) |
DenseNet-BC-40 (k=24) |
22.67 |
714,196 |
293.11M |
Training (log) |
DenseNet-BC-40 (k=36) |
20.50 |
1,578,412 |
654.64M |
Training (log) |
DenseNet-100 (k=12) |
19.64 |
4,129,600 |
1,353.62M |
Training (log) |
DenseNet-100 (k=24) |
18.08 |
16,236,268 |
5,354.32M |
Training (log) |
DenseNet-BC-100 (k=12) |
21.19 |
800,032 |
298.48M |
Training (log) |
DenseNet-BC-250 (k=24) |
17.39 |
15,480,556 |
5,519.69M |
Training (log) |
X-DenseNet-BC-40-2 (k=24) |
23.96 |
714,196 |
293.11M |
Training (log) |
X-DenseNet-BC-40-2 (k=36) |
21.65 |
1,578,412 |
654.64M |
Training (log) |
WRN-16-10 |
18.95 |
17,174,324 |
2,414.09M |
Training (log) |
WRN-28-10 |
17.88 |
36,536,884 |
5,247.04M |
Training (log) |
WRN-40-8 |
18.03 |
35,794,484 |
5,176.95M |
Training (log) |
WRN-20-10-1bit |
19.04 |
26,794,920 |
4,022.81M |
Training (log) |
WRN-20-10-32bit |
18.12 |
26,794,920 |
4,022.81M |
Training (log) |
RoR-3-56 |
25.49 |
768,596 |
113.43M |
Training (log) |
RoR-3-110 |
23.64 |
1,643,540 |
242.08M |
Training (log) |
RoR-3-164 |
22.34 |
2,518,484 |
370.72M |
Training (log) |
RiR |
19.23 |
9,527,720 |
1,283.29M |
Training (log) |
Shake-Shake-ResNet-20-2x16d |
29.22 |
546,932 |
81.79M |
Training (log) |
Shake-Shake-ResNet-26-2x32d |
18.80 |
2,934,772 |
428.90M |
Training (log) |
DIA-ResNet-20 |
27.71 |
292,716 |
41.34M |
Training (log) |
DIA-ResNet-56 |
24.35 |
876,012 |
127.18M |
Training (log) |
DIA-ResNet-110 |
22.11 |
1,750,956 |
255.95M |
Training (log) |
DIA-ResNet-164(BN) |
19.53 |
1,946,132 |
259.20M |
Training (log) |
DIA-PreResNet-20 |
28.37 |
292,524 |
41.32M |
Training (log) |
DIA-PreResNet-56 |
25.05 |
875,820 |
127.16M |
Training (log) |
DIA-PreResNet-110 |
22.69 |
1,750,764 |
255.92M |
Training (log) |
DIA-PreResNet-164(BN) |
19.99 |
1,945,236 |
258.97M |
Training (log) |
SVHN
Model |
Error, % |
Params |
FLOPs/2 |
Remarks |
NIN |
3.76 |
966,986 |
222.97M |
Training (log) |
ResNet-20 |
3.43 |
272,474 |
41.29M |
Training (log) |
ResNet-56 |
2.75 |
855,770 |
127.06M |
Training (log) |
ResNet-110 |
2.45 |
1,730,714 |
255.70M |
Training (log) |
ResNet-164(BN) |
2.42 |
1,704,154 |
255.31M |
Training (log) |
ResNet-272(BN) |
2.43 |
2,816,986 |
420.61M |
Training (log) |
ResNet-542(BN) |
2.34 |
5,599,066 |
833.87M |
Training (log) |
ResNet-1001 |
2.41 |
10,328,602 |
1,536.40M |
Training (log) |
PreResNet-20 |
3.22 |
272,282 |
41.27M |
Training (log) |
PreResNet-56 |
2.80 |
855,578 |
127.03M |
Training (log) |
PreResNet-110 |
2.79 |
1,730,522 |
255.68M |
Training (log) |
PreResNet-164(BN) |
2.58 |
1,703,258 |
255.08M |
Training (log) |
PreResNet-272(BN) |
2.34 |
2,816,090 |
420.38M |
Training (log) |
PreResNet-542(BN) |
2.36 |
5,598,170 |
833.64M |
Training (log) |
ResNeXt-20 (1x64d) |
2.98 |
3,446,602 |
538.36M |
Training (log) |
ResNeXt-20 (2x32d) |
2.96 |
2,672,458 |
425.15M |
Training (log) |
ResNeXt-20 (4x16d) |
3.17 |
2,285,386 |
368.55M |
Training (log) |
ResNeXt-20 (8x8d) |
3.18 |
2,091,850 |
340.25M |
Training (log) |
ResNeXt-20 (16x4d) |
3.21 |
1,995,082 |
326.10M |
Training (log) |
ResNeXt-20 (32x2d) |
3.27 |
1,946,698 |
319.03M |
Training (log) |
ResNeXt-20 (64x1d) |
3.42 |
1,922,506 |
315.49M |
Training (log) |
ResNeXt-20 (2x64d) |
2.83 |
6,198,602 |
987.98M |
Training (log) |
ResNeXt-20 (4x32d) |
2.98 |
4,650,314 |
761.57M |
Training (log) |
ResNeXt-20 (8x16d) |
3.01 |
3,876,170 |
648.37M |
Training (log) |
ResNeXt-20 (16x8d) |
2.93 |
3,489,098 |
591.77M |
Training (log) |
ResNeXt-20 (32x4d) |
3.09 |
3,295,562 |
563.47M |
Training (log) |
ResNeXt-20 (64x2d) |
3.14 |
3,198,794 |
549.32M |
Training (log) |
ResNeXt-56 (1x64d) |
2.42 |
9,317,194 |
1,399.33M |
Training (log) |
ResNeXt-56 (2x32d) |
2.46 |
6,994,762 |
1,059.72M |
Training (log) |
ResNeXt-56 (4x16d) |
2.44 |
5,833,546 |
889.91M |
Training (log) |
ResNeXt-56 (8x8d) |
2.47 |
5,252,938 |
805.01M |
Training (log) |
ResNeXt-56 (16x4d) |
2.56 |
4,962,634 |
762.56M |
Training (log) |
ResNeXt-56 (32x2d) |
2.53 |
4,817,482 |
741.34M |
Training (log) |
ResNeXt-56 (64x1d) |
2.55 |
4,744,906 |
730.72M |
Training (log) |
ResNeXt-29 (32x4d) |
2.80 |
4,775,754 |
780.55M |
Training (log) |
ResNeXt-29 (16x64d) |
2.68 |
68,155,210 |
10,709.34M |
Training (log) |
ResNeXt-272 (1x64d) |
2.35 |
44,540,746 |
6,565.15M |
Training (log) |
ResNeXt-272 (2x32d) |
2.44 |
32,928,586 |
4,867.11M |
Training (log) |
SE-ResNet-20 |
3.23 |
274,847 |
41.30M |
Training (log) |
SE-ResNet-56 |
2.64 |
862,889 |
127.07M |
Training (log) |
SE-ResNet-110 |
2.35 |
1,744,952 |
255.72M |
Training (log) |
SE-ResNet-164(BN) |
2.45 |
1,906,258 |
255.52M |
Training (log) |
SE-ResNet-272(BN) |
2.38 |
3,153,826 |
420.96M |
Training (log) |
SE-ResNet-542(BN) |
2.26 |
6,272,746 |
834.57M |
Training (log) |
SE-PreResNet-20 |
3.24 |
274,559 |
41.30M |
Training (log) |
SE-PreResNet-56 |
2.71 |
862,601 |
127.07M |
Training (log) |
SE-PreResNet-110 |
2.59 |
1,744,664 |
255.72M |
Training (log) |
SE-PreResNet-164(BN) |
2.56 |
1,904,882 |
255.29M |
Training (log) |
SE-PreResNet-272(BN) |
2.49 |
3,152,450 |
420.73M |
Training (log) |
SE-PreResNet-542(BN) |
2.47 |
6,271,370 |
834.34M |
Training (log) |
PyramidNet-110 (a=48) |
2.47 |
1,772,706 |
408.37M |
Training (log) |
PyramidNet-110 (a=84) |
2.43 |
3,904,446 |
778.15M |
Training (log) |
PyramidNet-110 (a=270) |
2.38 |
28,485,477 |
4,730.60M |
Training (log) |
PyramidNet-164 (a=270, BN) |
2.33 |
27,216,021 |
4,608.81M |
Training (log) |
PyramidNet-200 (a=240, BN) |
2.32 |
26,752,702 |
4,563.40M |
Training (log) |
PyramidNet-236 (a=220, BN) |
2.35 |
26,969,046 |
4,631.32M |
Training (log) |
PyramidNet-272 (a=200, BN) |
2.40 |
26,210,842 |
4,541.36M |
Training (log) |
DenseNet-40 (k=12) |
3.05 |
599,050 |
210.80M |
Training (log) |
DenseNet-BC-40 (k=12) |
3.20 |
176,122 |
74.89M |
Training (log) |
DenseNet-BC-40 (k=24) |
2.90 |
690,346 |
293.09M |
Training (log) |
DenseNet-BC-40 (k=36) |
2.60 |
1,542,682 |
654.60M |
Training (log) |
DenseNet-100 (k=12) |
2.60 |
4,068,490 |
1,353.55M |
Training (log) |
X-DenseNet-BC-40-2 (k=24) |
2.87 |
690,346 |
293.09M |
Training (log) |
X-DenseNet-BC-40-2 (k=36) |
2.74 |
1,542,682 |
654.60M |
Training (log) |
WRN-16-10 |
2.78 |
17,116,634 |
2,414.04M |
Training (log) |
WRN-28-10 |
2.71 |
36,479,194 |
5,246.98M |
Training (log) |
WRN-40-8 |
2.54 |
35,748,314 |
5,176.90M |
Training (log) |
WRN-20-10-1bit |
2.73 |
26,737,140 |
4,019.14M |
Training (log) |
WRN-20-10-32bit |
2.59 |
26,737,140 |
4,019.14M |
Training (log) |
RoR-3-56 |
2.69 |
762,746 |
113.43M |
Training (log) |
RoR-3-110 |
2.57 |
1,637,690 |
242.07M |
Training (log) |
RoR-3-164 |
2.73 |
2,512,634 |
370.72M |
Training (log) |
RiR |
2.68 |
9,492,980 |
1,281.08M |
Training (log) |
Shake-Shake-ResNet-20-2x16d |
3.17 |
541,082 |
81.78M |
Training (log) |
Shake-Shake-ResNet-26-2x32d |
2.62 |
2,923,162 |
428.89M |
Training (log) |
DIA-ResNet-20 |
3.23 |
286,866 |
41.34M |
Training (log) |
DIA-ResNet-56 |
2.68 |
870,162 |
127.18M |
Training (log) |
DIA-ResNet-110 |
2.47 |
1,745,106 |
255.94M |
Training (log) |
DIA-ResNet-164(BN) |
2.44 |
1,923,002 |
259.18M |
Training (log) |
DIA-PreResNet-20 |
3.03 |
286,674 |
41.31M |
Training (log) |
DIA-PreResNet-56 |
2.80 |
869,970 |
127.15M |
Training (log) |
DIA-PreResNet-110 |
2.42 |
1,744,914 |
255.92M |
Training (log) |
DIA-PreResNet-164(BN) |
2.56 |
1,922,106 |
258.95M |
Training (log) |
CUB-200-2011
Model |
Error, % |
Params |
FLOPs/2 |
Remarks |
ResNet-10 |
27.65 |
5,008,392 |
893.63M |
Training (log) |
ResNet-12 |
26.58 |
5,082,376 |
1,125.84M |
Training (log) |
ResNet-14 |
24.35 |
5,377,800 |
1,357.53M |
Training (log) |
ResNet-16 |
23.21 |
6,558,472 |
1,588.93M |
Training (log) |
ResNet-18 |
23.30 |
11,279,112 |
1,820.00M |
Training (log) |
ResNet-26 |
22.52 |
17,549,832 |
2,746.38M |
Training (log) |
SE-ResNet-10 |
27.39 |
5,052,932 |
893.67M |
Training (log) |
SE-ResNet-12 |
26.04 |
5,127,496 |
1,125.88M |
Training (log) |
SE-ResNet-14 |
23.63 |
5,425,104 |
1,357.58M |
Training (log) |
SE-ResNet-16 |
23.21 |
6,614,240 |
1,588.99M |
Training (log) |
SE-ResNet-18 |
23.08 |
11,368,192 |
1,820.10M |
Training (log) |
SE-ResNet-26 |
22.51 |
17,683,452 |
2,746.52M |
Training (log) |
MobileNet x1.0 |
23.46 |
3,411,976 |
578.98M |
Training (log) |
ProxylessNAS Mobile |
21.88 |
3,055,712 |
331.44M |
Training (log) |
NTS-Net |
13.26 |
28,623,333 |
33,361.39M |
From yangze0930/NTS-Net (log) |
Pascal VOC20102
Model |
Extractor |
Pix.Acc.,% |
mIoU,% |
Params |
FLOPs/2 |
Remarks |
PSPNet |
ResNet(D)-101b |
98.09 |
81.44 |
65,708,501 |
230,586.69M |
From dmlc/gluon-cv (log) |
DeepLabv3 |
ResNet(D)-101b |
97.95 |
80.24 |
58,754,773 |
47,624.54M |
From dmlc/gluon-cv (log) |
DeepLabv3 |
ResNet(D)-152b |
98.11 |
81.20 |
74,398,421 |
59,894.06M |
From dmlc/gluon-cv (log) |
FCN-8s(d) |
ResNet(D)-101b |
97.80 |
80.40 |
52,072,917 |
196,562.96M |
From dmlc/gluon-cv (log) |
ADE20K
Model |
Extractor |
Pix.Acc.,% |
mIoU,% |
Params |
FLOPs/2 |
Remarks |
PSPNet |
ResNet(D)-50b |
79.37 |
36.87 |
46,782,550 |
162,410.82M |
From dmlc/gluon-cv (log) |
PSPNet |
ResNet(D)-101b |
79.93 |
37.97 |
65,774,678 |
230,824.47M |
From dmlc/gluon-cv (log) |
DeepLabv3 |
ResNet(D)-50b |
79.72 |
37.13 |
39,795,798 |
32,755.38M |
From dmlc/gluon-cv (log) |
DeepLabv3 |
ResNet(D)-101b |
80.21 |
37.84 |
58,787,926 |
47,650.43M |
From dmlc/gluon-cv (log) |
FCN-8s(d) |
ResNet(D)-50b |
76.92 |
33.39 |
33,146,966 |
128,387.08M |
From dmlc/gluon-cv (log) |
FCN-8s(d) |
ResNet(D)-101b |
79.01 |
35.88 |
52,139,094 |
196,800.73M |
From dmlc/gluon-cv (log) |
Cityscapes
Model |
Extractor |
Pix.Acc.,% |
mIoU,% |
Params |
FLOPs/2 |
Remarks |
PSPNet |
ResNet(D)-101b |
96.17 |
71.72 |
65,707,475 |
230,583.01M |
From dmlc/gluon-cv (log) |
ICNet |
ResNet(D)-50b |
95.50 |
64.02 |
47,489,184 |
14,241.91M |
From dmlc/gluon-cv (log) |
Fast-SCNN |
- |
95.14 |
65.76 |
1,138,051 |
3,490.05M |
From dmlc/gluon-cv (log) |
SINet |
- |
93.66 |
60.31 |
119,418 |
1,411.97M |
From clovaai/c3_sinet (log) |
DANet |
ResNet(D)-50b |
95.91 |
67.99 |
47,586,427 |
180,370.99M |
From dmlc/gluon-cv (log) |
DANet |
ResNet(D)-101b |
96.03 |
68.10 |
66,578,555 |
248,784.64M |
From dmlc/gluon-cv (log) |
COCO Semantic Segmentation
Model |
Extractor |
Pix.Acc.,% |
mIoU,% |
Params |
FLOPs/2 |
Remarks |
PSPNet |
ResNet(D)-101b |
92.05 |
67.41 |
65,708,501 |
230,586.69M |
From dmlc/gluon-cv (log) |
DeepLabv3 |
ResNet(D)-101b |
92.19 |
67.73 |
58,754,773 |
47,624.54M |
From dmlc/gluon-cv (log) |
DeepLabv3 |
ResNet(D)-152b |
92.24 |
68.99 |
74,398,421 |
275,084.22M |
From dmlc/gluon-cv (log) |
FCN-8s(d) |
ResNet(D)-101b |
91.44 |
60.11 |
52,072,917 |
196,562.96M |
From dmlc/gluon-cv (log) |
CelebAMask-HQ
COCO Keypoints Detection
Model |
Extractor |
OKS AP, % |
Params |
FLOPs/2 |
Remarks |
AlphaPose |
Fast-SE-ResNet-101b |
74.15/91.59/80.68 |
59,569,873 |
9,553.15M |
From dmlc/gluon-cv (log) |
SimplePose |
ResNet-18 |
66.31/89.20/73.41 |
15,376,721 |
1,799.25M |
From dmlc/gluon-cv (log) |
SimplePose |
ResNet-50b |
71.02/91.23/78.57 |
33,999,697 |
4,041.06M |
From dmlc/gluon-cv (log) |
SimplePose |
ResNet-101b |
72.44/92.18/79.76 |
52,991,825 |
7,685.04M |
From dmlc/gluon-cv (log) |
SimplePose |
ResNet-152b |
72.53/92.14/79.61 |
68,635,473 |
11,332.86M |
From dmlc/gluon-cv (log) |
SimplePose |
ResNet(A)-50b |
71.70/91.31/78.66 |
34,018,929 |
4,278.56M |
From dmlc/gluon-cv (log) |
SimplePose |
ResNet(A)-101b |
72.97/92.24/80.81 |
53,011,057 |
7,922.54M |
From dmlc/gluon-cv (log) |
SimplePose |
ResNet(A)-152b |
73.44/92.27/80.72 |
68,654,705 |
11,570.36M |
From dmlc/gluon-cv (log) |
SimplePose(Mobile) |
ResNet-18 |
66.25/89.17/74.32 |
12,858,208 |
1,960.96M |
From dmlc/gluon-cv (log) |
SimplePose(Mobile) |
ResNet-50b |
71.10/91.28/78.67 |
25,582,944 |
4,221.30M |
From dmlc/gluon-cv (log) |
SimplePose(Mobile) |
1.0 MobileNet-224 |
64.10/88.06/71.23 |
5,019,744 |
751.36M |
From dmlc/gluon-cv (log) |
SimplePose(Mobile) |
1.0 MobileNetV2b-224 |
63.74/88.12/71.06 |
4,102,176 |
495.95M |
From dmlc/gluon-cv (log) |
SimplePose(Mobile) |
MobileNetV3 Small 224/1.0 |
54.34/83.67/59.35 |
2,625,088 |
236.51M |
From dmlc/gluon-cv (log) |
SimplePose(Mobile) |
MobileNetV3 Large 224/1.0 |
63.67/88.91/70.82 |
4,768,336 |
403.97M |
From dmlc/gluon-cv (log) |
Lightweight OpenPose 2D |
MobileNet |
39.99/65.95/40.70 |
4,091,698 |
8,948.96M |
From Daniil-Osokin/lighw...ch (log) |
Lightweight OpenPose 3D |
MobileNet |
39.99/65.95/40.70 |
5,085,983 |
11,049.43M |
From Daniil-Osokin/li...3d...ch (log) |
IBPPose |
- |
64.86/83.62/70.12 |
95,827,784 |
57,193.82M |
From jialee93/Improved...Parts (log) |
Mozilla Common Voice (Corpus 6.1, dev subset)
Some remarks:
- NR means Noise Reduction.
- LS means trained on LibriSpeech dataset.
Model |
Lang |
WER, % |
Params |
FLOPs/2 |
Remarks |
Jasper DR 10x5 |
En |
21.90 |
332,632,349 |
85,142.96M |
From NVIDIA/NeMo (log) |
Jasper DR 10x5 NR |
En |
17.89 |
332,632,349 |
85,142.96M |
From NVIDIA/NeMo (log) |
QuartzNet 5x5 LS |
En |
44.68 |
6,713,181 |
1,717.12M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
En |
16.77 |
18,924,381 |
4,840.29M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 NR |
En |
17.74 |
18,924,381 |
4,840.29M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
De |
11.66 |
18,927,456 |
4,841.08M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
Fr |
13.88 |
18,938,731 |
4,843.96M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
It |
15.02 |
18,934,631 |
4,842.91M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
Es |
12.95 |
18,931,556 |
4,842.13M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
Ca |
8.42 |
18,934,631 |
4,842.91M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
Pl |
13.59 |
18,929,506 |
4,841.60M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
Ru |
16.48 |
18,930,531 |
4,841.87M |
From NVIDIA/NeMo (log) |
QuartzNet 15x5 |
Ru/34 |
9.68 |
18,929,506 |
4,841.60M |
From sberdevices/golos (log) |
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution