Skip to main content

TResNet: High Performance GPU-Dedicated Architecture

Project description

Packaged TResNet based on Official PyTorch Implementation [paper] [github]

Installation

Install with pip:

pip install torch_tresnet

or directly:

pip install git+https://github.com/tczhangzhi/torch-tresnet

Use

Follow the grammatical conventions of torchvision

from torch_tresnet import tresnet_m, tresnet_l, tresnet_xl, tresnet_m_448, tresnet_l_448, tresnet_xl_448

# pretrianed on 224*224
model = tresnet_m(pretrain=True)
model = tresnet_m(pretrain=True, num_classes=10)
model = tresnet_m(pretrain=True, num_classes=10, in_chans=3)

# pretrianed on 448*448
model = tresnet_m_448(pretrain=True)
model = tresnet_m_448(pretrain=True, num_classes=10)
model = tresnet_m_448(pretrain=True, num_classes=10, in_chans=3)

Main Results

TResNet Models

TResNet models accuracy and GPU throughput on ImageNet, compared to ResNet50. All measurements were done on Nvidia V100 GPU, with mixed precision. All models are trained on input resolution of 224.

Models

Top Training Speed (img/sec)

Top Inference Speed (img/sec)

Max Train Batch Size

Top-1 Acc.

ResNet50

805

2830

288

79.0

EfficientNetB1

440

2740

196

79.2

TResNet-M

730

2930

512

80.7

TResNet-L

345

1390

316

81.4

TResNet-XL

250

1060

240

82.0

Comparison To Other Networks

Comparison of ResNet50 to top modern networks, with similar top-1 ImageNet accuracy. All measurements were done on Nvidia V100 GPU with mixed precision. For gaining optimal speeds, training and inference were measured on 90% of maximal possible batch size. Except TResNet-M, all the models’ ImageNet scores were taken from the public repository, which specialized in providing top implementations for modern networks. Except EfficientNet-B1, which has input resolution of 240, all other models have input resolution of 224.

Model

Top Training Speed (img/sec)

Top Inference Speed (img/sec)

Top-1 Acc.

Flops[G]

ResNet50

805

2830

79.0

4.1

ResNet50-D

600

2670

79.3

4.4

ResNeXt50

490

1940

78.5

4.3

EfficientNetB1

440

2740

79.2

0.6

SEResNeXt50

400

1770

79.0

4.3

MixNet-L

400

1400

79.0

0.5

TResNet-M

730

2930

80.7

5.5

Transfer Learning SotA Results

Comparison of TResNet to state-of-the-art models on transfer learning datasets (only ImageNet-based transfer learning results). Models inference speed is measured on a mixed precision V100 GPU. Since no official implementation of Gpipe was provided, its inference speed is unknown.

Dataset

Model

Top-1 Acc.

Speed img/sec

Input

CIFAR-10

Gpipe

99.0

480

CIFAR-10

TResNet-XL

99.0

1060

224

CIFAR-100

EfficientNet-B7

91.7

70

600

CIFAR-100

TResNet-XL

91.5

1060

224

Stanford Cars

EfficientNet-B7

94.7

70

600

Stanford Cars

TResNet-L

96.0

500

368

Oxford-Flowers

EfficientNet-B7

98.8

70

600

Oxford-Flowers

TResNet-L

99.1

500

368

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torch_tresnet-1.0.8.tar.gz (7.9 kB view details)

Uploaded Source

File details

Details for the file torch_tresnet-1.0.8.tar.gz.

File metadata

  • Download URL: torch_tresnet-1.0.8.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: Python-urllib/3.7

File hashes

Hashes for torch_tresnet-1.0.8.tar.gz
Algorithm Hash digest
SHA256 28afe85170bb22a53cfc61592730cf876e839ebf579bba3abf1fd3f281027a08
MD5 b6c0cf92c8c84a7d08a280627b9ca2ea
BLAKE2b-256 9597860e23e5f712ceb4d81de196398029cb4284684fee60d8d05b4a13afd21f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page