torch_tresnet

TResNet: High Performance GPU-Dedicated Architecture

Project description

Packaged TResNet based on Official PyTorch Implementation [paper] [github]

Installation

Install with pip:

pip install torch_tresnet

or directly:

pip install git+https://github.com/tczhangzhi/torch-tresnet

Use

Follow the grammatical conventions of torchvision

from torch_tresnet import tresnet_m, tresnet_l, tresnet_xl, tresnet_m_448, tresnet_l_448, tresnet_xl_448

# pretrianed on 224*224
model = tresnet_m(pretrain=True)
model = tresnet_m(pretrain=True, num_classes=10)
model = tresnet_m(pretrain=True, num_classes=10, in_chans=3)

# pretrianed on 448*448
model = tresnet_m_448(pretrain=True)
model = tresnet_m_448(pretrain=True, num_classes=10)
model = tresnet_m_448(pretrain=True, num_classes=10, in_chans=3)

Main Results

TResNet Models

TResNet models accuracy and GPU throughput on ImageNet, compared to ResNet50. All measurements were done on Nvidia V100 GPU, with mixed precision. All models are trained on input resolution of 224.

Models	Top Training Speed (img/sec)	Top Inference Speed (img/sec)	Max Train Batch Size	Top-1 Acc.
ResNet50	805	2830	288	79.0
EfficientNetB1	440	2740	196	79.2
TResNet-M	730	2930	512	80.7
TResNet-L	345	1390	316	81.4
TResNet-XL	250	1060	240	82.0

Comparison To Other Networks

Comparison of ResNet50 to top modern networks, with similar top-1 ImageNet accuracy. All measurements were done on Nvidia V100 GPU with mixed precision. For gaining optimal speeds, training and inference were measured on 90% of maximal possible batch size. Except TResNet-M, all the models’ ImageNet scores were taken from the public repository, which specialized in providing top implementations for modern networks. Except EfficientNet-B1, which has input resolution of 240, all other models have input resolution of 224.

Model	Top Training Speed (img/sec)	Top Inference Speed (img/sec)	Top-1 Acc.	Flops[G]
ResNet50	805	2830	79.0	4.1
ResNet50-D	600	2670	79.3	4.4
ResNeXt50	490	1940	78.5	4.3
EfficientNetB1	440	2740	79.2	0.6
SEResNeXt50	400	1770	79.0	4.3
MixNet-L	400	1400	79.0	0.5
TResNet-M	730	2930	80.7	5.5

Transfer Learning SotA Results

Comparison of TResNet to state-of-the-art models on transfer learning datasets (only ImageNet-based transfer learning results). Models inference speed is measured on a mixed precision V100 GPU. Since no official implementation of Gpipe was provided, its inference speed is unknown.

Dataset	Model	Top-1 Acc.	Speed img/sec	Input
CIFAR-10	Gpipe	99.0		480
CIFAR-10	TResNet-XL	99.0	1060	224
CIFAR-100	EfficientNet-B7	91.7	70	600
CIFAR-100	TResNet-XL	91.5	1060	224
Stanford Cars	EfficientNet-B7	94.7	70	600
Stanford Cars	TResNet-L	96.0	500	368
Oxford-Flowers	EfficientNet-B7	98.8	70	600
Oxford-Flowers	TResNet-L	99.1	500	368

Project details

Release history Release notifications | RSS feed

1.0.8

Oct 26, 2020

1.0.7

Jul 30, 2020

1.0.6

Apr 19, 2020

1.0.5

Apr 19, 2020

1.0.3

Apr 19, 2020

1.0.2

Apr 19, 2020

This version

1.0.1

Apr 19, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torch_tresnet-1.0.1.tar.gz (6.1 kB view details)

Uploaded Apr 19, 2020 Source

File details

Details for the file torch_tresnet-1.0.1.tar.gz.

File metadata

Download URL: torch_tresnet-1.0.1.tar.gz
Upload date: Apr 19, 2020
Size: 6.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: Python-urllib/3.7

File hashes

Hashes for torch_tresnet-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`9ba54651768c52806ca9eb80adbf162e021d4abcaca2746a9fc1dc8da8471ef8`
MD5	`f870861cd62ee758f4e9c8a8f556ef2b`
BLAKE2b-256	`80286de99b95df670ef897299d8bbf530a549d3b3ae6211413c32d166fd14292`