```py
from keras_cv_attention_models import volo
mm = volo.VOLO_d1(pretrained="imagenet")

""" Run predict """
import tensorflow as tf
from tensorflow import keras
from skimage.data import chelsea
img = chelsea()  # Chelsea the cat
imm = keras.applications.imagenet_utils.preprocess_input(img, mode='torch')
pred = mm(tf.expand_dims(tf.image.resize(imm, mm.input_shape[1:3]), 0)).numpy()
pred = tf.nn.softmax(pred).numpy()  # If classifier activation is not softmax
print(keras.applications.imagenet_utils.decode_predictions(pred)[0])
# [('n02124075', 'Egyptian_cat', 0.9692954),
#  ('n02123045', 'tabby', 0.020203391),
#  ('n02123159', 'tiger_cat', 0.006867502),
#  ('n02127052', 'lynx', 0.00017674894),
#  ('n02123597', 'Siamese_cat', 4.9493494e-05)]
```
`attention_layers` is an `__init__.py`-only module that imports the core layers defined in the model architectures, such as `MHSAWithPositionEmbedding` from `botnet` and `HaloAttention` from `halonet`.
`model_surgery` contains functions used to change model parameters after the model is built.
```py
from tensorflow import keras
from keras_cv_attention_models import model_surgery

# Replace all ReLU with PReLU
mm = model_surgery.replace_ReLU(keras.applications.ResNet50(), target_activation='PReLU')
```
AotNet
Keras AotNet is just a ResNet / ResNetV2-like framework that exposes parameters such as `attn_types` and `se_ratio`, which are used to apply different types of attention layers.
```py
# Mixing se and outlook and halo and mhsa and cot_attention, 21M parameters
# 50 is just a picked number that is larger than the relative `num_block`
from keras_cv_attention_models import aotnet

attn_types = [None, "outlook", ["mhsa", "halo"] * 50, "cot"]
se_ratio = [0.25, 0, 0, 0]
mm = aotnet.AotNet50V2(attn_types=attn_types, se_ratio=se_ratio, deep_stem=True, strides=1)
```
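The comment above notes that `50` only needs to exceed the stack's `num_block`, which suggests that a list entry in `attn_types` is indexed per block within its stack. A minimal pure-Python sketch of that lookup (an illustration of the configuration scheme, with a hypothetical helper name `block_attn_type`, not the library's actual code):

```python
attn_types = [None, "outlook", ["mhsa", "halo"] * 50, "cot"]

def block_attn_type(attn_types, stack_id, block_id):
    """Resolve the attention type for one block: a per-stack entry may be
    None (plain conv block), a single string (same type for every block in
    the stack), or a list indexed by the block id within the stack."""
    value = attn_types[stack_id]
    return value[block_id] if isinstance(value, (list, tuple)) else value

print(block_attn_type(attn_types, 2, 0))  # mhsa
print(block_attn_type(attn_types, 2, 1))  # halo
print(block_attn_type(attn_types, 3, 2))  # cot
```

With this scheme `["mhsa", "halo"] * 50` simply alternates the two attention types across the blocks of the third stack, regardless of its actual depth.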