Pretrained Keras 3 vision models
Keras Models 🚀
📖 Introduction
Keras Models (kmodels) is a collection of pretrained computer-vision models built entirely with Keras 3. It covers classification, object detection (DETR, RT-DETR, RT-DETRv2, RF-DETR, D-FINE), segmentation (SAM, SAM2, SAM3, SegFormer, DeepLabV3, EoMT), vision-language modeling (CLIP, SigLIP, SigLIP2), and more, including hybrid architectures like MaxViT alongside traditional CNNs and pure transformers. The library ships custom layers and backbone support, and most backbones are available in multiple pretrained-weight variants, such as in1k, in21k, fb_dist_in1k, ms_in22k, fb_in22k_ft_in1k, ns_jft_in1k, aa_in1k, cvnets_in1k, augreg_in21k_ft_in1k, and augreg_in21k.
⚡ Installation
From PyPI (recommended)
pip install -U kmodels
From Source
pip install -U git+https://github.com/IMvision12/keras-models
📑 Documentation
| Topic | Description |
|---|---|
| Backbone Models | Classification backbones (ViT, ResNet, Swin, ConvNeXt, EfficientNet, and more) with usage examples and model listing |
Segmentation
| Model | Description |
|---|---|
| SAM | Segment Anything Model — promptable segmentation with points, boxes, or masks (ViT-B/L/H) |
| SAM2 | Segment Anything Model 2 — next generation of promptable visual segmentation (Hiera Tiny/Small/Base+/Large) |
| SAM3 | Segment Anything Model 3 — open-vocabulary detection + segmentation with CLIP text encoder (ViT-L/14). Weights require Meta SAM License acceptance on HuggingFace |
| SegFormer | Transformer-based semantic segmentation with MLP decoder, Cityscapes & ADE20K weights |
| DeepLabV3 | Atrous convolution-based semantic segmentation |
| EoMT | Encoder-only Mask Transformer for panoptic segmentation |
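DeepLabV3's core ingredient is atrous (dilated) convolution: the kernel taps are spaced `rate` samples apart, so the receptive field grows without adding parameters or downsampling. A minimal 1-D sketch of the idea in plain Python (illustrative only, not the kmodels API):

```python
def atrous_conv1d(signal, kernel, rate=1):
    """1-D atrous (dilated) convolution with 'valid' padding.

    With rate > 1 the kernel taps are spread out, so a 3-tap kernel
    covers a window of 1 + (len(kernel) - 1) * rate input samples.
    """
    span = 1 + (len(kernel) - 1) * rate  # effective receptive field
    out = []
    for start in range(len(signal) - span + 1):
        acc = 0.0
        for j, w in enumerate(kernel):
            acc += w * signal[start + j * rate]
        out.append(acc)
    return out

signal = [0, 0, 0, 1, 0, 0, 0]
kernel = [1, 1, 1]

# rate=1 behaves like an ordinary convolution (3-sample window)
print(atrous_conv1d(signal, kernel, rate=1))  # [0.0, 1.0, 1.0, 1.0, 0.0]
# rate=2 sees a 5-sample window with the same 3 parameters
print(atrous_conv1d(signal, kernel, rate=2))  # [0.0, 1.0, 0.0]
```

DeepLabV3 applies the 2-D analogue at several rates in parallel (atrous spatial pyramid pooling) to capture context at multiple scales.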
Object Detection
| Model | Description |
|---|---|
| DETR | End-to-end object detection with Transformers (ResNet-50/101 backbones) |
| RT-DETR | Real-time DETR with ResNet-vd backbone and hybrid encoder (ResNet-18/34/50/101 variants) |
| RT-DETRv2 | RT-DETR v2 with selective multi-scale deformable attention and learnable per-level sampling scale (ResNet-18/34/50/101 variants) |
| RF-DETR | Real-time detection transformer (Nano, Small, Medium, Base, Large variants) |
| D-FINE | Fine-grained distribution refinement detector with HGNetV2 backbone (Nano/Small/Medium/Large/XLarge) |
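What makes DETR-family models "end-to-end" is their set-based loss: a fixed set of predictions is matched one-to-one against the ground-truth boxes by minimizing a pairwise cost, so no anchors or NMS post-processing are needed. A toy brute-force version of that matching step in plain Python (the real models use the Hungarian algorithm, e.g. `scipy.optimize.linear_sum_assignment`; the costs below are made up):

```python
from itertools import permutations

def match_predictions(cost):
    """Brute-force bipartite matching: cost[i][j] is the cost of
    assigning prediction i to ground-truth box j. Returns the
    minimal-cost assignment (one prediction index per ground truth).
    Fine for toy sizes; DETR uses the Hungarian algorithm instead.
    """
    n_pred, n_gt = len(cost), len(cost[0])
    best, best_assign = float("inf"), None
    for perm in permutations(range(n_pred), n_gt):
        total = sum(cost[perm[j]][j] for j in range(n_gt))
        if total < best:
            best, best_assign = total, perm
    return best_assign, best

# 3 predictions vs 2 ground-truth boxes (illustrative costs)
cost = [
    [0.9, 0.1],   # prediction 0 fits GT 1 well
    [0.2, 0.8],   # prediction 1 fits GT 0 well
    [0.7, 0.6],   # prediction 2 fits neither; it is trained as "no object"
]
assign, total = match_predictions(cost)
print(assign)  # (1, 0): pred 1 -> GT 0, pred 0 -> GT 1, total cost ~0.3
```

Unmatched predictions are supervised toward a "no object" class, which is how the model learns to emit exactly one box per object.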
Feature Extraction
| Model | Description |
|---|---|
| DINO | Self-supervised ViT-S/B and ResNet-50 backbones trained with self-distillation |
| DINOv2 | Improved self-supervised ViT-S/B/L backbones with LayerScale, trained on LVD-142M |
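DINO trains without labels via self-distillation: a student network is trained to match the centered, sharply-tempered output distribution of a momentum teacher across different crops of the same image. The loss is a cross-entropy between two softmax distributions, sketched here in plain Python with toy logits (illustrative, not the kmodels API):

```python
import math

def softmax(logits, temperature):
    """Temperature-scaled softmax; lower temperature sharpens the distribution."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def dino_loss(teacher_logits, student_logits, center,
              teacher_temp=0.04, student_temp=0.1):
    """Cross-entropy between the centered, sharpened teacher
    distribution and the student distribution (one crop pair)."""
    t = softmax([x - c for x, c in zip(teacher_logits, center)], teacher_temp)
    s = softmax(student_logits, student_temp)
    return -sum(ti * math.log(si) for ti, si in zip(t, s))

center = [0.0, 0.0, 0.0]           # running mean of teacher outputs
teacher = [2.0, 0.5, 0.1]
aligned = dino_loss(teacher, [2.0, 0.5, 0.1], center)
misaligned = dino_loss(teacher, [0.1, 0.5, 2.0], center)
print(aligned < misaligned)        # matching crops give a lower loss
```

The centering term and the asymmetric temperatures are what prevent the student and teacher from collapsing to a constant output.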
Vision-Language Models
| Model | Description |
|---|---|
| CLIP | Contrastive Language-Image Pre-training for zero-shot classification |
| SigLIP | Sigmoid loss-based language-image pre-training with multilingual support |
| SigLIP2 | Next-gen SigLIP with improved semantic understanding and 256K vocabulary |
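CLIP-style zero-shot classification embeds the image and a set of text prompts ("a photo of a {class}") into a shared space, then ranks classes by cosine similarity. The scoring step, sketched in plain Python with made-up 3-d embeddings standing in for real model outputs (illustrative, not the kmodels API):

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) *
                  math.sqrt(sum(b * b for b in v)))

def zero_shot_classify(image_emb, text_embs, labels):
    """Rank labels by cosine similarity between the image embedding
    and each text-prompt embedding, CLIP-style."""
    scores = {lbl: cosine(image_emb, emb)
              for lbl, emb in zip(labels, text_embs)}
    return max(scores, key=scores.get), scores

# Toy embeddings; a real pipeline gets these from the image/text encoders
image_emb = [0.9, 0.1, 0.2]
text_embs = [[1.0, 0.0, 0.1],   # "a photo of a cat"
             [0.0, 1.0, 0.0],   # "a photo of a dog"
             [0.1, 0.2, 1.0]]   # "a photo of a car"
label, scores = zero_shot_classify(image_emb, text_embs,
                                   ["cat", "dog", "car"])
print(label)  # cat
```

SigLIP replaces the softmax-over-batch contrastive objective with an independent sigmoid loss per image-text pair, but the inference-time scoring above is the same.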
📑 Models
Backbones
Object Detection
| 🏷️ Model Name | 📜 Reference Paper | 📦 Source of Weights |
|---|---|---|
| D-FINE | D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement | transformers |
| DETR | End-to-End Object Detection with Transformers | transformers |
| RT-DETR | DETRs Beat YOLOs on Real-time Object Detection | transformers |
| RT-DETRv2 | RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection Transformers | transformers |
| RF-DETR | RF-DETR: Neural Architecture Search for Real-Time Detection Transformers | rfdetr |
Segmentation
| 🏷️ Model Name | 📜 Reference Paper | 📦 Source of Weights |
|---|---|---|
| DeepLabV3 | Rethinking Atrous Convolution for Semantic Image Segmentation | torchvision |
| EoMT | Your ViT is Secretly an Image Segmentation Model | transformers |
| SAM | Segment Anything | transformers |
| SAM2 | SAM 2: Segment Anything in Images and Videos | transformers |
| SAM3 | SAM 3 | transformers (gated) |
| SegFormer | SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers | transformers |
Feature Extraction
| 🏷️ Model Name | 📜 Reference Paper | 📦 Source of Weights |
|---|---|---|
| DINO | Emerging Properties in Self-Supervised Vision Transformers | torch.hub |
| DINOv2 | DINOv2: Learning Robust Visual Features without Supervision | transformers |
Vision-Language-Models (VLMs)
| 🏷️ Model Name | 📜 Reference Paper | 📦 Source of Weights |
|---|---|---|
| CLIP | Learning Transferable Visual Models From Natural Language Supervision | transformers |
| SigLIP | Sigmoid Loss for Language Image Pre-Training | transformers |
| SigLIP2 | SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | transformers |
📜 License
This project leverages timm and transformers for converting pretrained weights from PyTorch to Keras. For licensing details, please refer to the respective repositories.
- 🔖 kmodels Code: This repository is licensed under the Apache 2.0 License.
🌟 Credits
- The Keras team for their powerful and user-friendly deep learning framework
- The Transformers library for its robust tools for loading and adapting pretrained models
- The pytorch-image-models (timm) project for pioneering many computer vision model implementations
- All contributors to the original papers and architectures implemented in this library
Citing
BibTeX
@misc{gc2025kmodels,
author = {Gitesh Chawda},
title = {Keras Models},
year = {2025},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/IMvision12/keras-models}}
}