Skip to main content

KerasFormers: Open-source Keras 3 collection of pretrained models

Project description

KerasFormers ๐Ÿš€

License Keras Python

๐Ÿ“– Introduction

KerasFormers is a collection of models with pretrained weights, built entirely with Keras 3. It supports a range of tasks, including classification, object detection (DETR, RT-DETR, RT-DETRv2, RF-DETR, D-FINE, OWL-ViT, OWLv2, Grounding DINO), segmentation (SAM, SAM2, SAM3, SegFormer, DeepLabV3, EoMT, MaskFormer, Mask2Former, OneFormer, MobileViT-DeepLabV3, RF-DETR), monocular depth estimation (Depth Anything V1, Depth Anything V2), feature extraction (DINO, DINOv2, DINOv3), vision-language modeling (CLIP, SigLIP, SigLIP2, MetaCLIP 2), speech recognition (Whisper, Speech2Text, Moonshine), speech-aware language modeling (Granite Speech, Granite Speech Plus), text encoding and masked language modeling (BERT, RoBERTa, XLM-RoBERTa, DeBERTa, DeBERTa-v2, DeBERTa-v3), text generation with large language models (GPT, GPT-2, Qwen2, Qwen2-MoE, Qwen3, Qwen3-MoE, Qwen3.5, Qwen3.5-MoE, GPT-OSS, Llama 2, Llama 3, Llama 4, Mistral, Mixtral, Gemma, Gemma 2, Gemma 4, MiniMax-Text-01, MiniMax-M2, DeepSeek-V2, DeepSeek-V3, DeepSeek-V4, Cohere/Command-R, Cohere2, Cohere2-MoE, GLM-4, GLM-4-0414, GLM-4.5/GLM-4.6, GLM-5/GLM-5.1/GLM-5.2), multimodal vision-language generation (Qwen2-VL, Qwen2.5-VL, Qwen3-VL, InternVL3, Gemma 3, Mistral 3, DeepSeek-VL, Janus-Pro, MiniMax-M3-VL, Cohere2-Vision, GLM-4V, GLM-4.5V), vision-language grounding across object detection, OCR, pointing, and referring (LocateAnything), and more. It includes hybrid architectures like MaxViT alongside traditional CNNs and pure transformers. kerasformers includes custom layers and backbone support, providing flexibility and efficiency across various applications. For backbones, there are various weight variants like in1k, in21k, fb_dist_in1k, ms_in22k, fb_in22k_ft_in1k, ns_jft_in1k, aa_in1k, cvnets_in1k, augreg_in21k_ft_in1k, augreg_in21k, and many more.

โšก Installation

From PyPI (recommended)

pip install -U kerasformers

From Source

pip install -U git+https://github.com/IMvision12/KerasFormers

๐Ÿ“‘ Documentation

Per-model guides - with architecture notes, usage examples, and available pretrained weights, live in the docs/ folder, one page per model across every supported task (classification, object detection, segmentation, depth estimation, feature extraction, vision-language, speech recognition, text encoding, and language modeling). Classification backbones share a single page since they all follow the same XModel / XImageClassify two-class structure; each other model has its own. Browse docs/ for the complete, always-up-to-date list.

๐Ÿ“‘ Models

๐Ÿ“ Text Models


๐Ÿ‘๏ธ Vision Models






๐Ÿ–ผ๏ธ Multimodal Models




๐Ÿ”Š Audio Models


๐Ÿ“œ License

This project leverages timm and transformers for converting pretrained weights from PyTorch to Keras. For licensing details, please refer to the respective repositories.

๐ŸŒŸ Credits

  • The Keras team for their powerful and user-friendly deep learning framework
  • The Transformers library for its robust tools for loading and adapting pretrained models
  • The pytorch-image-models (timm) project for pioneering many computer vision model implementations
  • All contributors to the original papers and architectures implemented in this library

Citing

BibTeX

@misc{gc2025kerasformers,
  author = {Gitesh Chawda},
  title = {KerasFormers},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/IMvision12/KerasFormers}}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kerasformers-1.0.9.tar.gz (1.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kerasformers-1.0.9-py3-none-any.whl (1.6 MB view details)

Uploaded Python 3

File details

Details for the file kerasformers-1.0.9.tar.gz.

File metadata

  • Download URL: kerasformers-1.0.9.tar.gz
  • Upload date:
  • Size: 1.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kerasformers-1.0.9.tar.gz
Algorithm Hash digest
SHA256 02433a50aedc1226f44f00f75cd8c096a3c5bd6c9b9b5d53bf3056a8831b37bf
MD5 bb35bb9164d8f24fc2ee6c5aabf61c71
BLAKE2b-256 43f33214ab7f8f3f60e79e8e2e6670e28b1ec696c55f0fc8d946873ce6aa2474

See more details on using hashes here.

Provenance

The following attestation bundles were made for kerasformers-1.0.9.tar.gz:

Publisher: release.yml on IMvision12/KerasFormers

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file kerasformers-1.0.9-py3-none-any.whl.

File metadata

  • Download URL: kerasformers-1.0.9-py3-none-any.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kerasformers-1.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 8c996ea1bda229b22a61e29df916ab22a3043b21735f26a9ce33131bc575f6cd
MD5 1181426def45e1676a36d109286c0999
BLAKE2b-256 25af31f8e1cc8929197b9087eea381b5807fa10b22db1b57dc70f71ab54d9783

See more details on using hashes here.

Provenance

The following attestation bundles were made for kerasformers-1.0.9-py3-none-any.whl:

Publisher: release.yml on IMvision12/KerasFormers

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page