PyTorch image models, evolved — encoder/head architecture split layered on timm

These details have not been verified by PyPI

Project links

Project description

timme — timm, evolved

timme is an experimental refactor of timm that splits every image model into a reusable encoder and a separate head. It's a thin layer on top of timm — for now it reuses timm's blocks, layers, hub integration, and pretrained-weight infrastructure, while exposing a cleaner API for feature extraction, head swapping, and weight remapping.

Status: alpha / proof-of-concept. 9 model families wired (~109 variants). The shape of the public API is settling but not stable. Not production-ready.

Why

timm models are wonderful but each one is a monolithic nn.Module with forward_features + forward_head baked together. That makes a few things awkward:

using a backbone for downstream tasks (detection, segmentation, dense prediction) requires hand-rolling head removal,
swapping a head (token-pool vs avg-pool vs attention-pool, distillation, MAP) requires reaching into model internals,
intermediate features (features_only=True) go through a separate FeatureListNet wrapper rather than the model itself.

timme says: every model is ImageClassifier(encoder, head) where encoder IS the features (no forward_features indirection) and head is a swappable, well-typed module. Pretrained weights load via a small per-family WeightLayout that splits old monolithic state dicts into encoder.* / head.*.

Install

pip install timme         # not yet on PyPI; for now:
pip install -e .          # from a clone

Runtime dependency: timm>=1.0, torch>=2.0.

Usage

Drop-in replacement for timm.create_model:

import timme

# Pretrained classifier — same names as timm
model = timme.create_model('resnet50.a1_in1k', pretrained=True)
model.eval()

# Logits match timm bit-for-bit
import torch, timm
x = torch.randn(1, 3, 224, 224)
y2 = model(x)
y1 = timm.create_model('resnet50.a1_in1k', pretrained=True).eval()(x)
assert torch.equal(y1, y2)

Encoder-only (replaces features_only=True):

encoder = timme.create_encoder('vit_base_patch16_224.augreg2_in21k_ft_in1k', pretrained=True)
features = encoder(x)                                 # (B, 197, 768) — NLC
encoder = timme.create_encoder('resnet50', pretrained=True, out_indices=(0, 1, 2, 3, 4))
stages = encoder(x)                                   # list of stage tensors

Swap heads or change num_classes / in_chans:

model = timme.create_model('resnet50.a1_in1k', pretrained=True, num_classes=10)
model = timme.create_model('resnet50.a1_in1k', pretrained=True, in_chans=1)

Training And Eval Apps

Native task-style apps live under timme.apps and are installed as console scripts. The intended naming is train_{task}.py and eval_{task}.py; today that means classification training/eval, SSL training, and k-NN representation eval.

# console scripts
timme-train-cls --data.path /data/imagenet --model.model resnet50
timme-train-ssl --data.path /data/imagenet --model.model vit_tiny_patch16_224 --ssl.ssl_method nepa
timme-eval-cls --data-dir /data/imagenet/validation --model resnet50 --pretrained
timme-eval-knn --data-dir /data/imagenet --model vit_tiny_patch16_224 --checkpoint /path/to/last.pth.tar

# equivalent module entry points, useful from a source checkout
python -m timme.apps.train_cls --data.path /data/imagenet --model.model resnet50
python -m timme.apps.eval_cls --data-dir /data/imagenet/validation --model resnet50 --pretrained

Classification training:

torchrun --nproc-per-node=8 -m timme.apps.train_cls \
  --data.path /data/imagenet \
  --model.model resnet50 \
  --model.pretrained true \
  --loader.batch_size 128 \
  --scheduler.epochs 100 \
  --device.amp true

Override the train defaults when needed:

timme-train-cls \
  --data.path /data/imagenet \
  --model.model convnext_tiny \
  --optimizer.lr 5e-4 \
  --optimizer.weight_decay 0.05 \
  --model.torchcompile inductor

Self-supervised training is encoder-native: timme-train-ssl builds a bare timme.create_encoder(...) via create_train_model(..., target='encoder'). The SSL tasks train representations directly, without a classifier wrapper/head.

# NEPA
timme-train-ssl \
  --data.path /data/imagenet \
  --model.model vit_tiny_patch16_224 \
  --ssl.ssl_method nepa \
  --scheduler.epochs 100 \
  --device.amp true

# LeJEPA multi-view training
timme-train-ssl \
  --data.path /data/imagenet \
  --model.model vit_small_patch16_224 \
  --ssl.ssl_method lejepa \
  --ssl.num_views 2 \
  --ssl.lejepa_lamb 0.02

Evaluate checkpoints:

timme-eval-cls \
  --data-dir /data/imagenet/validation \
  --model resnet50 \
  --checkpoint output/train/model_best.pth.tar

timme-eval-knn \
  --data-dir /data/imagenet \
  --model vit_tiny_patch16_224 \
  --checkpoint output/ssl/model_best.pth.tar \
  --k 1 5 20 100

timme-eval-knn also uses timme.create_encoder(...) and normalizes task, DDP, and torch.compile checkpoint wrapper keys before loading into the bare encoder. For DDP plus torch.compile, the apps apply distributed wrapping before task compilation; when gradient checkpointing is enabled, timme disables Dynamo's DDP optimizer guard path that is known to fail in recent PyTorch versions.

Config files are YAML dataclass trees with the same dotted keys as the CLI:

model:
  model: vit_tiny_patch16_224
data:
  path: /data/imagenet
loader:
  batch_size: 256
optimizer:
  opt: adamw
  lr: 3e-4
  weight_decay: 0.01
scheduler:
  epochs: 100
  warmup_epochs: 5
ssl:
  ssl_method: nepa

timme-train-ssl -c configs/nepa_vit.yaml --data.path /data/imagenet

What's implemented

Current model coverage is 109 variants across 9 families, all exact-matching timm pretrained weights:

family	example variants
ResNet	`resnet18`, `resnet50`, `resnet101`, `resnet50d`, `seresnet50`
ViT	`vit_tiny/small/base/large_patch16_224/384`, CLIP/DINOv2 variants
ConvNeXt	`convnext_tiny/small/base/large`, `convnextv2_base`
MobileNetV3	`mobilenetv3_large_100`, `mobilenetv3_small_100`
ByobNet	~70 variants — gernet, resnet51q, regnetz, eca_resnet, etc.
DeiT	`deit_`, `deit3_`, distilled `deit__distilled_`
LeViT	`levit_128/192/256/384`, conv-mode variants
NaFlexViT	`naflexvit_base/so150m2/so400m_patch16_*` (gap, par_gap, map, siglip)
EVA / EVA02	`eva_giant_patch14_`, `eva02_tiny/small/base/large_patch14_`, CLIP

9 canonical heads cover the head-side variability (5 spatial for CNNs, 4 token for transformers). See ARCHITECTURE.md.

Native apps currently include:

timme.apps.train_cls / timme-train-cls: classification, distillation, DDP, AMP, EMA, mixup/cutmix, update-step scheduling.
timme.apps.eval_cls / timme-eval-cls: classifier validation.
timme.apps.train_ssl / timme-train-ssl: encoder-native NEPA and LeJEPA training.
timme.apps.eval_knn / timme-eval-knn: encoder-native k-NN representation eval.
timme.apps.sweep / timme-sweep: lightweight config sweep runner.

What's not implemented yet

The remaining ~80 timm families. Each one needs the same wiring: encoder class + WeightLayout + builder + register_family(...).
Many advanced application scripts from timm/OpenCLIP are not ported yet. Classification and SSL task apps are the active native paths.
Standalone hub story. timme reuses timm.models._registry for pretrained_cfg metadata and timm.models._builder.load_pretrained for downloads, so it depends on timm's hub integration today.
Standalone layer primitives. timme.layers is a placeholder; blocks/norms/activations/attention pools are imported from timm.layers.

Roadmap

The plan is for timme to grow into a fully standalone replacement for timm. Near-term priorities:

More families — work through the rest of timm's model zoo, prioritizing actively-developed ones.
Vendor / fork shared layers as needed — the runtime dep on timm is fine for v0 but limits independent evolution.
More native apps — extend the task app base toward additional workflows.
Hub integration — read pretrained metadata from timme's own registry instead of borrowing timm's.

License

Apache-2.0. Built on timm (also Apache-2.0). See LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.3

May 8, 2026

This version

0.0.2

May 5, 2026

0.0.1

Apr 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

timme-0.0.2.tar.gz (137.1 kB view details)

Uploaded May 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

timme-0.0.2-py3-none-any.whl (167.8 kB view details)

Uploaded May 5, 2026 Python 3

File details

Details for the file timme-0.0.2.tar.gz.

File metadata

Download URL: timme-0.0.2.tar.gz
Upload date: May 5, 2026
Size: 137.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.22

File hashes

Hashes for timme-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`6a014c0556f560301703df06e1150405c07616e0ebdc39d9d19679c8acd0362d`
MD5	`ccc91a25ad497f8de3b274229cbf79ab`
BLAKE2b-256	`feda02ffa8e2b0b8ca97c4a44af96c828d06cd65f0bfe67ce007c298e2e70c8d`

See more details on using hashes here.

File details

Details for the file timme-0.0.2-py3-none-any.whl.

File metadata

Download URL: timme-0.0.2-py3-none-any.whl
Upload date: May 5, 2026
Size: 167.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.22

File hashes

Hashes for timme-0.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`37c1dba38797a07117464a73bb7e0973e1dac19f9335c42bdb2698608f9509c0`
MD5	`bc6e8569078dc65218437bdc04527a3b`
BLAKE2b-256	`e09c4350d995212c75fed415844de7d405a4cde12e51de7ee72ca5c7e2d9e6ea`

See more details on using hashes here.

timme 0.0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

timme — timm, evolved

Why

Install

Usage

Training And Eval Apps

What's implemented

What's not implemented yet

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes