A package for training and testing image classification models using PyTorch.
Project description
FAI-Trainer
FAI-Trainer, the ultimate one-liner for image classification tasks in PyTorch! This user-friendly Python package simplifies the entire data preparation and model training process, allowing you to focus on what matters most - developing accurate models.
With FAI-Trainer, you can effortlessly convert your images into a format that's perfect for machine learning, resize them to the ideal size, and even eliminate duplicate images from your dataset. And if you're worried about splitting your data into training and validation sets? Don't be! FAI-Trainer takes care of that too.
Also: with just a single line of code, you can train a powerful Vision Transformer (ViT-B/16) model using PyTorch and torchvision. No more tedious setup or manual configuration required. Just load your dataset, specify some basic parameters, and let FAI-Trainer do the rest.
Whether you're a seasoned AI expert needing a quick trained model turnaround or just starting out with deep learning, FAI-Trainer is designed to make image classification tasks easier than ever before.
Authors: Nizamuddin Mohamed (@webnizam) GitHub: webnizam | Michael Stattelman (@mstatt) GitHub: mstatt
Features
- Multi-Model Support: Choose from multiple architectures including:
- Vision Transformer (ViT-B/16) (default)
- Vision Transformer (ViT-Tiny/16) with augmented regularization
- ResNet50
- Vision Transformer (ViT-L/16)
- EfficientNetV2-S
- ConvNeXt-Tiny
- Multi-Device Support: Automatic selection of the best available device (XPU, CUDA, MPS, or CPU) for training and inference
- Data Preparation: Convert image formats, resize images, remove duplicates, and ensure no corrupted images.
- Dataset Splitting: Automatically split datasets into training and validation sets with a specified ratio.
- Model Training: Train a ResNet50 model on the prepared dataset with configurable batch size, number of epochs, and image dimensions.
- Progress Tracking: Visual progress tracking for both training and validation phases.
- Model Testing: Load a trained model to test on a specific image or the validation dataset, and save the results.
System Requirements
- Python 3.8 or higher
- For GPU acceleration:
- NVIDIA GPU with CUDA support
- Intel GPU with XPU support (requires intel-extension-for-pytorch)
- Apple Silicon with MPS support (requires macOS 12.3+)
- 4GB RAM minimum (8GB+ recommended)
Installation
To install the basic FAI-Trainer package:
pip install fai-trainer
For Intel GPU acceleration support:
pip install fai-trainer[xpu]
If Intel XPU support fails to install or initialize, the package will automatically fall back to other available devices in this order:
- NVIDIA CUDA GPU (if available)
- Apple Silicon MPS (if available)
- CPU (always available as fallback)
Usage
Data Preparation and Model Training
To run data preparation and training with the default ViT-B/16:
fai-trainer --prepare-data --train --batch-size 16 --epochs 3 --image-size 224 224
To train with ViT-Tiny (recommended for faster training):
fai-trainer --train --model-type vit_tiny_384 --batch-size 32 --epochs 5 --image-size 384 384
To train with other model architectures:
fai-trainer --train --model-type resnet50 --batch-size 32 --epochs 5
fai-trainer --train --model-type efficientnet_v2_s --batch-size 32 --epochs 3
fai-trainer --train --model-type convnext_tiny --batch-size 32 --epochs 3
Available model types:
- vit_b_16 (Vision Transformer Base, default)
- vit_tiny_384 (Vision Transformer Tiny, ImageNet-21k pretrained + ImageNet-1k fine-tuned)
- resnet50 (ResNet50)
- vit_l_16 (Vision Transformer Large)
- efficientnet_v2_s (EfficientNetV2 Small)
- convnext_tiny (ConvNeXt Tiny)
To train without using pretrained weights:
fai-trainer --train --model-type vit_tiny_384 --no-pretrained
Model-specific Image Size Requirements:
Each model architecture has specific image size requirements:
- ViT-B/16 (default): Requires dimensions divisible by 16. Recommended sizes: 224x224, 384x384, 512x512
- ViT-Tiny/384: Requires exactly 384x384
- ViT-L/16: Requires dimensions divisible by 16. Recommended sizes: 224x224, 384x384, 512x512
- ResNet50: Minimum size 32x32, recommended 224x224
- EfficientNetV2-S: Minimum size 32x32, recommended 384x384
- ConvNeXt-Tiny: Minimum size 32x32, recommended 224x224
If you don't specify an image size, the recommended size for each model will be used automatically. If you specify an invalid size for a model, the training/testing will be aborted with an explanation of the requirements.
Model Testing
To test the model on a specific image:
fai-trainer --test --image-path path/to/your/image.jpg --model-type vit_tiny_384 --image-size 384 384
Note: When testing, make sure to use the same model type and image size that was used for training.
Directory Structure
Ensure your dataset directory has the following structure:
datasets/
├── class1/
│ ├── image1.jpg
│ ├── image2.jpg
│ └── ...
├── class2/
│ ├── image1.jpg
│ ├── image2.jpg
│ └── ...
└── class3/
├── image1.jpg
├── image2.jpg
└── ...
Directory Clean up
To clean up any prior processing or training:
fai-trainer --clean
Contributing
Contributions are welcome! Please open an issue or submit a pull request on GitHub.
©️2025 Falcons.AI | Vition.AI
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file fai_trainer-0.5.3.tar.gz.
File metadata
- Download URL: fai_trainer-0.5.3.tar.gz
- Upload date:
- Size: 15.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.22
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c435606178f6a607a0d1becd61c42e77619281d9d603d4441413d0426fedeff3
|
|
| MD5 |
e0156867e435b2cafb8517d6308fd666
|
|
| BLAKE2b-256 |
30d32d0c51f901593d4f3ee154f76040a3ed6c7efaec6b99df529606e5cdcddf
|
File details
Details for the file fai_trainer-0.5.3-py3-none-any.whl.
File metadata
- Download URL: fai_trainer-0.5.3-py3-none-any.whl
- Upload date:
- Size: 16.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.22
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dae5f9fdd424c7dc46c0f09891c7cfdb066f7b4d0d0b136d5c92cc65fe623058
|
|
| MD5 |
999dd6c99dbc34481c09aa578af5e983
|
|
| BLAKE2b-256 |
79bee03c92cb55c445a0f535c66ad84fce3c7aa409e9a4219f2bc5f8128d2bbf
|