GigaTrain: An Efficient and Scalable Training Framework for AI Models
Project description
An Efficient and Scalable Training Framework for AI Models
| Quick Start | Contributing | License | Citation |
✨ Introduction
GigaTrain is an efficient and scalable training framework engineered to accelerate the development of large AI models. It provides optimized performance and streamlined training workflows, allowing researchers and developers to easily experiment with various models.
Major features
- 🔍 Unified distributed training: Seamless multi-GPU/multi-node execution; supports DeepSpeed ZeRO (0/1/2/3), FSDP/FSDP2, DDP, etc.
- 🔧 Flexible and reproducible configs: Clean PY/YAML/JSON configuration and a registry-driven, modular design with pluggable optimizers, schedulers, samplers, transforms, etc.
- 📈 Performance and memory efficiency: Mixed precision (FP16/BF16/FP8), gradient accumulation, gradient checkpointing, EMA, etc.
- 📊 Built-in monitoring and checkpointing: Integrated experiment logging and robust checkpointing for reliable long runs and resumability.
- ⚡ Lightweight and Easy to Use: Simple pip/source install; developers can focus solely on implementing the key algorithm, as the framework handles repetitive, tedious, and error-prone things like backprop, logging, checkpointing, resuming, EMA, and multi-node/multi-GPU execution.
⚡ Installation
GigaTrain can be installed from PyPi and has to be installed in a virtual environment (venv or conda for instance):
pip3 install giga-train
or you can install directly from source for the latest updates:
conda create -n giga_train python=3.11.10
conda activate giga_train
git clone https://github.com/open-gigaai/giga-train.git
cd giga-train
pip3 install -e .
🚀 Getting Started
We provide a step-by-step example to teach you how to fine-tune a model using GigaTrain.
🤝 Contributing
We welcome contributions! Please see our Contributing Guide for details.
📄 License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
📖 Citation
@misc{gigaai2025gigatrain,
author = {GigaAI},
title = {GigaTrain: An Efficient and Scalable Training Framework for AI Models},
year = {2025},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/open-gigaai/giga-train}}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file giga_train-1.0.0.tar.gz.
File metadata
- Download URL: giga_train-1.0.0.tar.gz
- Upload date:
- Size: 47.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1441294defda3c741ef76e061129d19b999f6b9d5b1e647d627c9aea89561b31
|
|
| MD5 |
98245b003c9227b5c2e70a9d79eac88d
|
|
| BLAKE2b-256 |
a8dd9431aca2bbb5109c84222caf3b54908f49f4d530ea3ee4fa787bf816d8c9
|
File details
Details for the file giga_train-1.0.0-py3-none-any.whl.
File metadata
- Download URL: giga_train-1.0.0-py3-none-any.whl
- Upload date:
- Size: 58.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bc2940a2337a4573475d33407687523835745db5b37713554ddbb5a94057e6d3
|
|
| MD5 |
9cb2ea76781389ead9fc996b037e39a2
|
|
| BLAKE2b-256 |
312adbbb8a8babadfdb600adb6518fa85e5d4e748b1d2c3ccb74290ac56183dc
|