Last released Mar 14, 2025
A multimodal model training toolkit
Last released Apr 11, 2024
A framework for efficient fault tolerance in large scale distributed training with pipeline template.
Supported by