Last released Apr 17, 2025
Galvatron, a Efficient Transformer Training Framework for Multiple GPUs Using Automatic Parallelism
Supported by