Last released May 31, 2024
Efficiently run models quantized with AQLM
Last released Sep 11, 2023
A small framework for reproducing FL experiments
Last released Aug 6, 2023
Automatically shard your large model across multiple GPUs; works without torch.distributed