Last released Mar 28, 2026
Flash-attention-class memory efficiency for GPUs without flash attention
Supported by