Last released Jan 12, 2026
Cross-platform FlashAttention-2 Triton implementation for Turing+ architectures