Last released Apr 15, 2026
Flash Attention CUTE (CUDA Template Engine) implementation
Last released Mar 27, 2026
A place to store reusable transformer components found around the interwebs
Last released Feb 9, 2026
Flash Attention CUTE - package coming soon
Last released May 30, 2025
Helpful tools and examples for working with flex-attention
Supported by