Last released Feb 10, 2026
Fast, memory-efficient attention column reduction (e.g., sum, mean, max)
Last released Apr 6, 2026
ParoQuant — Pairwise Rotation Quantization for LLMs