Last released Apr 7, 2026
Up to 28x KV cache compression for LLMs via spectral SVD projection
Supported by