Profile of FluffyAIcode

Some features may not work without JavaScript. Please try enabling it if you encounter problems.

1 project

Last released Apr 25, 2026

Nested-lattice KV-cache compression for LLM inference: Zamir-Feder D4 and E8 variants with shaping gain over scalar quantisation.

Supported by