Last released May 6, 2026
3-tier distributed KV cache for LLM inference — preserve evicted KV across cluster nodes
Rust extension: TieredKVCache + TurboQuant + gRPC servers
Supported by