Last released Apr 15, 2026
Fast snapshot/restore for LLM inference: 17x faster cold starts, multi-GPU tensor parallelism, and KV cache snapshots.
Rust+CUDA native extension for thaw (pipelined DMA freeze/restore)