Profile of matteso1

Some features may not work without JavaScript. Please try enabling it if you encounter problems.

2 projects

thaw-vllm

Last released Jun 7, 2026

The fork primitive for LLM inference. Snapshot a running session — weights + KV cache + scheduler state — and hydrate it into N divergent children that skip prefill. For RL rollouts, parallel coding agents, agent branching. Supports vLLM and SGLang.