Last released May 21, 2026
A minimal, high-performance large language model (LLM) inference engine implementing vLLM in Rust.
Supported by