Last released Apr 25, 2026
High-performance, memory-fluid LLM inference engine — Rust speed, Python convenience.
Supported by