Last released Jan 16, 2025
NPU monitoring tool with a TUI
A high-throughput and memory-efficient inference and serving engine for LLMs