Last released Apr 22, 2026
Generate Shenron docker-compose deployments from model config files
Last released Feb 23, 2026
SGLang multiplexer with an OpenAI-compatible frontend
Last released Feb 20, 2026
Offline and online benchmarking utilities for large language model workloads
Supported by