Last released May 16, 2026
SGLang is a fast serving framework for large language models and vision language models.
Last released Oct 9, 2023
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput large-batch generation.
Supported by