Last released May 14, 2024
A structured generation langauge for LLMs.
Last released Oct 9, 2023
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput large-batch generation.
Supported by