Last released Aug 1, 2024
A simple and efficient python library for fast inference of GGUF Large Language Models.
Last released Jun 20, 2024
Supported by