Last released Oct 27, 2024
An LLM engine with an OpenAI-compatible API, featuring smart prompt caching, batch processing, structured output with guided decoding, and function calling for all models using MLX.
Last released Jul 8, 2024
A Python package for developing AI applications with local LLMs.