Last released Jul 10, 2025
Python package wrapping llama.cpp for on-device LLM inference