Last released May 16, 2026
TurboQuant+ compression for vLLM. 4.3x weight compression + 3.7x KV cache, zero calibration.
Last released May 13, 2026
Minimal async DB layer for Python. Typed CRUD over Pydantic, raw SQL when you need it
Last released Mar 28, 2026
Numpy-only TurboQuant vector quantization. No PyTorch, no CUDA.
Supported by