4 projects
turboquant
TurboQuant KV cache compression for LLM inference: an open-source, pip-installable implementation for HuggingFace models.
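As a rough illustration of what KV cache quantization involves (this is a generic sketch, not turboquant's actual API; the function names, shapes, and per-head absmax scaling scheme here are assumptions):

```python
import numpy as np

def quantize_kv(kv, bits=8):
    """Quantize a KV cache tensor to signed integers with per-head absmax scaling.

    kv: float32 array of shape (heads, seq_len, head_dim).
    Hypothetical helper for illustration only, not turboquant's API.
    """
    qmax = 2 ** (bits - 1) - 1
    # One scale per head: max absolute value mapped to qmax.
    scale = np.abs(kv).max(axis=(1, 2), keepdims=True) / qmax
    codes = np.round(kv / scale).astype(np.int8)
    return codes, scale

def dequantize_kv(codes, scale):
    # Reconstruct approximate float values from int8 codes.
    return codes.astype(np.float32) * scale

rng = np.random.default_rng(1)
k = rng.standard_normal((4, 128, 64)).astype(np.float32)  # fake key cache
codes, scale = quantize_kv(k)
k_hat = dequantize_kv(codes, scale)
max_err = np.abs(k - k_hat).max()
```

Storing int8 codes instead of float32 values gives a 4x memory reduction per cached key/value tensor, at the cost of a small, bounded reconstruction error.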
turboquant-vectors
Compress and protect embeddings with TurboQuant. Zero-loss privacy via an orthogonal rotation, plus 8x compression; no training needed.
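A minimal sketch of the general rotate-then-quantize idea behind this tagline (hypothetical names and a 4x int8 scheme for clarity; the project's actual API, bit width, and rotation construction are not shown here):

```python
import numpy as np

rng = np.random.default_rng(0)

def random_orthogonal(d, rng):
    # QR decomposition of a Gaussian matrix gives a random orthogonal matrix.
    q, r = np.linalg.qr(rng.standard_normal((d, d)))
    # Fix column signs so the distribution is uniform over rotations.
    return q * np.sign(np.diag(r))

def compress(embeddings, rotation):
    # Rotate (the rotation acts like a key: without it, the codes are
    # scrambled), then quantize each vector to int8 with per-row scaling.
    rotated = embeddings @ rotation
    scale = np.abs(rotated).max(axis=1, keepdims=True) / 127.0
    codes = np.round(rotated / scale).astype(np.int8)
    return codes, scale

def decompress(codes, scale, rotation):
    # Undo quantization, then invert the rotation (transpose = inverse).
    return (codes.astype(np.float32) * scale) @ rotation.T

d = 64
x = rng.standard_normal((8, d)).astype(np.float32)
R = random_orthogonal(d, rng)
codes, scale = compress(x, R)
x_hat = decompress(codes, scale, R)
max_err = np.abs(x - x_hat).max()
```

No training is needed because the rotation is drawn at random rather than learned; only the holder of `R` can map the quantized codes back to meaningful embeddings.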
quantsim-bench
Which quantization should I use? One command benchmarks every quant level on YOUR GPU.
kvcache-bench
Benchmark every KV cache compression method on your GPU. One command, real numbers.