6 projects
vllm-swift
vLLM Metal plugin powered by mlx-swift — high-performance LLM inference on Apple Silicon.
longctx-svc
Local retrieval companion for inference servers — scoped, session-aware, file-watching.
longctx
Open long-context inference stack: retrieval + open weights, no closed parts.
tqkit
Unified toolkit for benchmarking and integrating TurboQuant+ KV-cache compression across inference engines (llama.cpp, vLLM, MLX).
refract-llm
REFRACT — Reference-anchored Robust Acid-test for Compressed Transformers. Multi-axis KV-cache fidelity scoring for LLMs across llama.cpp, MLX, vLLM, and SGLang.
usbinfo
Module for introspecting USB devices on a system.