Last released Feb 12, 2026
GPU Auto-Optimizer: automatically finds the fastest stable batch/precision configuration for ML scripts and vLLM.
Last released Feb 10, 2026
Automatic micro-batching for HTTP LLM calls and local PyTorch inference, backed by a Rust core.
Supported by