Last released Apr 19, 2026
Memory-efficient Mamba2 scan for HuggingFace Transformers. Fixes Zamba2 OOM on small GPUs.
Last released Apr 2, 2026
Model weight compression and streaming decode library
Last released Mar 6, 2026
Block-level model patching with verifiable receipts
Supported by