Last released Jun 9, 2025
Activated LoRA (aLoRA) is a low rank adapter architecture that allows for reusing existing base model KV cache.
Supported by