Last released Mar 24, 2026
A configuration-driven LLM compression framework using low-rank factorization and layer removal
Supported by