Profile of AstraMind

Some features may not work without JavaScript. Please try enabling it if you encounter problems.

3 projects

Last released Dec 16, 2024

This is a faster implementation for TTS models, to be used in highly async environment

Last released Jun 20, 2024

Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"

Last released Apr 21, 2024

An efficent implementation for the paper: "The Era of 1-bit LLMs"

Supported by