32 projects
f5-tts
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
spacesutils
Utilities for ZeroGPU spaces on Hugging Face Spaces
podcastpile
None
pyrqbit
Python bindings for librqbit - a feature-rich BitTorrent client library
openvoicelab
None
vibevoice-api
A model for speech generation with an AR + diffusion architecture.
pyhfapi
The official Python library for the hugging-face API
openbridge
Use Claude Code with any LLM
cckimi
Claude Code Kimi-Groq Proxy
ttstune
Configuration-driven framework for fine-tuning TTS models.
chatterbox-server
An API for ChatterboxTTS
hfcrypt
Host encrypted apps on Hugging Face Spaces
ace-step
ACE Step: A Step Towards Music Generation Foundation Model
ttslab
Run TTS models.
voicestar
VoiceStar: Robust, Duration-Controllable TTS that can Extrapolate
openf5-utils
Utilities for OpenF5 TTS
gpus
A web interface for monitoring NVIDIA GPUs
simpletts
A lightweight Python library for running TTS models with a unified API.
parler-tts
Toolkit for using and training Parler-TTS, a high-quality text-to-speech model.
descript-audio-codec-unofficial
A high-quality general neural audio codec.
descript-audiotools-unofficial
Utilities for handling audio.
tangoflux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
speechtoolkit
ML for Speech presents SpeechToolkit, a unified, all-in-one toolkit for TTS, ASR, VC, and other models.
utmos
UT-Sarulab MOS prediction system using SSL models
mfs-styletts2
Unofficial pip package for StyleTTS 2
resemble-monotonic-align
None
ns3vc
Unofficial pip package for Amphion's NaturalSpeech3 implementation
lvc
Unofficial pip package for zero-shot voice conversion
openphonemizer
Permissively licensed, open-source, local IPA Phonemizer (G2P) powered by deep learning.
melotts-test
txtsplit
A simple text splitter based on Tortoise for use in text-to-speech applications
deepsearchkit
A small, fast, and easy package for semantic searching using artificial intelligence.