MaskGCT model for TTSDB
Project description
MaskGCT
MaskGCT is a zero-shot text-to-speech model using masked generative codec transformer from Amphion.
Installation
cd models/maskgct
just setup maskgct
Usage
from ttsdb_maskgct import MaskGCT
model = MaskGCT(model_id="ttsds/maskgct")
audio, sr = model.synthesize(
text="Hello, world!",
reference_audio="path/to/reference.wav",
text_reference="Text spoken in the reference audio.",
language="en"
)
Audio Examples
Generated audio samples from integration tests:
| Language | Text | Audio |
|---|---|---|
| English | "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good." | audio_examples/en_test_001.wav |
| Chinese | "視野無限廣,窗外有藍天" | audio_examples/zh_test_001.wav |
To regenerate examples, run:
just hf-weights-prepare maskgct # Download weights first
just test-integration maskgct
Supported Languages
- English (en)
- Chinese (zh)
- Korean (ko)
- Japanese (ja)
- French (fr)
- German (de)
Citation
@article{wang2024maskgct,
title={MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer},
author={Wang, Yuancheng and Zhan, Haoyue and Liu, Liwei and Zeng, Ruihong and Guo, Haotian and Zheng, Jiachen and Zhang, Qiang and Zhang, Xueyao and Zhang, Shunsi and Wu, Zhizheng},
journal={arXiv preprint arXiv:2409.00750},
year={2024}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ttsdb_maskgct-0.1.4-py3-none-any.whl.
File metadata
- Download URL: ttsdb_maskgct-0.1.4-py3-none-any.whl
- Upload date:
- Size: 41.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4b826ceea480cb71f9e96f2bc400f7ac06b7a37d6eb81937c39480beb16027bd
|
|
| MD5 |
987b4786edc7b4fd95d636ffd73962ef
|
|
| BLAKE2b-256 |
893c38c41d16eb5e1b29a32d2d25a84cb34eef3143c0c0826e4af232ca2d8c6a
|