MaskGCT model for TTSDB
Project description
MaskGCT
MaskGCT is a zero-shot text-to-speech model using masked generative codec transformer from Amphion.
Installation
cd models/maskgct
just setup maskgct
Usage
from ttsdb_maskgct import MaskGCT
model = MaskGCT(model_id="ttsds/maskgct")
audio, sr = model.synthesize(
text="Hello, world!",
reference_audio="path/to/reference.wav",
text_reference="Text spoken in the reference audio.",
language="en"
)
Audio Examples
Generated audio samples from integration tests:
| Language | Text | Audio |
|---|---|---|
| English | "With tenure, Suzie'd have all the more leisure for yachting, but her publications are no good." | audio_examples/en_test_001.wav |
| Chinese | "視野無限廣,窗外有藍天" | audio_examples/zh_test_001.wav |
To regenerate examples, run:
just hf prepare maskgct # Download weights first
just test-integration maskgct
Supported Languages
- English (en)
- Chinese (zh)
- Korean (ko)
- Japanese (ja)
- French (fr)
- German (de)
Citation
@article{wang2024maskgct,
title={MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer},
author={Wang, Yuancheng and Zhan, Haoyue and Liu, Liwei and Zeng, Ruihong and Guo, Haotian and Zheng, Jiachen and Zhang, Qiang and Zhang, Xueyao and Zhang, Shunsi and Wu, Zhizheng},
journal={arXiv preprint arXiv:2409.00750},
year={2024}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ttsdb_maskgct-0.1.0.tar.gz
(9.9 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ttsdb_maskgct-0.1.0.tar.gz.
File metadata
- Download URL: ttsdb_maskgct-0.1.0.tar.gz
- Upload date:
- Size: 9.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2b4daca130ecbc12293842c433e029b52ed3c66daa7da4f8ee75f30ab1c66146
|
|
| MD5 |
276a23ba1e048bfa8801feb162e03f8a
|
|
| BLAKE2b-256 |
9abef378aa2396ae5611188f9eab21f06ffb5ff0d0687a4c382442f501e5f501
|
File details
Details for the file ttsdb_maskgct-0.1.0-py3-none-any.whl.
File metadata
- Download URL: ttsdb_maskgct-0.1.0-py3-none-any.whl
- Upload date:
- Size: 6.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b1c6c90ab9f02fff276174ee3c0dc1ad61e9191aa4938120fa44e7a0cc8bebf0
|
|
| MD5 |
c36b7d046acd3782c45563cda1031d3d
|
|
| BLAKE2b-256 |
c6e2f38779fc58689c17b6190b5f521c2db06d37a42f803d5a40f59f874c9116
|