ttsds
As many recent Text-to-Speech (TTS) models have shown, synthetic audio can be close to real human speech. However, traditional evaluation methods for TTS systems need an update to keep pace with these new developments. Our TTSDS benchmark assesses the quality of synthetic speech by considering factors like prosody, speaker identity, and intelligibility. By comparing these factors with both real speech and noise datasets, we can better understand how close synthetic speech is to human speech.
For the current benchmark results, see https://huggingface.co/spaces/ttsds/benchmark.
For other details, see our paper: https://arxiv.org/abs/2407.12707
Installation
Pip
pip install ttsds
Requirements
- Python 3.8+
- System packages: ffmpeg, automake, autoconf, unzip, sox, gfortran, subversion, libtool
- On some systems, the fairseq installation may fail due to conflicting dependencies. In that case, install this fork of fairseq instead: https://github.com/MiniXC/fairseq-noconf
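Before running the install, it can help to confirm that the requirements listed above are actually present. The following is a minimal sketch, not part of ttsds: the helper names `missing_tools` and `python_ok` are hypothetical, and the executable names are assumptions that may differ between distributions (Subversion usually installs as `svn`, and some systems ship `libtool` under a different name).

```python
import shutil
import sys

# Package name -> executable expected on PATH.
# The executable names are assumptions and may vary by platform.
REQUIRED_TOOLS = {
    "ffmpeg": "ffmpeg",
    "automake": "automake",
    "autoconf": "autoconf",
    "unzip": "unzip",
    "sox": "sox",
    "gfortran": "gfortran",
    "subversion": "svn",
    "libtool": "libtool",
}


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the package names whose executable is not found on PATH."""
    return [pkg for pkg, exe in tools.items() if shutil.which(exe) is None]


def python_ok(minimum=(3, 8)):
    """Check the running interpreter against the minimum version listed above."""
    return sys.version_info >= minimum


if __name__ == "__main__":
    if not python_ok():
        print("Python 3.8 or newer is required.")
    missing = missing_tools()
    if missing:
        print("Missing system packages:", ", ".join(missing))
    else:
        print("All listed system packages were found on PATH.")
```

A tool reported as missing may simply be installed under another name, so treat the output as a hint rather than a definitive check.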
Caching
Please set the TTSDS_CACHE_DIR environment variable to the directory where the downloaded models and data should be cached.
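One way to set the variable from Python is sketched below. The helper name `configure_cache` and the default path `~/.cache/ttsds` are hypothetical choices for illustration. This README does not say when ttsds reads the variable, so the sketch assumes it is safest to set it before importing the package.

```python
import os
from pathlib import Path


def configure_cache(path="~/.cache/ttsds"):
    """Create the cache directory and point TTSDS_CACHE_DIR at it.

    The default path is only an example; any writable directory works.
    """
    cache_dir = Path(path).expanduser().resolve()
    cache_dir.mkdir(parents=True, exist_ok=True)
    os.environ["TTSDS_CACHE_DIR"] = str(cache_dir)
    return cache_dir


if __name__ == "__main__":
    # Assumption: call this before importing ttsds so the setting is visible.
    print("TTSDS_CACHE_DIR =", configure_cache())
```

Setting the variable this way only affects the current process and its children. To make it persistent, export it in your shell profile instead.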
License
ttsds is distributed under the terms of the MIT license.
Citation
@misc{minixhofer2024ttsdstexttospeechdistribution,
title={TTSDS -- Text-to-Speech Distribution Score},
author={Christoph Minixhofer and Ondřej Klejch and Peter Bell},
year={2024},
eprint={2407.12707},
archivePrefix={arXiv},
primaryClass={eess.AS},
url={https://arxiv.org/abs/2407.12707},
}