Skip to main content

No project description provided

Project description

ttsds

PyPI - Version Hugginface Space

As many recent Text-to-Speech (TTS) models have shown, synthetic audio can be close to real human speech. However, traditional evaluation methods for TTS systems need an update to keep pace with these new developments. Our TTSDS benchmark assesses the quality of synthetic speech by considering factors like prosody, speaker identity, and intelligibility. By comparing these factors with both real speech and noise datasets, we can better understand how close synthetic speech is to human speech.

For the current benchmark results, see https://huggingface.co/spaces/ttsds/benchmark.

For other details, see our paper: https://arxiv.org/abs/2407.12707

Installation

Pip

pip install ttsds

Requirements

  • Python 3.8+
  • System packages: ffmpeg, automake, autoconf, unzip, sox, gfortran, subversion, libtool
  • On some systems, the fairseq installation may fail due to conflicting dependencies. In this case, you can install this fork of fairseq https://github.com/MiniXC/fairseq-noconf

Caching

Please set TTSDS_CACHE_DIR environment variable to a directory where you want to cache the downloaded models and data.

Website

License

ttsds is distributed under the terms of the MIT license.

Citation

@misc{minixhofer2024ttsdstexttospeechdistribution,
      title={TTSDS -- Text-to-Speech Distribution Score}, 
      author={Christoph Minixhofer and Ondřej Klejch and Peter Bell},
      year={2024},
      eprint={2407.12707},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2407.12707}, 
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ttsds-0.0.4.tar.gz (5.3 MB view details)

Uploaded Source

Built Distribution

ttsds-0.0.4-py3-none-any.whl (5.3 MB view details)

Uploaded Python 3

File details

Details for the file ttsds-0.0.4.tar.gz.

File metadata

  • Download URL: ttsds-0.0.4.tar.gz
  • Upload date:
  • Size: 5.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for ttsds-0.0.4.tar.gz
Algorithm Hash digest
SHA256 2a3c23785d11ac3f34815d556987ef4cf7fba706f37de66f120685d20d10e83c
MD5 e94b94c7ea7c83c2dfabb0ffc274dc07
BLAKE2b-256 51d9cf90d07f7e30b2aab65082df39931c2bdc0b1626343ae19675a7f4af9d96

See more details on using hashes here.

File details

Details for the file ttsds-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: ttsds-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 5.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for ttsds-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 cf1597ef0e4ecccc5eccbe67b40c36a95fac72920903944e97128f82cc5ef9ba
MD5 e99ed00c5bfb5293e779a049791ad716
BLAKE2b-256 c6d9a1bce94d1095b869896ce55e08620bec8ba3774f7adc4106e1a71bb3e468

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page