Skip to main content

No project description provided

Project description

ttsds

PyPI - Version Hugginface Space

As many recent Text-to-Speech (TTS) models have shown, synthetic audio can be close to real human speech. However, traditional evaluation methods for TTS systems need an update to keep pace with these new developments. Our TTSDS benchmark assesses the quality of synthetic speech by considering factors like prosody, speaker identity, and intelligibility. By comparing these factors with both real speech and noise datasets, we can better understand how close synthetic speech is to human speech.

For the current benchmark results, see https://huggingface.co/spaces/ttsds/benchmark.

For other details, see our paper: https://arxiv.org/abs/2407.12707

Installation

Pip

pip install ttsds

Requirements

  • Python 3.8+
  • System packages: ffmpeg, automake, autoconf, unzip, sox, gfortran, subversion, libtool
  • On some systems, the fairseq installation may fail due to conflicting dependencies. In this case, you can install this fork of fairseq https://github.com/MiniXC/fairseq-noconf

Caching

Please set TTSDS_CACHE_DIR environment variable to a directory where you want to cache the downloaded models and data.

Website

License

ttsds is distributed under the terms of the MIT license.

Citation

@misc{minixhofer2024ttsdstexttospeechdistribution,
      title={TTSDS -- Text-to-Speech Distribution Score}, 
      author={Christoph Minixhofer and Ondřej Klejch and Peter Bell},
      year={2024},
      eprint={2407.12707},
      archivePrefix={arXiv},
      primaryClass={eess.AS},
      url={https://arxiv.org/abs/2407.12707}, 
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ttsds-0.0.3.tar.gz (5.3 MB view details)

Uploaded Source

Built Distribution

ttsds-0.0.3-py3-none-any.whl (5.3 MB view details)

Uploaded Python 3

File details

Details for the file ttsds-0.0.3.tar.gz.

File metadata

  • Download URL: ttsds-0.0.3.tar.gz
  • Upload date:
  • Size: 5.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for ttsds-0.0.3.tar.gz
Algorithm Hash digest
SHA256 9c3b998cc93c556b256bb9f036b67506d0c5228e13ca4ac9183f8bc3f8848cb8
MD5 ff7ba5de6b95c68f23acf5d0a9ee6c31
BLAKE2b-256 a6406c125356fc34a0aa89dc05571eeed1d941cf4147368b75155b6fd5c1811a

See more details on using hashes here.

File details

Details for the file ttsds-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: ttsds-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 5.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.9

File hashes

Hashes for ttsds-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 657aa2d38bd1873b029b3ee06e89f885b17b0a52943dabd52596d951ff4d4600
MD5 4c7ccbcbc5a13f5b3d1b771d5a5edb9a
BLAKE2b-256 db5fcd842a16474e588e842979533d85331ec74c55970fe0f4be57ae49dd9aa8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page