No project description provided

These details have not been verified by PyPI

Project links

Project description

TTSDS - Text-to-Speech Distribution Score

TTSDS is a comprehensive benchmark for evaluating the quality of synthetic speech in Text-to-Speech (TTS) systems. It assesses multiple aspects of speech quality including prosody, speaker identity, and intelligibility by comparing synthetic speech with both real speech and noise datasets.

Version 2.1.3

We are excited to release TTSDS 2.1.3 - this release restores compatibility with transformers>=5.0, torchaudio>=2.10, and huggingface_hub>=1.4. TTSDS2 is multilingual and updated quarterly, with a new dataset every time: you can view the results at https://ttsdsbenchmark.com#leaderboard.

Features

Multi-dimensional Evaluation: Assess speech quality across different categories:
- Prosody (e.g., pitch, speaking rate)
- Speaker Identity (e.g., speaker verification)
- Intelligibility (e.g., speech recognition)
- Generic Features (e.g., embeddings)
- Environment (e.g., noise robustness)
Weighted Scoring: Customizable weights for different evaluation categories
Progress Tracking: Real-time progress display with detailed statistics
Caching: Efficient caching of intermediate results
Error Handling: Robust error handling with optional skipping of failed benchmarks

Installation

System Requirements

# Required system packages
sudo apt-get install ffmpeg automake autoconf unzip sox gfortran subversion libtool

ttsds has been tested with python 3.10, 3.11, and 3.12. Versions <3.10 and >3.12 are not supported (for now).

A note on numpy 2.0

TTSDS has some dependencies which require numpy<2.0.0. Use the following command if the ttsds installation does not automatically install numpy<2.0.0:

pip install "numpy<2"

Python Installation

# Basic installation
pip install ttsds

Optional: Fairseq Installation

If you encounter dependency conflicts with fairseq, use this fork:

pip install git+https://github.com/MiniXC/fairseq-noconf

Development Installation

For development, install with the dev extras:

# Clone the repository
git clone https://github.com/ttsds/ttsds.git
cd ttsds

# Install with development dependencies
pip install -e ".[dev]"

Usage

Basic Example

from ttsds import BenchmarkSuite
from ttsds.util.dataset import DirectoryDataset

# Initialize datasets
datasets = [
    DirectoryDataset("path/to/your/dataset", name="your_dataset")
]
reference_datasets = [
    DirectoryDataset("path/to/reference/dataset", name="reference")
]

# Create benchmark suite
suite = BenchmarkSuite(
    datasets=datasets,
    reference_datasets=reference_datasets,
    write_to_file="results.csv",  # Optional: save results to CSV
    skip_errors=True,  # Optional: skip failed benchmarks
    include_environment=False,  # Optional: exclude environment benchmarks
)

# Run benchmarks
results = suite.run()

# Get aggregated results with weighted scores
aggregated = suite.get_aggregated_results()
print(aggregated)

The datasets should be directories containing wav files. Since this is a distributional score, the wav files do not need to include the same content, and the number of files can vary between datasets. However, results are best when the speaker identities are the same.

Custom Category Weights

from ttsds.benchmarks.benchmark import BenchmarkCategory

suite = BenchmarkSuite(
    datasets=datasets,
    reference_datasets=reference_datasets,
    category_weights={
        BenchmarkCategory.SPEAKER: 0.25,
        BenchmarkCategory.INTELLIGIBILITY: 0.25,
        BenchmarkCategory.PROSODY: 0.25,
        BenchmarkCategory.GENERIC: 0.25,
        BenchmarkCategory.ENVIRONMENT: 0.0,
    },
)

Multilingual

suite = BenchmarkSuite(
    datasets=datasets,
    reference_datasets=reference_datasets,
    multilingual=True,
)

Progress Display

The benchmark suite provides a real-time progress display showing:

Overall progress
Per-benchmark completion status
Estimated time remaining
Error messages (if any)

Configuration

Environment Variables

# Set cache directory (default: ~/.cache/ttsds)
export TTSDS_CACHE_DIR=/path/to/cache

Benchmark Categories

Speaker: Evaluates speaker identity preservation
Intelligibility: Measures speech recognition performance
Prosody: Assesses speech rhythm and intonation
Generic: General speech quality metrics
Environment: Noise robustness evaluation - this is excluded by default, set include_environment=True to include it.

Results

The benchmark results include:

Individual benchmark scores
Category-wise aggregated scores
Overall weighted score
Time taken for each benchmark
Reference and noise dataset information

Results can be saved to a CSV file for further analysis.

Development

Running Tests

TTSDS includes a comprehensive test suite covering its functionality:

# Run all tests
cd ttsds
./tests/run_tests.py

# Run specific test modules or classes
./tests/run_tests.py tests/unit/benchmarks/test_benchmark.py
./tests/run_tests.py tests/unit/test_ttsds.py::test_benchmark_suite_init

# Run with coverage report
./tests/run_tests.py --cov-report=html

The test suite uses pytest and includes:

Unit tests for individual components
Integration tests for the full system
Test coverage reporting

Documentation

The API documentation is automatically generated from docstrings using mkdocstrings:

# Build the documentation
pip install -e ".[dev]"
mkdocs build

# Serve the documentation locally
mkdocs serve

Citation

@inproceedings{minixhofer2024ttsds,
  title={TTSDS-Text-to-Speech Distribution Score},
  author={Minixhofer, Christoph and Klejch, Ond{\v{r}}ej and Bell, Peter},
  booktitle={SLT},
  year={2024},
}

License

ttsds is distributed under the terms of the MIT license.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

2.1.3

Jun 5, 2026

2.1.1

May 18, 2025

2.1.0 yanked

May 16, 2025

Reason this release was yanked:

This version does not set up import correctly, please upgrade to 2.1.1

2.0.0

Apr 18, 2025

0.0.4

Nov 22, 2024

0.0.3

Nov 5, 2024

0.0.2

Jul 18, 2024

0.0.1

Jul 18, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ttsds-2.1.3.tar.gz (5.3 MB view details)

Uploaded Jun 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ttsds-2.1.3-py3-none-any.whl (5.3 MB view details)

Uploaded Jun 5, 2026 Python 3

File details

Details for the file ttsds-2.1.3.tar.gz.

File metadata

Download URL: ttsds-2.1.3.tar.gz
Upload date: Jun 5, 2026
Size: 5.3 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for ttsds-2.1.3.tar.gz
Algorithm	Hash digest
SHA256	`8414b375c984d33aa49535dc48c929659da6a9c0b35c635828db105c628398c1`
MD5	`f289effcdba50049ad29f259c21c4664`
BLAKE2b-256	`5f57610880c18246e7c4d6f6a808bb6a4874b7bbd1ccd49604574c4519a6c156`

See more details on using hashes here.

File details

Details for the file ttsds-2.1.3-py3-none-any.whl.

File metadata

Download URL: ttsds-2.1.3-py3-none-any.whl
Upload date: Jun 5, 2026
Size: 5.3 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for ttsds-2.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d1e662ab3c05ac7a70c839c10e3b39f2a6b1afc3df604c29435af508be7cd230`
MD5	`851dba7c2563c79b70c6aaf4ad058872`
BLAKE2b-256	`3e56c76ba59ec1d52f87b22c583f50f0ea6480db05b16001963bbb67ff4aea8a`

See more details on using hashes here.

ttsds 2.1.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TTSDS - Text-to-Speech Distribution Score

Version 2.1.3

Features

Installation

System Requirements

A note on numpy 2.0

Python Installation

Optional: Fairseq Installation

Development Installation

Usage

Basic Example

Custom Category Weights

Multilingual

Progress Display

Configuration

Environment Variables

Benchmark Categories

Results

Development

Running Tests

Documentation

Citation

License

Links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes