20 projects
pronunciation-dictionary-utils
CLI and library to modify pronunciation dictionaries (any language).
textgrid-tools
Command-line interface (CLI) to modify TextGrids and their corresponding audio files.
txt-utils
CLI to modify text files.
text-selection
Command-line interface (CLI) to select lines of a text file.
mean-opinion-score
Library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).
tts-mos-test-mturk
Command-line interface and Python library to evaluate text-to-speech mean opinion score studies done on Amazon Mechanical Turk.
tacotron-cli
Command-line interface (CLI) to train Tacotron 2 using .wav <=> .TextGrid pairs.
speech-dataset-parser
Library to parse speech datasets stored in a generic format based on TextGrids. A tool (CLI) for converting common datasets like LJ Speech into a generic format is included.
pinyin-to-ipa
Command-line interface (CLI) and Python library to transcribe pinyin to IPA.
dict-from-pypinyin
Command-line interface (CLI) to create a pronunciation dictionary by looking up pinyin transcriptions using pypinyin including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.
dict-from-dict
Command-line interface (CLI) to create a pronunciation dictionary from an other pronunciation dictionary with the possibility of ignoring punctuation and splitting on hyphens before lookup.
pronunciation-dictionary
Library to load and save pronunciation dictionaries (any language).
dict-from-dragonmapper
Command-line interface (CLI) to create a pronunciation dictionary by looking up IPA transcriptions using dragonmapper including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.
mel-cepstral-distance
CLI and library to compute the Mel-Cepstral Distance of two WAV files based on the paper 'Mel-Cepstral Distance Measure for Objective Speech Quality Assessment' by Robert F. Kubichek.
dict-from-annotation
Command-line interface (CLI) to create a pronunciation dictionary based on annotations.
english-text-normalization
Command-line interface (CLI) and library to normalize English texts.
waveglow-cli
Command-line interface (CLI) to train WaveGlow using .wav files.
dict-from-g2pE
CLI to create a pronunciation dictionary by predicting English ARPAbet phonemes using seq2seq model from g2pE and the possibility of ignoring punctuation and splitting on hyphens before prediction.
word-to-pronunciation
Create pronunciations of words with the possibility of ignoring punctuation and splitting on hyphens before lookup.
iterable-serialization
Serialization/deserialization of iterables of type 'str' to a single string.