23 projects
birdnet
A Python library for identifying bird species by their sounds.
zho-tts
Web app, command-line interface and Python library for synthesizing Chinese texts into speech.
en-tts
Web app, command-line interface and Python library for synthesizing English texts into speech.
pinyin-to-ipa
Command-line interface (CLI) and Python library to transcribe pinyin to IPA.
txt-utils
CLI to modify text files.
waveglow-cli
Command-line interface (CLI) to train WaveGlow using .wav files.
tacotron-cli
Command-line interface (CLI) to train Tacotron 2 using .wav <=> .TextGrid pairs.
mel-cepstral-distance
CLI and library to compute the Mel-Cepstral Distance of two WAV files based on the paper 'Mel-Cepstral Distance Measure for Objective Speech Quality Assessment' by Robert F. Kubichek.
dict-from-g2pE
CLI to create a pronunciation dictionary by predicting English ARPAbet phonemes using seq2seq model from g2pE and the possibility of ignoring punctuation and splitting on hyphens before prediction.
dict-from-dict
Command-line interface (CLI) to create a pronunciation dictionary from an other pronunciation dictionary with the possibility of ignoring punctuation and splitting on hyphens before lookup.
pronunciation-dictionary-utils
CLI and library to modify pronunciation dictionaries (any language).
english-text-normalization
Command-line interface (CLI) and library to normalize English texts.
dict-from-pypinyin
Command-line interface (CLI) to create a pronunciation dictionary by looking up pinyin transcriptions using pypinyin including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.
pronunciation-dictionary
Library to save and load pronunciation dictionaries (language-independent).
tts-mos-test-mturk
Command-line interface to evaluate text-to-speech mean opinion score studies done on Amazon Mechanical Turk.
mean-opinion-score
Library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).
textgrid-tools
Command-line interface (CLI) to modify TextGrids and their corresponding audio files.
text-selection
Command-line interface (CLI) to select lines of a text file.
speech-dataset-parser
Library to parse speech datasets stored in a generic format based on TextGrids. A tool (CLI) for converting common datasets like LJ Speech into a generic format is included.
dict-from-dragonmapper
Command-line interface (CLI) to create a pronunciation dictionary by looking up IPA transcriptions using dragonmapper including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.
dict-from-annotation
Command-line interface (CLI) to create a pronunciation dictionary based on annotations.
word-to-pronunciation
Create pronunciations of words with the possibility of ignoring punctuation and splitting on hyphens before lookup.
iterable-serialization
Serialization/deserialization of iterables of type 'str' to a single string.