Skip to main content
Avatar for Stefan Taubert from gravatar.com

Stefan Taubert

Username    stefantaubert
Date joined   Joined

23 projects

birdnet

Last released

A Python library for identifying bird species by their sounds.

zho-tts

Last released

Web app, command-line interface and Python library for synthesizing Chinese texts into speech.

en-tts

Last released

Web app, command-line interface and Python library for synthesizing English texts into speech.

pinyin-to-ipa

Last released

Command-line interface (CLI) and Python library to transcribe pinyin to IPA.

txt-utils

Last released

CLI to modify text files.

waveglow-cli

Last released

Command-line interface (CLI) to train WaveGlow using .wav files.

tacotron-cli

Last released

Command-line interface (CLI) to train Tacotron 2 using .wav <=> .TextGrid pairs.

mel-cepstral-distance

Last released

CLI and library to compute the Mel-Cepstral Distance of two WAV files based on the paper 'Mel-Cepstral Distance Measure for Objective Speech Quality Assessment' by Robert F. Kubichek.

dict-from-g2pE

Last released

CLI to create a pronunciation dictionary by predicting English ARPAbet phonemes using seq2seq model from g2pE and the possibility of ignoring punctuation and splitting on hyphens before prediction.

dict-from-dict

Last released

Command-line interface (CLI) to create a pronunciation dictionary from an other pronunciation dictionary with the possibility of ignoring punctuation and splitting on hyphens before lookup.

pronunciation-dictionary-utils

Last released

CLI and library to modify pronunciation dictionaries (any language).

english-text-normalization

Last released

Command-line interface (CLI) and library to normalize English texts.

dict-from-pypinyin

Last released

Command-line interface (CLI) to create a pronunciation dictionary by looking up pinyin transcriptions using pypinyin including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.

pronunciation-dictionary

Last released

Library to save and load pronunciation dictionaries (language-independent).

tts-mos-test-mturk

Last released

Command-line interface to evaluate text-to-speech mean opinion score studies done on Amazon Mechanical Turk.

mean-opinion-score

Last released

Library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).

textgrid-tools

Last released

Command-line interface (CLI) to modify TextGrids and their corresponding audio files.

text-selection

Last released

Command-line interface (CLI) to select lines of a text file.

speech-dataset-parser

Last released

Library to parse speech datasets stored in a generic format based on TextGrids. A tool (CLI) for converting common datasets like LJ Speech into a generic format is included.

dict-from-dragonmapper

Last released

Command-line interface (CLI) to create a pronunciation dictionary by looking up IPA transcriptions using dragonmapper including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.

dict-from-annotation

Last released

Command-line interface (CLI) to create a pronunciation dictionary based on annotations.

word-to-pronunciation

Last released

Create pronunciations of words with the possibility of ignoring punctuation and splitting on hyphens before lookup.

iterable-serialization

Last released

Serialization/deserialization of iterables of type 'str' to a single string.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page