Skip to main content
Avatar for Stefan Taubert from gravatar.com

Stefan Taubert

Username    stefantaubert
Date joined   Joined

23 projects

birdnet

Last released

A Python library for identifying bird species by their sounds.

pinyin-to-ipa

Last released

A Python library, web application, and command-line tool for transcribing Pinyin to IPA. Tone markers are attached to the vowel of each syllable.

mel-cepstral-distance

Last released

A Python library for computing the Mel-Cepstral Distance (also known as Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based on the paper 'Mel-Cepstral Distance Measure for Objective Speech Quality Assessment' by Kubichek (1993).

zho-tts

Last released

Web app, command-line interface and Python library for synthesizing Chinese texts into speech.

en-tts

Last released

Web app, command-line interface and Python library for synthesizing English texts into speech.

txt-utils

Last released

CLI to modify text files.

waveglow-cli

Last released

Command-line interface (CLI) to train WaveGlow using .wav files.

tacotron-cli

Last released

Command-line interface (CLI) to train Tacotron 2 using .wav <=> .TextGrid pairs.

dict-from-g2pE

Last released

CLI to create a pronunciation dictionary by predicting English ARPAbet phonemes using seq2seq model from g2pE and the possibility of ignoring punctuation and splitting on hyphens before prediction.

dict-from-dict

Last released

Command-line interface (CLI) to create a pronunciation dictionary from an other pronunciation dictionary with the possibility of ignoring punctuation and splitting on hyphens before lookup.

pronunciation-dictionary-utils

Last released

CLI and library to modify pronunciation dictionaries (any language).

english-text-normalization

Last released

Command-line interface (CLI) and library to normalize English texts.

dict-from-pypinyin

Last released

Command-line interface (CLI) to create a pronunciation dictionary by looking up pinyin transcriptions using pypinyin including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.

pronunciation-dictionary

Last released

Library to save and load pronunciation dictionaries (language-independent).

tts-mos-test-mturk

Last released

Command-line interface to evaluate text-to-speech mean opinion score studies done on Amazon Mechanical Turk.

mean-opinion-score

Last released

Library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).

textgrid-tools

Last released

Command-line interface (CLI) to modify TextGrids and their corresponding audio files.

text-selection

Last released

Command-line interface (CLI) to select lines of a text file.

speech-dataset-parser

Last released

Library to parse speech datasets stored in a generic format based on TextGrids. A tool (CLI) for converting common datasets like LJ Speech into a generic format is included.

dict-from-dragonmapper

Last released

Command-line interface (CLI) to create a pronunciation dictionary by looking up IPA transcriptions using dragonmapper including the possibility of ignoring punctuation and splitting words on hyphens before transcribing them.

dict-from-annotation

Last released

Command-line interface (CLI) to create a pronunciation dictionary based on annotations.

word-to-pronunciation

Last released

Create pronunciations of words with the possibility of ignoring punctuation and splitting on hyphens before lookup.

iterable-serialization

Last released

Serialization/deserialization of iterables of type 'str' to a single string.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page