Skip to main content

Voice100 Runtime is a TTS/ASR sample app that uses ONNX Runtime, WORLD and Voice100 neural TTS/ASR models on Python. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

Project description

Voice100 Runtime

Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime, WORLD and Voice100 neural TTS/ASR models on Python. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.

  • Beginnings are apt to be determinative and when reinforced by continuous applications of similar influence. Sample audio
  • mata toojinoyoonigodaimyooootoyobarerushuyoonamyoooonochuuoonihaisarerukotomoooi Japanese sample audio

Install

Use pip to install from GitHub.

pip install git+https://github.com/kaiidams/voice100-runtime.git

List available models

import voice100_runtime
print(voice100_runtime.list_models())

Using TTS

This downloads ONNX files in ~/.cache/voice100_runtime/ if not available.

import soundfile as sf
import voice100_runtime
tts = voice100_runtime.load("tts_en")
waveform, sample_rate = tts("Hello, world!")
sf.write("output.wav", waveform, sample_rate, "PCM_16")

Using ASR

This downloads an ONNX file in ~/.cache/voice100_runtime/ if not available.

import soundfile as sf
import voice100_runtime
asr = voice100_runtime.load("asr_en")
waveform, sample_rate = sf.read("output.wav")
text = asr(waveform, sample_rate)
print(text)

Using multi-task TTS

This downloads ONNX files in ~/.cache/voice100_runtime/ if not available.

import soundfile as sf
import voice100_runtime
tts = voice100_runtime.load("tts_en_mt")
waveform, sample_rate, align = tts("Hello, world!", return_align=True)
print("/".join(align))
sf.write("output.wav", waveform, sample_rate, "PCM_16")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voice100_runtime-1.6.1.tar.gz (6.8 kB view details)

Uploaded Source

Built Distribution

voice100_runtime-1.6.1-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file voice100_runtime-1.6.1.tar.gz.

File metadata

  • Download URL: voice100_runtime-1.6.1.tar.gz
  • Upload date:
  • Size: 6.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.8.15 Linux/5.19.0-38-generic

File hashes

Hashes for voice100_runtime-1.6.1.tar.gz
Algorithm Hash digest
SHA256 772e78039046c86637b0b71046bcf5ba8c23034a1f9607bffa46c3fb2b58acfb
MD5 edde711277658b60df465b5139082420
BLAKE2b-256 238b82045ab35412be8cbd3bd2c3f5a040e0756c4aa46c8b50d2055f0494f964

See more details on using hashes here.

File details

Details for the file voice100_runtime-1.6.1-py3-none-any.whl.

File metadata

  • Download URL: voice100_runtime-1.6.1-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.8.15 Linux/5.19.0-38-generic

File hashes

Hashes for voice100_runtime-1.6.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c2aaeeffae60eb9f4f883164ef30d765c786cb76018c52f9c0381b67df7f090b
MD5 062a4eed2482f56e711fe38018e97c17
BLAKE2b-256 2d3a607050de9d183f171f3101197aa3242331af88868a533058ded7a92dfbbe

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page