Voice100 Runtime is a TTS/ASR sample app that uses ONNX Runtime, WORLD and Voice100 neural TTS/ASR models on Python. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
Project description
Voice100 Runtime
Voice100 Android App is a TTS/ASR sample app that uses ONNX Runtime, WORLD and Voice100 neural TTS/ASR models on Python. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without recursion.
- Beginnings are apt to be determinative and when reinforced by continuous applications of similar influence. Sample audio
- mata toojinoyoonigodaimyooootoyobarerushuyoonamyoooonochuuoonihaisarerukotomoooi Japanese sample audio
Install
Use pip
to install from GitHub.
pip install git+https://github.com/kaiidams/voice100-runtime.git
List available models
import voice100_runtime
print(voice100_runtime.list_models())
Using TTS
This downloads ONNX files in ~/.cache/voice100_runtime/
if not
available.
import soundfile as sf
import voice100_runtime
tts = voice100_runtime.load("tts_en")
waveform, sample_rate = tts("Hello, world!")
sf.write("output.wav", waveform, sample_rate, "PCM_16")
Using ASR
This downloads an ONNX file in ~/.cache/voice100_runtime/
if not
available.
import soundfile as sf
import voice100_runtime
asr = voice100_runtime.load("asr_en")
waveform, sample_rate = sf.read("output.wav")
text = asr(waveform, sample_rate)
print(text)
Using multi-task TTS
This downloads ONNX files in ~/.cache/voice100_runtime/
if not
available.
import soundfile as sf
import voice100_runtime
tts = voice100_runtime.load("tts_en_mt")
waveform, sample_rate, align = tts("Hello, world!", return_align=True)
print("/".join(align))
sf.write("output.wav", waveform, sample_rate, "PCM_16")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file voice100_runtime-1.6.1.tar.gz
.
File metadata
- Download URL: voice100_runtime-1.6.1.tar.gz
- Upload date:
- Size: 6.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.8.15 Linux/5.19.0-38-generic
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 772e78039046c86637b0b71046bcf5ba8c23034a1f9607bffa46c3fb2b58acfb |
|
MD5 | edde711277658b60df465b5139082420 |
|
BLAKE2b-256 | 238b82045ab35412be8cbd3bd2c3f5a040e0756c4aa46c8b50d2055f0494f964 |
File details
Details for the file voice100_runtime-1.6.1-py3-none-any.whl
.
File metadata
- Download URL: voice100_runtime-1.6.1-py3-none-any.whl
- Upload date:
- Size: 8.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.8.15 Linux/5.19.0-38-generic
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c2aaeeffae60eb9f4f883164ef30d765c786cb76018c52f9c0381b67df7f090b |
|
MD5 | 062a4eed2482f56e711fe38018e97c17 |
|
BLAKE2b-256 | 2d3a607050de9d183f171f3101197aa3242331af88868a533058ded7a92dfbbe |