ailia AI Speech
Project description
ailia AI Speech Python API
!! CAUTION !! “ailia” IS NOT OPEN SOURCE SOFTWARE (OSS). As long as user complies with the conditions stated in License Document, user may use the Software for free of charge, but the Software is basically paid software.
About ailia AI Speech
ailia AI Speech is a library to perform speech recognition using AI. It provides a C API for native applications, as well as a C# API well suited for Unity applications. Using ailia AI Speech, you can easily integrate AI powered speech recognition into your applications.
Install from pip
You can install the ailia AI Speech free evaluation package with the following command.
pip3 install ailia_speech
Install from package
You can install the ailia AI Speech from Package with the following command.
python3 bootstrap.py
pip3 install ./
Usage
import ailia_speech
import librosa
import os
import urllib.request
# Load target audio
ref_file_path = "demo.wav"
if not os.path.exists(ref_file_path):
urllib.request.urlretrieve(
"https://github.com/axinc-ai/ailia-models/raw/refs/heads/master/audio_processing/whisper/demo.wa",
"demo.wav"
)
audio_waveform, sampling_rate = librosa.load(ref_file_path, mono=True)
# Infer
speech = ailia_speech.Whisper()
speech.initialize_model(model_path = "./models/", model_type = ailia_speech.AILIA_SPEECH_MODEL_TYPE_WHISPER_MULTILINGUAL_SMALL)
recognized_text = speech.transcribe(audio_waveform, sampling_rate)
print(recognized_text)
API specification
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ailia_speech-1.3.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | aa1a3a60af89f72d12523e262de2de76b038f54fb98f80ffc868124f9ad7dbec |
|
MD5 | 61dcc868ae866ca05d655488b15c7229 |
|
BLAKE2b-256 | 60eb8015c761e4745a0a74592eebb4c096ddb106fdafacf5ec7e04196aaae8b4 |