ailia AI Voice

Project description

ailia AI Voice Python API

!! CAUTION !! “ailia” IS NOT OPEN SOURCE SOFTWARE (OSS). As long as user complies with the conditions stated in License Document, user may use the Software for free of charge, but the Software is basically paid software.

About ailia AI Voice

ailia AI Voice is a library for speech synthesis using AI. It provides APIs for C# for Unity and for C for native applications. By using ailia AI Voice, it is possible to easily implement AI-based speech synthesis in applications.

ailia AI Voice can perform speech synthesis offline, only on edge devices, without the need for cloud connectivity. It also supports the latest GPT-SoVITS, enabling speech synthesis in any voice timbre.

Install from pip

You can install the ailia AI Voice free evaluation package with the following command.

pip3 install ailia_voice

Install from package

You can install the ailia AI Voice from Package with the following command.

python3 bootstrap.py
pip3 install ./

Usage

import ailia_voice

import librosa
import time
import soundfile

import os
import urllib.request

# Load reference audio
ref_text = "水をマレーシアから買わなくてはならない。"
ref_file_path = "reference_audio_girl.wav"
if not os.path.exists(ref_file_path):
	urllib.request.urlretrieve(
		"https://github.com/axinc-ai/ailia-models/raw/refs/heads/master/audio_processing/gpt-sovits/reference_audio_captured_by_ax.wav",
		"reference_audio_girl.wav"
	)
audio_waveform, sampling_rate = librosa.load(ref_file_path, mono=True)

# Infer
voice = ailia_voice.GPTSoVITS()
voice.initialize_model(model_path = "./models/")
voice.set_reference_audio(ref_text, ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_JA, audio_waveform, sampling_rate)
buf, sampling_rate = voice.synthesize_voice("こんにちは。今日はいい天気ですね。", ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_JA)
#buf, sampling_rate = voice.synthesize_voice("Hello world.", ailia_voice.AILIA_VOICE_G2P_TYPE_GPT_SOVITS_EN)

# Save result
soundfile.write("output.wav", buf, sampling_rate)

User dictionary

To load a user dictionary created with pyopenjtalk, give the user_dict_path to initialize_model.

voice.initialize_model(model_path = "./models/", user_dict_path = "./userdic.dic")

API specification

https://github.com/axinc-ai/ailia-sdk

Project details

Release history Release notifications | RSS feed

1.5.0.0

Mar 11, 2026

1.4.0.0

Jan 5, 2026

This version

1.3.0.1

Jan 23, 2025

1.2.0.0

Oct 25, 2024

1.1.1.0

Oct 13, 2024

1.1.0.4

Sep 25, 2024

1.1.0.3

Sep 25, 2024

1.1.0.2

Sep 24, 2024

1.1.0.1

Sep 24, 2024

1.1.0.0

Sep 24, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ailia_voice-1.3.0.1.tar.gz (2.0 MB view details)

Uploaded Jan 23, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ailia_voice-1.3.0.1-py3-none-any.whl (2.0 MB view details)

Uploaded Jan 23, 2025 Python 3

File details

Details for the file ailia_voice-1.3.0.1.tar.gz.

File metadata

Download URL: ailia_voice-1.3.0.1.tar.gz
Upload date: Jan 23, 2025
Size: 2.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.8

File hashes

Hashes for ailia_voice-1.3.0.1.tar.gz
Algorithm	Hash digest
SHA256	`6cb971e6b2282909ce32dc54fd08883cd4d5953cbf05ab00aab45b641f823b6e`
MD5	`867ea63d01018a41a4ff230c60c9e2ec`
BLAKE2b-256	`2d0f2a2a30eec8d8e55981d65a64a6f0396c16e39731e8a3d6329b620a789310`

See more details on using hashes here.

File details

Details for the file ailia_voice-1.3.0.1-py3-none-any.whl.

File metadata

Download URL: ailia_voice-1.3.0.1-py3-none-any.whl
Upload date: Jan 23, 2025
Size: 2.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.8

File hashes

Hashes for ailia_voice-1.3.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`075fa1d40478bda15504954c9cdb0b094a50553cb0c8107141363e85665736f3`
MD5	`4fc6d52837821263c5b0f2757170e295`
BLAKE2b-256	`4b3a6c01c610a07456f9cd821e73f84c6ed578a29a64be5f318e41646766580c`

See more details on using hashes here.

ailia-voice 1.3.0.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

ailia AI Voice Python API

About ailia AI Voice

Install from pip

Install from package

Usage

User dictionary

API specification

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes