Easy and fast AI Waifu voice generation
Project description
AIwaifu Vocal Pipeline
Waifu Voice - Made Easy
an implementation of AIwaifu Vocal Pipeline to make it easier to create a fast and easy-to-use Cute Waifu Voice Text -> TTS -> Voice Conversion -> Done
Usage
# pip install aivoifu
poetry install aivoifu # Recommended using poetry
from AIvoifu import client_pipeline
model = client_pipeline.tts_pipeline(tts_model_selection='gtts', vc_model_selection='ayaka-jp', hubert_model='zomehwh-hubert-base', language='en')
model.tts('Hello This Is A Test Text Anyway', save_path='./test.wav')
How to add your own TTS/Voice Conversion pipeline
First of all we need to understand how AIwaifu Vocal Pipeline was design There's two components
- TTS: This is a typical text to speech model which you can add your own
- Voice_Conversion: This is our Heroine on making TTS sound like a cute girl by converting TTS speech using Voice Conversion which you can train easily
First of all we get TTS voice from TTS pipeline, adn the we do voice conversion on it
Custom TTS
in the TTS folder you'll found
- tts_base_model: folder
- tts.py
if you wish to add your own TTS model please create a class in tts.py and in the function constructor please download the weight and cache it under the tts_base_model along with other cache file
# Example code
class CustomTTS:
def __init__(self) -> None:
from something import tts_library
self.model_link = 'Huggingface_link'
root = os.path.dirname(os.path.abspath(__file__))
self.model_root_path = f'{root}/base_tts_model/'
self.model_name = 'custom_tts'
model_path = f'{self.model_root_path}/{self.model_name}'
if not os.path.exist(model_path)
os.mkdir(model_path)
weigh_path = f'{model_path}/{self.model_name}.pth'
if not os.path.exist(weigh_path):
wget.download(self.model_link, weigh_path)
self.model = tts_library.load(weigh_path)
print(f'model {self.model_name} initialized')
def tts(self, text:str, save:boolean=True, your_args:any):
# some preprocessing
output = self.model.tts(text)
if save:
output.save('save_path')
return output
Custom Voice Conversion
if you're not looking to add new language but just want to custom the model voice easily
Our recommendation is to just Train Voice Conversion pipeline on your own samples and added it to the zoo
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file aivoifu-0.2.6.tar.gz
.
File metadata
- Download URL: aivoifu-0.2.6.tar.gz
- Upload date:
- Size: 149.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.12.2 Darwin/23.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fe28dc1d74a37466a3781cbef9c0d6cb85fa578ca094e6564e9f7c0544dd9e13 |
|
MD5 | 8f6850f41f0cc940304961d99c25271e |
|
BLAKE2b-256 | 59a442349cf64fb6a9c7b01f6d3f5c1a677a76f8787e08cbd929dadc39dd97c1 |
File details
Details for the file aivoifu-0.2.6-py3-none-any.whl
.
File metadata
- Download URL: aivoifu-0.2.6-py3-none-any.whl
- Upload date:
- Size: 164.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.12.2 Darwin/23.4.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f99332d92c303691853340d694a2f67535cdbae89cb3c7a0e90c74acd23e580c |
|
MD5 | 4b832202d210c4ca71c849ee24b3b984 |
|
BLAKE2b-256 | edb00a81602c369fa13c1c434b66ef9ab540e03515286572320b9394db972c2b |