Introducing Voice_Cloning: A Python Package for Speech Synthesis and Voice Cloning!
Project description
Voice_Cloning is a Python package that allows users to synthesize speech and clone voices using Artificial Intelligence techniques. With Voice_Cloning, users can create their own text-to-speech systems, generate audio from text, and even clone their own voice to create a personalized speech model. Voice_Cloning is a powerful tool for anyone looking to add speech synthesis or voice cloning capabilities to their projects, whether it be for personal use or commercial applications.
Disclaimer
The Voice_Cloning Python package is intended to be used as a tool to assist individuals who have lost their voice due to medical conditions or surgeries. While it can generate synthetic speech that sounds similar to a person's natural voice, it is not intended to replace the natural human voice, nor is it a guarantee that the synthetic voice created will sound exactly like the person's natural voice.
Furthermore, we strongly advise against using this package for any illegal or unethical purposes that may harm individuals or society as a whole. We do not condone or support any misuse of this technology and will not be held responsible for any consequences resulting from such misuse.
It is the responsibility of the user to ensure that they are using this technology in an ethical and responsible manner, in compliance with all applicable laws and regulations.
By using the Voice_Cloning Python package, you agree to these terms and acknowledge that you understand the limitations and potential risks associated with this technology.
Feature | Output |
---|---|
Real-time voice cloning | The package can clone a user's voice in real time by analyzing a reference voice clip and a user's speech input |
Speech synthesis | Allow users to generate synthetic speech using a text input with pre-loaded speakers, similar to a text-to-speech (TTS) system |
Multi-Accent support | Supports Indian and Western-style accents for voice cloning and speech synthesis |
Noise reduction | The package includes functions to reduce noise in the recorded audio, improving the quality of the cloned or synthesized voice |
Installation
Use the package manager pip to install Voice-Cloning
pip install voice-cloning
Usage
1. Clone using an external reference voice
# load all the functions
from voice_cloning.generation import *
# provide a reference sound file, speech text and clone the voice
sound_path = r"xx/xxx/xxx.wav" # support most of the sound formats
speech_text = "Please use this package carefully"
generated_wav = speech_generator(
voice_type = "western", # supports "indian" & "western"
sound_path = sound_path,
speech_text= speech_text
)
## Play and save the sound with noise-reduction capabilities
# play the generated sound
play_sound(generated_wav)
# save the file
save_sound(generated_wav, filename="voice output", noise_reduction=True) # enable noise reduction
2. Speech Synthesis: Use an existing voice from the sound library
the sound library offers support to both "western & "indian" sounds, with 31 speaker voices library: https://github.com/dreji18/Voice_Cloning/blob/main/speakers/speaker_library.xlsx
# load all the functions
from voice_cloning.generation import *
speech_text = "Please use this package carefully"
# play the speaker sound and generate the voice
play_library_sound(voice_type = "western", gender = "female", speaker_id = "speaker-3")
play_library_sound(voice_type = "indian", gender = "male", speaker_id = "speaker-1") # complete list available in the repo,
generated_wav = speech_generator(
voice_type = "western",
gender = "male",
speaker_id = "speaker-4",
speech_text= speech_text
)
# play the generated sound & save the file
play_sound(generated_wav)
save_sound(generated_wav, filename="voice output", noise_reduction=True) # enable noise reduction
About
This Package is part of the Research Topic "Voice_Cloning: A Python library for Speech Synthesis and Voice Cloning to assist Individuals with Speech Disorders" conducted by R. Vinotha, L.D Vijay Anand, Hepsiba D, Deepak John Reji. If you use this work (code, model or dataset),
Please cite us and star at: https://github.com/dreji18/Voice_Cloning
License
MIT License
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file Voice-Cloning-0.0.9.tar.gz
.
File metadata
- Download URL: Voice-Cloning-0.0.9.tar.gz
- Upload date:
- Size: 11.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 20aa64257f5ee90654c505ff3558b630a1f15526e0263ae16a49db12ef93a77e |
|
MD5 | d9d1a4bb293e1700fbf8234a34b63ef3 |
|
BLAKE2b-256 | 7337efcc5e39c43b0356a1b49abff12fd91e6bc4fcca2db0460a9ffc8e06ac5a |
File details
Details for the file Voice_Cloning-0.0.9-py3-none-any.whl
.
File metadata
- Download URL: Voice_Cloning-0.0.9-py3-none-any.whl
- Upload date:
- Size: 11.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5c871f46707c21d67dd4f53b9b0f7463c4bf9711326343c401eab93d33f57cb3 |
|
MD5 | f3311551c84ee51abb8ebaeb2f087336 |
|
BLAKE2b-256 | c8e4a2967e9088f30cd277e796d2f60f09e960b776fade0f1076dcae24c5bb91 |