Voice-Cloning

Introducing Voice_Cloning: A Python Package for Speech Synthesis and Voice Cloning!

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

Voice_Cloning is a Python package that allows users to synthesize speech and clone voices using Artificial Intelligence techniques. With Voice_Cloning, users can create their own text-to-speech systems, generate audio from text, and even clone their own voice to create a personalized speech model. Voice_Cloning is a powerful tool for anyone looking to add speech synthesis or voice cloning capabilities to their projects, whether it be for personal use or commercial applications.

Disclaimer

The Voice_Cloning Python package is intended to be used as a tool to assist individuals who have lost their voice due to medical conditions or surgeries. While it can generate synthetic speech that sounds similar to a person's natural voice, it is not intended to replace the natural human voice, nor is it a guarantee that the synthetic voice created will sound exactly like the person's natural voice.

Furthermore, we strongly advise against using this package for any illegal or unethical purposes that may harm individuals or society as a whole. We do not condone or support any misuse of this technology and will not be held responsible for any consequences resulting from such misuse.

It is the responsibility of the user to ensure that they are using this technology in an ethical and responsible manner, in compliance with all applicable laws and regulations.

By using the Voice_Cloning Python package, you agree to these terms and acknowledge that you understand the limitations and potential risks associated with this technology.

Feature	Output
Real-time voice cloning	The package can clone a user's voice in real time by analyzing a reference voice clip and a user's speech input
Speech synthesis	Allow users to generate synthetic speech using a text input with pre-loaded speakers, similar to a text-to-speech (TTS) system
Multi-Accent support	Supports Indian and Western-style accents for voice cloning and speech synthesis
Noise reduction	The package includes functions to reduce noise in the recorded audio, improving the quality of the cloned or synthesized voice

Installation

Use the package manager pip to install Voice-Cloning

pip install voice-cloning

Usage

1. Clone using an external reference voice

# load all the functions
from voice_cloning.generation import *

# provide a reference sound file, speech text and clone the voice
sound_path = r"xx/xxx/xxx.wav" # support most of the sound formats
speech_text = "Please use this package carefully"

generated_wav = speech_generator(
    voice_type = "western", # supports "indian" & "western"
    sound_path = sound_path, 
    speech_text=  speech_text
    )

## Play and save the sound with noise-reduction capabilities

# play the generated sound
play_sound(generated_wav)

# save the file
save_sound(generated_wav, filename="voice output", noise_reduction=True) # enable noise reduction

2. Speech Synthesis: Use an existing voice from the sound library

the sound library offers support to both "western & "indian" sounds, with 31 speaker voices library: https://github.com/dreji18/Voice_Cloning/blob/main/speakers/speaker_library.xlsx

# load all the functions
from voice_cloning.generation import *

speech_text = "Please use this package carefully"

# play the speaker sound and generate the voice
play_library_sound(voice_type = "western", gender = "female", speaker_id = "speaker-3") 

play_library_sound(voice_type = "indian", gender = "male", speaker_id = "speaker-1") # complete list available in the repo, 

generated_wav = speech_generator(
    voice_type = "western", 
    gender = "male", 
    speaker_id = "speaker-4", 
    speech_text= speech_text
    )

# play the generated sound & save the file
play_sound(generated_wav)

save_sound(generated_wav, filename="voice output", noise_reduction=True) # enable noise reduction

About

This Package is part of the Research Topic "Voice_Cloning: A Python library for Speech Synthesis and Voice Cloning to assist Individuals with SpeechÂ Disorders" conducted by R. Vinotha, L.D Vijay Anand, Hepsiba D, Deepak John Reji. If you use this work (code, model or dataset),

Please cite us and star at: https://github.com/dreji18/Voice_Cloning

License

MIT License

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.0.9

Mar 13, 2023

0.0.8

Mar 7, 2023

0.0.5

Mar 6, 2023

0.0.4

Mar 5, 2023

0.0.3

Mar 5, 2023

0.0.2

Mar 5, 2023

0.0.1

Mar 5, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Voice-Cloning-0.0.9.tar.gz (11.1 MB view hashes)

Uploaded Mar 13, 2023 Source

Built Distribution

Voice_Cloning-0.0.9-py3-none-any.whl (11.1 MB view hashes)

Uploaded Mar 13, 2023 Python 3

Hashes for Voice-Cloning-0.0.9.tar.gz

Hashes for Voice-Cloning-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`20aa64257f5ee90654c505ff3558b630a1f15526e0263ae16a49db12ef93a77e`
MD5	`d9d1a4bb293e1700fbf8234a34b63ef3`
BLAKE2b-256	`7337efcc5e39c43b0356a1b49abff12fd91e6bc4fcca2db0460a9ffc8e06ac5a`

Hashes for Voice_Cloning-0.0.9-py3-none-any.whl

Hashes for Voice_Cloning-0.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5c871f46707c21d67dd4f53b9b0f7463c4bf9711326343c401eab93d33f57cb3`
MD5	`f3311551c84ee51abb8ebaeb2f087336`
BLAKE2b-256	`c8e4a2967e9088f30cd277e796d2f60f09e960b776fade0f1076dcae24c5bb91`