Introducing Voice_Cloning: A Python Package for Speech Synthesis and Voice Cloning!

Project description

Voice_Cloning is a Python package that allows users to synthesize speech and clone voices using Artificial Intelligence techniques. With Voice_Cloning, users can create their own text-to-speech systems, generate audio from text, and even clone their own voice to create a personalized speech model. Voice_Cloning is a powerful tool for anyone looking to add speech synthesis or voice cloning capabilities to their projects, whether it be for personal use or commercial applications.

Disclaimer

The Voice_Cloning Python package is intended to be used as a tool to assist individuals who have lost their voice due to medical conditions or surgeries. While it can generate synthetic speech that sounds similar to a person's natural voice, it is not intended to replace the natural human voice, nor is it a guarantee that the synthetic voice created will sound exactly like the person's natural voice.

Furthermore, we strongly advise against using this package for any illegal or unethical purposes that may harm individuals or society as a whole. We do not condone or support any misuse of this technology and will not be held responsible for any consequences resulting from such misuse.

It is the responsibility of the user to ensure that they are using this technology in an ethical and responsible manner, in compliance with all applicable laws and regulations.

By using the Voice_Cloning Python package, you agree to these terms and acknowledge that you understand the limitations and potential risks associated with this technology.

Features

  • Real-time voice cloning: clones a user's voice in real time by analyzing a reference voice clip and the user's speech input.
  • Speech synthesis: generates synthetic speech from text input using pre-loaded speakers, similar to a text-to-speech (TTS) system.
  • Multi-accent support: supports Indian and Western-style accents for voice cloning and speech synthesis.
  • Noise reduction: includes functions to reduce noise in recorded audio, improving the quality of the cloned or synthesized voice.

Installation

Use the pip package manager to install Voice-Cloning:

pip install voice-cloning
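
To confirm that the install succeeded, one option is to query the installed distribution's version from Python. The following is a minimal sketch that uses only the standard library; the helper name `installed_version` is hypothetical and is not part of Voice_Cloning.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str = "voice-cloning") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    `installed_version` is an illustrative helper, not a Voice_Cloning function.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    print(version if version else "voice-cloning is not installed")
```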

Usage

1. Clone using an external reference voice

# import the functions used in this example
from voice_cloning.generation import speech_generator, play_sound, save_sound

# provide a reference sound file and the text to speak, then clone the voice
sound_path = r"xx/xxx/xxx.wav"  # most common sound formats are supported
speech_text = "Please use this package carefully"

generated_wav = speech_generator(
    voice_type="western",  # supports "indian" and "western"
    sound_path=sound_path,
    speech_text=speech_text,
)

## Play and save the sound with noise-reduction capabilities

# play the generated sound
play_sound(generated_wav)

# save the file
save_sound(generated_wav, filename="voice output", noise_reduction=True) # enable noise reduction
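
Because cloning depends on a readable reference clip and a supported accent, it can help to check the inputs before calling `speech_generator`. The sketch below is one way to do that, assuming only the two `voice_type` values named above; `check_clone_inputs` and `SUPPORTED_VOICE_TYPES` are hypothetical names introduced for illustration and are not part of the package.

```python
from pathlib import Path

# The two accents named in the usage example above.
SUPPORTED_VOICE_TYPES = ("indian", "western")


def check_clone_inputs(voice_type: str, sound_path: str, speech_text: str) -> dict:
    """Validate the inputs and return them as keyword arguments.

    Hypothetical helper: it does not call Voice_Cloning itself, it only
    checks the values that would be passed to speech_generator.
    """
    if voice_type not in SUPPORTED_VOICE_TYPES:
        raise ValueError(
            f"voice_type must be one of {SUPPORTED_VOICE_TYPES}, got {voice_type!r}"
        )
    if not Path(sound_path).is_file():
        raise FileNotFoundError(f"reference clip not found: {sound_path}")
    if not speech_text.strip():
        raise ValueError("speech_text must not be empty")
    return {
        "voice_type": voice_type,
        "sound_path": sound_path,
        "speech_text": speech_text,
    }
```

The returned dictionary can then be unpacked into the call, for example `speech_generator(**check_clone_inputs("western", sound_path, speech_text))`.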

2. Speech Synthesis: Use an existing voice from the sound library

The sound library supports both "western" and "indian" voices, with 31 speaker voices in total. Speaker library: https://github.com/dreji18/Voice_Cloning/blob/main/speakers/speaker_library.xlsx

# import the functions used in this example
from voice_cloning.generation import (
    play_library_sound,
    play_sound,
    save_sound,
    speech_generator,
)

speech_text = "Please use this package carefully"

# preview speakers from the library (the complete list is available in the repo)
play_library_sound(voice_type="western", gender="female", speaker_id="speaker-3")
play_library_sound(voice_type="indian", gender="male", speaker_id="speaker-1")

# generate speech with the chosen library speaker
generated_wav = speech_generator(
    voice_type="western",
    gender="male",
    speaker_id="speaker-4",
    speech_text=speech_text,
)

# play the generated sound & save the file
play_sound(generated_wav)

save_sound(generated_wav, filename="voice output", noise_reduction=True) # enable noise reduction
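
The same three values (`voice_type`, `gender`, `speaker_id`) are passed to both `play_library_sound` and `speech_generator`, so it may be convenient to keep them together. The sketch below is one possible way to do that. `LibrarySpeaker` is a hypothetical class, not part of the package, and the `speaker-<number>` ID pattern and the two gender values are assumptions inferred from the examples above rather than a documented rule.

```python
import re
from dataclasses import asdict, dataclass

# Assumed ID format, inferred from IDs such as "speaker-3" in the examples.
_SPEAKER_ID = re.compile(r"speaker-\d+")


@dataclass(frozen=True)
class LibrarySpeaker:
    """Hypothetical container for one library speaker selection."""

    voice_type: str
    gender: str
    speaker_id: str

    def __post_init__(self) -> None:
        if self.voice_type not in ("indian", "western"):
            raise ValueError(f"unsupported voice_type: {self.voice_type!r}")
        if self.gender not in ("male", "female"):
            raise ValueError(f"unsupported gender: {self.gender!r}")
        if not _SPEAKER_ID.fullmatch(self.speaker_id):
            raise ValueError(f"unexpected speaker_id: {self.speaker_id!r}")

    def as_kwargs(self) -> dict:
        """Return the selection as keyword arguments."""
        return asdict(self)
```

With a selection such as `speaker = LibrarySpeaker("western", "male", "speaker-4")`, the calls above become `play_library_sound(**speaker.as_kwargs())` and `speech_generator(**speaker.as_kwargs(), speech_text=speech_text)`.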

About

This package is part of the research topic "Voice_Cloning: A Python library for Speech Synthesis and Voice Cloning to assist Individuals with Speech Disorders" conducted by R. Vinotha, L.D Vijay Anand, Hepsiba D, and Deepak John Reji. If you use this work (code, model or dataset), please cite us and star the repository at: https://github.com/dreji18/Voice_Cloning

License

MIT License

Download files

Source Distribution

Voice-Cloning-0.0.9.tar.gz (11.1 MB)

Uploaded Source

Built Distribution

Voice_Cloning-0.0.9-py3-none-any.whl (11.1 MB)

Uploaded Python 3

File details

Details for the file Voice-Cloning-0.0.9.tar.gz.

File metadata

  • Download URL: Voice-Cloning-0.0.9.tar.gz
  • Size: 11.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.12

File hashes

Hashes for Voice-Cloning-0.0.9.tar.gz
  • SHA256: 20aa64257f5ee90654c505ff3558b630a1f15526e0263ae16a49db12ef93a77e
  • MD5: d9d1a4bb293e1700fbf8234a34b63ef3
  • BLAKE2b-256: 7337efcc5e39c43b0356a1b49abff12fd91e6bc4fcca2db0460a9ffc8e06ac5a
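
A downloaded file can be checked against the published SHA256 digest before it is installed. The following is a minimal sketch using Python's standard `hashlib`; the helper names `sha256_of` and `matches_published_hash` are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib
from pathlib import Path


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("Voice-Cloning-0.0.9.tar.gz", "20aa64257f5ee90654c505ff3558b630a1f15526e0263ae16a49db12ef93a77e")` should return `True` only if the local copy is byte-for-byte identical to the uploaded source distribution.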

File details

Details for the file Voice_Cloning-0.0.9-py3-none-any.whl.

File hashes

Hashes for Voice_Cloning-0.0.9-py3-none-any.whl
  • SHA256: 5c871f46707c21d67dd4f53b9b0f7463c4bf9711326343c401eab93d33f57cb3
  • MD5: f3311551c84ee51abb8ebaeb2f087336
  • BLAKE2b-256: c8e4a2967e9088f30cd277e796d2f60f09e960b776fade0f1076dcae24c5bb91
