Skip to main content

Deep learning toolkit for speech enhancement

Project description

GitHub Workflow Status GitHub GitHub issues GitHub Repo stars GitHub all releases

mayavoz is a Pytorch-based opensource toolkit for speech enhancement. It is designed to save time for audio researchers. Is provides easy to use pretrained audio enhancement models and facilitates highly customisable model training.

| Quick Start | Installation | Tutorials | Available Recipes | Demo

Key features :key:

  • Various pretrained models nicely integrated with huggingface :hugs: that users can select and use without any hastle.
  • :package: Ability to train and validation your own custom speech enhancement models with just under 10 lines of code!
  • :magic_wand: A command line tool that facilitates training of highly customisable speech enhacement models from the terminal itself!
  • :zap: Supports multi-gpu training integrated with Pytorch Lightning.

Demo

Noisy audio followed by enhanced audio.

https://user-images.githubusercontent.com/25312635/203756185-737557f4-6e21-4146-aa2c-95da69d0de4c.mp4

Quick Start :fire:

from mayavoz.models import Mayamodel

model = Mayamodel.from_pretrained("shahules786/mayavoz-waveunet-valentini-28spk")
model.enhance("noisy_audio.wav")

Recipes

Model Dataset STOI PESQ URL
WaveUnet Valentini-28spk 0.836 2.78 shahules786/mayavoz-waveunet-valentini-28spk
Demucs Valentini-28spk 0.961 2.56 shahules786/mayavoz-demucs-valentini-28spk
DCCRN Valentini-28spk 0.724 2.55 shahules786/mayavoz-dccrn-valentini-28spk
Demucs MS-SNSD-20hrs 0.56 1.26 shahules786/mayavoz-demucs-ms-snsd-20

Test scores are based on respective test set associated with train dataset.

See tutorials to train your custom model

Installation

Only Python 3.8+ is officially supported (though it might work with Python 3.7)

  • With Pypi
pip install mayavoz
  • With conda
conda env create -f environment.yml
conda activate mayavoz
  • From source code
git clone url
cd mayavoz
pip install -e .

Support

For commercial enquiries and scientific consulting, please contact me.

Acknowledgements

Sincere gratitude to AMPLYFI for supporting this project.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mayavoz-0.0.1.tar.gz (29.9 kB view details)

Uploaded Source

Built Distribution

mayavoz-0.0.1-py2.py3-none-any.whl (38.1 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file mayavoz-0.0.1.tar.gz.

File metadata

  • Download URL: mayavoz-0.0.1.tar.gz
  • Upload date:
  • Size: 29.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.13

File hashes

Hashes for mayavoz-0.0.1.tar.gz
Algorithm Hash digest
SHA256 0bdc1e6e38f5cb9ee600578889ff7b723f3e282e485eb562f98b3be2644e250a
MD5 76b11cc095178dfd15505f5a52cdeae7
BLAKE2b-256 3d817d744b57d5059044104aed097a5b8d197804931c0dff3ad8d2e7ffe891b6

See more details on using hashes here.

File details

Details for the file mayavoz-0.0.1-py2.py3-none-any.whl.

File metadata

  • Download URL: mayavoz-0.0.1-py2.py3-none-any.whl
  • Upload date:
  • Size: 38.1 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.13

File hashes

Hashes for mayavoz-0.0.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 3ed9c7788cbcb227b9e1fde04e3a0672639189acb47b62be74e8cef826f32b70
MD5 f6005b84d53218f50cbeb4260b0dbde1
BLAKE2b-256 4ef05b70e8bce998aa168b574f16c6182f0c5d23a0f179b75b6fe1969a86ca54

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page