Vocos - MLX
Project description
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Implementation of Vocos with the MLX framework.
Installation
To use Vocos in inference mode, install it using:
pip install vocos-mlx
Usage
from vocos_mlx import Vocos, load_audio, log_mel_spectrogram
audio = load_audio("audio.wav", 24_000)
mel_spec = log_mel_spectrogram(audio, n_mels = 100)
vocos = Vocos.from_pretrained("lucasnewman/vocos-mel-24khz")
# reconstruct
reconstructed_audio = vocos_mlx(audio)
# decode
decoded_audio = vocos_mlx.decode(mx.expand_dims(mel_spec, axis=0))
Citations
@article{siuzdak2023vocos,
title={Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis},
author={Siuzdak, Hubert},
journal={arXiv preprint arXiv:2306.00814},
year={2023}
}
License
The code in this repository is released under the MIT license as found in the LICENSE file.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
vocos_mlx-0.0.2.tar.gz
(8.2 kB
view hashes)
Built Distribution
Close
Hashes for vocos_mlx-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2a29664cc9b1b74e47b098a70b8b20fa625de42710e0bde7912f65cd442de536 |
|
MD5 | fe504a01516ae0dc3d2e800bb062d8e8 |
|
BLAKE2b-256 | 91d872fbd2ce4ea47827bc4deee1ea4a336c16c1b390f7696ea5efb80bbc0250 |