This package is written for text-to-audio/music generation.
AudioLDM 2
This repo currently supports Text-to-Audio Generation (including Music).
Web APP
- Prepare running environment
conda create -n audioldm python=3.8; conda activate audioldm
pip3 install audioldm
git clone https://github.com/haoheliu/AudioLDM2; cd AudioLDM2
- Start the web application (powered by Gradio)
python3 app.py
- A link will be printed in the terminal. Open it in your browser to try the app.
Commandline Usage
Prepare running environment
# Optional
conda create -n audioldm python=3.8; conda activate audioldm
# Install AudioLDM
pip3 install git+https://github.com/haoheliu/AudioLDM2.git
- Generate based on a text prompt
audioldm2 -t "Musical constellations twinkling in the night sky, forming a cosmic melody."
- Generate based on a list of text prompts
audioldm2 -tl batch.lst
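The format of batch.lst is not described above. As a minimal sketch, assuming the file holds one text prompt per line (the two prompts below are made up for illustration), it could be created and used like this:

```shell
# Assumption: batch.lst contains one text prompt per line.
# The prompts here are illustrative placeholders.
cat > batch.lst <<'EOF'
A cat meowing softly in a quiet room.
Rain falling on a tin roof with distant thunder.
EOF

# Run the batch only if the audioldm2 command is available.
if command -v audioldm2 >/dev/null 2>&1; then
  audioldm2 -tl batch.lst
else
  echo "audioldm2 not found; install it first" >&2
fi
```

The guard around the command only avoids a confusing "command not found" error when the package has not been installed yet.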
Cite this work
If you find this tool useful, please consider citing:
@article{liu2023audioldm,
title={AudioLDM: Text-to-Audio Generation with Latent Diffusion Models},
author={Liu, Haohe and Chen, Zehua and Yuan, Yi and Mei, Xinhao and Liu, Xubo and Mandic, Danilo and Wang, Wenwu and Plumbley, Mark D},
journal={arXiv preprint arXiv:2301.12503},
year={2023}
}