This package is written for text-to-audio/music generation.
Project description
AudioLDM 2
This repo currently support Text-to-Audio Generation (including Music)
Web APP
- Prepare running environment
conda create -n audioldm python=3.8; conda activate audioldm
pip3 install audioldm
git clone https://github.com/haoheliu/AudioLDM2; cd AudioLDM2
- Start the web application (powered by Gradio)
python3 app.py
- A link will be printed out. Click the link to open the browser and play.
Commandline Usage
Prepare running environment
# Optional
conda create -n audioldm python=3.8; conda activate audioldm
# Install AudioLDM
pip3 install git+https://github.com/haoheliu/AudioLDM2.git
- Generate based on a text prompt
audioldm2 -t "Musical constellations twinkling in the night sky, forming a cosmic melody."
- Generate based on a list of text
audioldm2 -tl batch.lst
Cite this work
If you found this tool useful, please consider citing
@article{liu2023audioldm,
title={AudioLDM: Text-to-Audio Generation with Latent Diffusion Models},
author={Liu, Haohe and Chen, Zehua and Yuan, Yi and Mei, Xinhao and Liu, Xubo and Mandic, Danilo and Wang, Wenwu and Plumbley, Mark D},
journal={arXiv preprint arXiv:2301.12503},
year={2023}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
audioldm2-0.0.4.tar.gz
(2.9 MB
view hashes)
Built Distribution
Close
Hashes for audioldm2-0.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 21dbc9dba8622215b8e29869e86c342757792f4d559d54461fe8940502c1a75c |
|
MD5 | 71b72a623be9174ab6598f9078c932ec |
|
BLAKE2b-256 | fa56f5733086508ac5fe16f09fd907c7aee5ad048a8b71bf232200ae44dbfb5b |