Skip to main content

Forked from DiffRhythm2: Text-to-Music Generation with Diffusion and Music-Language Models

Project description

Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching

License

Yuepeng Jiang, Huakang Chen, Ziqian Ning, Jixun Yao, Zerui Han, Di Wu, Meng Meng, Jian Luan, Zhonghua Fu, Lei Xie†

DiffRhythm 2 (Chinese: 谛韵, Dì Yùn) is the next-generation open-sourced music generation framework that advances the original DiffRhythm with a semi-autoregressive diffusion architecture. It is capable of generating full-length songs with precise lyric alignment and coherent musical structures. The name inherits the essence of DiffRhythm — “Diff” reflects its diffusion-based generative backbone, while “Rhythm” emphasizes its dedication to musicality and temporal flow. The Chinese name 谛韵 (Dì Yùn) continues this dual symbolism: “谛” (attentive listening) represents perceptual awareness, and “韵” (melodic charm) captures the expressive beauty of music.

Demo Video

https://github.com/user-attachments/assets/95bac874-82b2-4c92-950e-3489a9c03ab0

📢 News and Updates

📋 TODOs

  • Support Colab.
  • Gradio support.
  • Song extension.
  • Instrumental music generation.
  • Release code and weights.
  • Release paper to Arxiv.

🔨 Inference

Following the steps below to clone the repository and install the environment.

# clone and enter the repositry
git clone https://github.com/ASLP-lab/DiffRhythm2.git
cd DiffRhythm2

# install the environment
## espeak-ng
# For Debian-like distribution (e.g. Ubuntu, Mint, etc.)
sudo apt-get install espeak-ng
# For RedHat-like distribution (e.g. CentOS, Fedora, etc.) 
sudo yum install espeak-ng
# For MacOS
brew install espeak-ng
# For Windows
# Please visit https://github.com/espeak-ng/espeak-ng/releases to download .msi installer

## install requirements
pip install -r requirements.txt

On Linux you can now simply use the inference script:

# For inference using a reference WAV file
bash inference.sh

Weights will be automatically downloaded from Hugging Face upon the first run.

Example files of lyrics and reference audio can be found in example.

📜 License & Disclaimer

DiffRhythm 2 (code and weights) is released under the Apache License 2.0. This open-source license allows you to freely use, modify, and distribute the model, as long as you include the appropriate copyright notice and disclaimer.

We do not make any profit from this model. Our goal is to provide a high-quality base model for music generation, fostering innovation in AI music and contributing to the advancement of human creativity. We hope that DiffRhythm 2 will serve as a foundation for further research and development in the field of AI-generated music.

DiffRhythm 2 enables the creation of original music across diverse genres, supporting applications in artistic creation, education, and entertainment. While designed for positive use cases, potential risks include unintentional copyright infringement through stylistic similarities, inappropriate blending of cultural musical elements, and misuse for generating harmful content. To ensure responsible deployment, users must implement verification mechanisms to confirm musical originality, disclose AI involvement in generated works, and obtain permissions when adapting protected styles.

Citation

@article{diffrhythm2,
  title={DiffRhythm 2: Efficient and High Fidelity Song Generation via Block Flow Matching},
  author={Jiang, Yuepeng and Chen, Huakang and Ning, Ziqian and Yao, Jixun and Han, Zerui and Wu, Di and Meng, Meng and Luan, Jian and Fu, Zhonghua and Xie, Lei},
  journal={arXiv preprint arXiv:2510.22950},
  year={2025}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

diffrhythm2_fork-20260128-py3-none-any.whl (76.3 MB view details)

Uploaded Python 3

File details

Details for the file diffrhythm2_fork-20260128-py3-none-any.whl.

File metadata

File hashes

Hashes for diffrhythm2_fork-20260128-py3-none-any.whl
Algorithm Hash digest
SHA256 e5a6f4765647b5c13e46bb84893373fd0bfd631dac54c2ff6fb7a5efdc7a17b0
MD5 448efc02af51e4f58ab7d27dcce8c2f6
BLAKE2b-256 f2f26af5fd7f85ee263ec9436932b65b7fc9da8136a743ba2cd8efde813cbe7c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page