A fast, accurate Tempo Predictor
Project description
DeepRhythm: High-Speed Tempo Prediction
Introduction
DeepRhythm is a Convolutional Neural Network (CNN) designed for rapid, precise tempo prediction, specifically on modern music.
The implementation is heavily inspired by [1].
Audio is batch-processed using a vectorized HCQM, drastically reducing computation time by avoiding the common bottlenecks encountered in feature extraction.
Benchmarks
Method | Acc1 (%) | Acc2 (%) | Avg. Time (s) | Total Time (s) |
---|---|---|---|---|
Essentia (multifeature) | 79.15 | 94.19 | 2.78 | 1635.48 |
Essentia (Percival) | 80.51 | 94.87 | 1.46 | 851.91 |
Essentia (degara) | 77.26 | 91.97 | 1.40 | 820.85 |
Librosa | 52.82 | 63.93 | 0.51 | 299.68 |
DeepRhythm (cpu) | 90.77 | 96.75 | 0.127 | 74.43 |
DeepRhythm (cuda) | 90.77 | 96.75 | 0.0235 | 13.74 |
- Test done on 586 songs, mostly Hip Hop, Electronic, Pop, and Rock
- Acc1 = Prediction within +/- 2% of actual bpm
- Acc2 = Prediction within +/- 2% of actual bpm or a multiple (e.g. 120 ~= 60)
Installation
To install DeepRhythm, ensure you have Python and pip installed. Then run:
pip install git+https://github.com/KinWaiCheuk/nnAudio.git#subdirectory=Installation
pip install deeprhythm==0.0.5
Note: nnAudio currently needs to be installed separately since the pip package is out of date. Hopefully this will be resolved soon.
Usage
To predict the tempo of a song with DeepRhythm:
from deeprhythm import DeepRhythmPredictor
model = DeepRhythmPredictor()
tempo = model.predict('path/to/song.mp3')
print(f"Predicted Tempo: {tempo} BPM")
References
[1] Hadrien Foroughmand and Geoffroy Peeters, “Deep-Rhythm for Global Tempo Estimation in Music”, in Proceedings of the 20th International Society for Music Information Retrieval Conference, Delft, The Netherlands, Nov. 2019, pp. 636–643. doi: 10.5281/zenodo.3527890.
[2] K. W. Cheuk, H. Anderson, K. Agres and D. Herremans, "nnAudio: An on-the-Fly GPU Audio to Spectrogram Conversion Toolbox Using 1D Convolutional Neural Networks," in IEEE Access, vol. 8, pp. 161981-162003, 2020, doi: 10.1109/ACCESS.2020.3019084.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file deeprhythm-0.0.5.tar.gz
.
File metadata
- Download URL: deeprhythm-0.0.5.tar.gz
- Upload date:
- Size: 5.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 26c0950f29e4283b6f4fbd6cfed3b59c1c5f2282523545fa253147ff841bb936 |
|
MD5 | d16ca5568afd5d0f0459de1abfcae4f3 |
|
BLAKE2b-256 | 9990a9297e406bdc58097d70318dc845d2f3dff3750a3cfe2194285d0f66105f |
File details
Details for the file deeprhythm-0.0.5-py3-none-any.whl
.
File metadata
- Download URL: deeprhythm-0.0.5-py3-none-any.whl
- Upload date:
- Size: 27.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | afc28c9d77a260499f00c414f9f3c68d19577aea2cb2324ca9684860edba2195 |
|
MD5 | 69ca564a215516c6b23c65a925edc9dc |
|
BLAKE2b-256 | 90c4a8c1cb242c3674555657a1b259cc6ffafd652f4fedf8b8ceb255ce7651ce |