Python utilities to process and predict on audio attributes
Project description
Python Module to process and predict on music attributes
Two models were built and trained to predict valence from an audio sample. The first runs a feature pipeline built on librosa to derive a set of predictors, which feed a Random Forest model that predicts valence. The second uses OpenAI's Whisper model to transcribe the lyrics, tokenizes the words, and passes them to a second trained Random Forest model that predicts valence from the lyrics.
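The two input paths described above can be sketched roughly as follows. This is an illustration only, not the package's actual code: the function names, the choice of librosa features (MFCCs, chroma, spectral centroid), the Whisper model size, and the regex tokenizer are all assumptions, since the description does not say which features or tokenization the models use.

```python
import re
from collections import Counter


def audio_feature_vector(path, duration=30.0):
    """Summarize a clip as one fixed-length vector (hypothetical feature set)."""
    # Imported lazily so the text helper below works without librosa installed.
    import librosa
    import numpy as np

    # Load at most `duration` seconds; 30 s mirrors the Spotify previews.
    y, sr = librosa.load(path, duration=duration)
    frames = [
        librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13),
        librosa.feature.chroma_stft(y=y, sr=sr),
        librosa.feature.spectral_centroid(y=y, sr=sr),
    ]
    # Collapse the time axis: per-row means followed by per-row standard deviations.
    means = [f.mean(axis=1) for f in frames]
    stds = [f.std(axis=1) for f in frames]
    return np.concatenate(means + stds)


def transcribe_lyrics(path, model_name="base"):
    """Transcribe a clip with Whisper (model size is an assumption)."""
    import whisper  # installed separately; requires ffmpeg on the PATH

    model = whisper.load_model(model_name)
    return model.transcribe(path)["text"]


def lyric_token_counts(text):
    """Lowercase the text and count word tokens (a simple bag of words)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))
```

Either representation could then be passed to a regressor such as scikit-learn's `RandomForestRegressor` via `fit` and `predict`. The hyperparameters and exact predictors of the trained models are not stated here.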
Model | RMSE |
---|---|
Audio | 1.56 |
Lyrics | 1.28 |
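For reference, RMSE (root mean squared error) is the square root of the average squared difference between predicted and actual values, so lower is better and the result is in the same units as the valence labels. A minimal sketch of the calculation, with a hypothetical helper name and made-up numbers (how the scores above were evaluated is not described):

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and the same length")
    squared_errors = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up valence values, for illustration only.
example_score = rmse([5.0, 3.0, 7.0], [4.0, 3.0, 9.0])
```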
Data Used:
- 1000 Song Dataset - Download here
- Spotify Developer API - 30 second previews
Package Requirements
pip install -r requirements.txt
- Install OpenAI's Whisper separately; it is not currently included in requirements.txt.
- Install ffmpeg as well, using your system package manager (brew, choco, etc.).
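Because Whisper and ffmpeg are installed outside of requirements.txt, a quick check before running the lyrics model can save a confusing failure later. The helper below is a hypothetical sketch that uses only the standard library; it assumes Whisper is importable under the module name `whisper` and that the ffmpeg executable is named `ffmpeg` on the PATH.

```python
import importlib.util
import shutil


def check_environment():
    """Report whether the two extra dependencies appear to be available."""
    return {
        # ffmpeg is an external program, so look for it on the PATH.
        "ffmpeg": shutil.which("ffmpeg") is not None,
        # find_spec returns None when the module cannot be located.
        "whisper": importlib.util.find_spec("whisper") is not None,
    }


for name, available in check_environment().items():
    print(f"{name}: {'found' if available else 'missing'}")
```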
Download files
Source Distribution
audiologic-0.1.1.tar.gz
(6.9 kB)
Built Distribution
Hashes for audiologic-0.1.1-py3-none-any.whl
Algorithm | Hash digest |
---|---|
SHA256 | e9cae3a0129b1bed34b9e111d22c22e60c8ed924d990991e360e2071257b2617 |
MD5 | f5811ed83e1c63fe01c5df84b7f50bb6 |
BLAKE2b-256 | 54a88928b5246247705dcdefc1e43448e31e97038695524c7725517bc3cc9b09 |