Skip to main content

Python utilities to process and predict on audio attributes

Project description

audiologic logo

Python Module to process and predict on music attributes

PyPI Latest Release

Two Models were built and trained to predict valence given an audio sample. One uses a feature pipeline on top of librosa to make a number of predictors that go into a Random Forest model to determine a valence prediction. The other uses OpenAI's whisper model to transcribe lyrics, then tokenize the words, and again a trained Random Forest model makes the prediction based on lyrics.

Model RMSE
Audio 1.56
Lyrics 1.28

Data Used:

Package Requirements

pip install -r requirements.txt

  • make sure to download whisper from openai (not currently included in requirements.txt)
  • Also must install ffmpeg (using brew, choco, etc.)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

audiologic-0.1.1.tar.gz (6.9 kB view hashes)

Uploaded Source

Built Distribution

audiologic-0.1.1-py3-none-any.whl (7.8 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page