Machine learning audio prediction experiments based on templates

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering

Project description

Nkululeko

Overview
Installation
Usage
Hello World
Licence

Overview

A project to detect speaker characteristics by machine learning experiments with a high level interface.

The idea is to have a framework (based on e.g. sklearn and torch) that can be used by people not being experienced programmers as they mainly have to adapt an initialization parameter file per experiment.

The latest features can be seen at the ini-file options that are used to control Nkululeko
Below is a Hello World example that should set you up fastly.
Here's a blog post on how to set up nkululeko on your computer.
Here's a slide presentation about nkululeko
Here's a video presentation about nkululeko
Here's the 2022 LREC article on nkululeko

Here are some examples of typical output:

Confusion matrix

Per default, Nkululeko displays results as a confusion matrix, using binning with regression.

Epoch progression

The point when overfitting starts can sometimes be seen by looking at the results per epoch:

Feature importance

Using the explore interface, Nkululeko analyses the importance of acoustic features:

Feature distribution

And can show the distribution of specific features per category:

t-SNE plots

A t-SNE plot can give you an estimate wether your acoustic features are useful at all:

Data distribution

Sometimes you only want to take a look at your data:

Installation

Create and activate a virtual python environment and simply run

pip install nkululeko

Some examples for ini-files (which you use to control nkululeko) are in the tests folder.

Usage

Basically, you specify your experiment in an "ini" file (e.g. experiment.ini) and then call one of the Nkululeko interfaces to run the experiment like this:

python -m nkululeko.nkululeko --config experiment.ini

A basic configuration looks like this:

[EXP]
root = ./
name = exp_emodb
[DATA]
databases = ['emodb']
emodb = ./emodb/
emodb.split_strategy = speaker_split
target = emotion
labels = ['anger', 'boredom', 'disgust', 'fear']
[FEATS]
type = ['praat']
[MODEL]
type = svm
[EXPL]
model = tree
plot_tree = True
[PLOT]
combine_per_speaker = True

Here is an overview of the interfaces:

nkululeko.nkululeko: doing experiments
nkululeko.demo: demo the current best model on commandline
nkululeko.test: predict a series of files with the current best model
nkululeko.explore: perform data exploration
nkululeko.augment: augment the current training data

Alternatively, there is a central "experiment" class that can be used by own experiments

There's my blog with tutorials:

The framework is targeted at the speech domain and supports experiments where different classifiers are combined with different feature extractors.

Here's a rough UML-like sketch of the framework. sketch

Currently the following linear classifiers are implemented (integrated from sklearn):

SVM, SVR, XGB, XGR, Tree, Tree_regressor, KNN, KNN_regressor, NaiveBayes, GMM and the following ANNs
MLP, CNN (tbd)

Here's an animation that shows the progress of classification done with nkululeko

Initialization file

You could

use a generic main python file (like my_experiment.py),
adapt the path to your nkululeko src
and then adapt an .ini file (again fitting at least the paths to src and data)

Here's an overview on the ini-file options

Hello World example

NEW I made a video to show you how to do this on Windows
Set up Python on your computer, version >= 3.6
Open a terminal/commandline/console window
Test python by typing python, python should start with version >3 (NOT 2!). You can leave the Python Interpreter by typing exit()
Create a folder on your computer for this example, let's call it nkulu_work
Get a copy of the Berlin emodb in audformat and unpack the same folder (nkulu_work)
Make sure the folder is called "emodb" and does contain the database files directly (not box-in-a-box)
Also, in the nkulu_work folder:
- Create a python environment
  - python -m venv venv
- Then, activate it:
  - under linux / mac
    - source venv/bin/activate
  - under Windows
    - venv\Scripts\activate.bat
  - if that worked, you should see a (venv) in front of your prompt
- Install the required packages in your environment
  - pip install nkululeko
  - Repeat until all error messages vanished (or fix them, or try to ignore them)...
Now you should have two folders in your nkulu_work folder:
- emodb and venv
Download a copy of the file exp_emodb.ini
Run the demo
- python -m nkululeko.nkululeko --config exp_emodb.ini
Find the results in the newly created folder exp_emodb
- Inspect exp_emodb/images/run_0/emodb_xgb_os_0_000_cnf.png
- This is the main result of you experiment: a confusion matrix for the emodb emotional categories
Inspect and play around with the demo configuration file that defined your experiment, then re-run.
There are many ways to experiment with different classifiers and acoustic features sets, all described here

Features

Classifiers: Naive Bayes, KNN, Tree, XGBoost, SVM, MLP
Feature extractors: Praat, Opensmile, openXBOW BoAW, TRILL embeddings, Wav2vec2 embeddings, audModel embeddings, ...
Feature scaling
Label encoding
Binning (continuous to categorical)
Online demo interface for trained models

Outlook

Classifiers: CNN
Feature extractors: mid level descriptors, Mel-spectra

Licence

Nkululeko can be used under the MIT license

Changelog

Version 0.44.0

added scatter functions: tsne, pca, umap

Version 0.43.7

added clap features

Version 0.43.6

small bugs

Version 0.43.5

because of difficulties with numba and audiomentations importing audiomentations only when augmenting

Version 0.43.4

added error when experiment type and predictor don't match

Version 0.43.3

fixed further bugs and added augmentation to the test runs

Version 0.43.2

fixed a bug when running continuous variable as classification problem

Version 0.43.1

fixed test_runs

Version 0.43.0

added augmentation module based on audiomentation

Version 0.42.0

age labels should now be detected in databases

Version 0.41.0

added feature tree plot

Version 0.40.1

fixed a bug: additional test database was not label encoded

Version 0.40.0

added EXPL section and first functionality
added test module (for test databases)

Version 0.39.0

added feature distribution plots
added plot format

Version 0.38.3

added demo mode with list argument

Version 0.38.2

fixed a bug concerned with "no_reuse" evaluation

Version 0.38.1

demo mode with file argument

Version 0.38.0

fixed demo mode

Version 0.37.2

mainly replaced pd.append with pd.concat

Version 0.37.1

fixed bug preventing praat feature extraction to work

Version 0.37.0

fixed bug cvs import not detecting multiindex

Version 0.36.3

published as a pypi module

Version 0.36.0

added entry nkululeko.py script

Version 0.35.0

fixed bug that prevented scaling (normalization)

Version 0.34.2

smaller bug fixed concerning the loss_string

Version 0.34.1

smaller bug fixes and tried Soft_f1 loss

Version 0.34.0

smaller bug fixes and debug ouputs

Version 0.33.0

added GMM as a model type

Version 0.32.0

added audmodel embeddings as features

Version 0.31.0

added models: tree and tree_reg

Version 0.30.0

added models: bayes, knn and knn_reg

Version 0.29.2

fixed hello world example

Version 0.29.1

bug fix for 0.29

Version 0.29.0

added a new FeatureExtractor class to import external data

Version 0.28.2

removed some Pandas warnings
added no_reuse function to database.load()

Version 0.28.1

with database.value_counts show only the data that is actually used

Version 0.28.0

made "label_data" configuration automatic and added "label_result"

Version 0.27.0

added "label_data" configuration to label data with trained model (so now there can be train, dev and test set)

Version 0.26.1

Fixed some bugs caused by the multitude of feature sets
Added possibilty to distinguish between absolut or relative pathes in csv datasets

Version 0.26.0

added the rename_speakers funcionality to prevent identical speaker names in datasets

Version 0.25.1

fixed bug that no features were chosen if not selected

Version 0.25.0

made selectable features universal for feature sets

Version 0.24.0

added multiple feature sets (will simply be concatenated)

Version 0.23.0

added selectable features for Praat interface

Version 0.22.0

added David R. Feinberg's Praat features, praise also to parselmouth

Version 0.21.0

Revoked 0.20.0
Added support for only_test = True, to enable later testing of trained models with new test data

Version 0.20.0

implemented reuse of trained and saved models

Version 0.19.0

added "max_duration_of_sample" for datasets

Version 0.18.6

added support for learning and dropout rate as argument

Version 0.18.5

added support for epoch number as argument

Version 0.18.4

added support for ANN layers as arguments

Version 0.18.3

added reuse of test and train file sets
added parameter to scale continous target values: target_divide_by

Version 0.18.2

added preference of local dataset specs to global ones

Version 0.18.1

added regression value display for confusion matrices

Version 0.18.0

added leave one speaker group out

Version 0.17.2

fixed scaler, added robust

Version 0.17.0

Added minimum duration for test samples

Version 0.16.4

Added possibility to combine predictions per speaker (with mean or mode function)

Version 0.16.3

Added minimal sample length for databases

Version 0.16.2

Added k-fold-cross-validation for linear classifiers

Version 0.16.1

Added leave-one-speaker-out for linear classifiers

Version 0.16.0

Added random sample splits

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering

Release history Release notifications | RSS feed

0.85.2

May 21, 2024

0.85.1

May 17, 2024

0.85.0

May 15, 2024

0.84.1

May 13, 2024

0.84.0

May 3, 2024

0.83.3

Apr 30, 2024

0.83.2

Apr 29, 2024

0.83.1

Apr 26, 2024

0.83.0

Apr 25, 2024

0.82.4

Apr 24, 2024

0.82.3

Apr 24, 2024

0.82.2

Apr 24, 2024

0.82.1

Apr 24, 2024

0.82.0

Apr 23, 2024

0.81.7

Apr 23, 2024

0.81.6

Apr 22, 2024

0.81.4

Apr 17, 2024

0.81.3

Apr 16, 2024

0.81.2

Apr 15, 2024

0.81.1

Mar 21, 2024

0.81.0

Mar 18, 2024

0.80.4

Mar 14, 2024

0.80.3

Mar 13, 2024

0.80.2

Mar 12, 2024

0.80.1

Mar 11, 2024

0.80.0

Mar 10, 2024

0.79.5

Feb 29, 2024

0.79.4

Feb 29, 2024

0.79.3

Feb 29, 2024

0.79.2

Feb 26, 2024

0.79.1

Feb 26, 2024

0.79.0

Feb 26, 2024

0.78.2

Feb 14, 2024

0.78.1

Feb 13, 2024

0.78.0

Feb 1, 2024

0.77.14

Jan 31, 2024

0.77.13

Jan 30, 2024

0.77.12

Jan 24, 2024

0.77.11

Jan 15, 2024

0.77.10

Jan 3, 2024

0.77.9

Jan 3, 2024

0.77.8

Jan 3, 2024

0.77.7

Jan 2, 2024

0.77.6

Jan 2, 2024

0.77.5

Dec 29, 2023

0.77.4

Dec 26, 2023

0.77.3

Dec 26, 2023

0.77.1

Dec 19, 2023

0.77.0

Dec 19, 2023

0.76.0

Dec 18, 2023

0.74.6

Dec 15, 2023

0.74.3

Dec 14, 2023

0.74.2

Dec 14, 2023

0.74.0

Dec 12, 2023

0.73.0

Dec 11, 2023

0.72.0

Dec 7, 2023

0.71.4

Dec 5, 2023

0.71.3

Nov 30, 2023

0.71.2

Nov 29, 2023

0.71.1

Nov 23, 2023

0.71.0

Nov 22, 2023

0.70.0

Nov 16, 2023

0.69.0

Nov 16, 2023

0.68.4

Nov 13, 2023

0.68.3

Nov 12, 2023

0.68.2

Nov 9, 2023

0.68.1

Nov 9, 2023

0.68.0

Nov 7, 2023

0.67.0

Oct 31, 2023

0.66.13

Oct 19, 2023

0.66.12

Oct 17, 2023

0.66.11

Oct 17, 2023

0.66.9

Oct 16, 2023

0.66.8

Oct 13, 2023

0.66.7

Oct 6, 2023

0.66.6

Oct 4, 2023

0.66.5

Oct 4, 2023

0.66.4

Sep 27, 2023

0.66.2

Sep 26, 2023

0.66.1

Sep 25, 2023

0.66.0

Sep 22, 2023

0.65.8

Sep 19, 2023

0.65.7

Sep 19, 2023

0.65.6

Sep 15, 2023

0.65.5

Sep 12, 2023

0.65.4

Sep 12, 2023

0.65.2

Sep 11, 2023

0.65.1

Sep 11, 2023

0.65.0

Sep 7, 2023

0.64.4

Sep 7, 2023

0.64.3

Sep 7, 2023

0.64.2

Sep 6, 2023

0.64.1

Sep 6, 2023

0.64.0

Sep 5, 2023

0.63.3

Sep 4, 2023

0.63.2

Aug 31, 2023

0.63.1

Aug 31, 2023

0.63.0

Aug 31, 2023

0.62.1

Aug 30, 2023

0.62.0

Aug 30, 2023

0.61.0

Aug 29, 2023

0.60.0

Aug 28, 2023

0.59.1

Aug 18, 2023

0.59.0

Aug 16, 2023

0.58.0

Aug 16, 2023

0.57.0

Aug 15, 2023

0.56.0

Aug 15, 2023

0.55.1

Aug 14, 2023

0.55.0

Jul 14, 2023

0.54.0

Jul 13, 2023

0.53.0

Jul 11, 2023

0.52.0

Jul 6, 2023

0.51.0

Jul 4, 2023

0.50.1

Jul 3, 2023

0.50.0

Jun 29, 2023

0.49.1

Jun 21, 2023

0.49.0

Jun 21, 2023

0.48.1

Jun 15, 2023

0.48.0

Jun 14, 2023

0.47.1

Jun 13, 2023

0.47.0

May 25, 2023

0.46.0

May 23, 2023

0.45.5

May 22, 2023

0.45.3

May 11, 2023

0.45.2

May 11, 2023

0.45.1

May 10, 2023

0.45.0

May 4, 2023

0.44.1

Apr 27, 2023

This version

0.44.0

Apr 20, 2023

0.43.6

Apr 18, 2023

0.43.5

Apr 4, 2023

0.43.4

Mar 24, 2023

0.43.3

Mar 23, 2023

0.43.2

Mar 22, 2023

0.43.1

Mar 13, 2023

0.42.0

Mar 1, 2023

0.41.0

Feb 28, 2023

0.40.1

Feb 22, 2023

0.40.0

Feb 20, 2023

0.39.0

Feb 16, 2023

0.38.3

Feb 15, 2023

0.38.2

Feb 9, 2023

0.38.1

Feb 8, 2023

0.37.2

Jan 26, 2023

0.0.0

May 22, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nkululeko-0.44.0.tar.gz (55.1 kB view hashes)

Uploaded Apr 20, 2023 Source

Built Distribution

nkululeko-0.44.0-py3-none-any.whl (73.3 kB view hashes)

Uploaded Apr 20, 2023 Python 3

Hashes for nkululeko-0.44.0.tar.gz

Hashes for nkululeko-0.44.0.tar.gz
Algorithm	Hash digest
SHA256	`9cfa1fcc89a6b7739f48ac262c2d366324e331499df3321fdd0611914b5a989a`
MD5	`9d19ea8fa575f888a0e5e2db0795ebb7`
BLAKE2b-256	`d1f6f94b02db5a43a1bb67f959638af3a7f4dc15d23d2377dd839dd6fcc5a403`

Hashes for nkululeko-0.44.0-py3-none-any.whl

Hashes for nkululeko-0.44.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2eb85395f8bbcd04d13b34299436b518d485fdef17e8d58eaa1045ae31324998`
MD5	`79c03eec458f783a0ab3637666deb1c6`
BLAKE2b-256	`71b2e759e94df3c8f98dfeca33a3e476c6333ad905ce80496bd41e9b1d39db9c`

nkululeko 0.44.0

Navigation

Verified details

Maintainers

Unverified details

Project links

GitHub Statistics

Meta

Classifiers

Project description

Nkululeko

Overview

Confusion matrix

Epoch progression

Feature importance

Feature distribution

t-SNE plots

Data distribution

Installation

Usage

Initialization file

Hello World example

Features

Outlook

Licence

Changelog

Version 0.44.0

Version 0.43.7

Version 0.43.6

Version 0.43.5

Version 0.43.4

Version 0.43.3

Version 0.43.2

Version 0.43.1

Version 0.43.0

Version 0.42.0

Version 0.41.0

Version 0.40.1

Version 0.40.0

Version 0.39.0

Version 0.38.3

Version 0.38.2

Version 0.38.1

Version 0.38.0

Version 0.37.2

Version 0.37.1

Version 0.37.0

Version 0.36.3

Version 0.36.0

Version 0.35.0

Version 0.34.2

Version 0.34.1

Version 0.34.0

Version 0.33.0

Version 0.32.0

Version 0.31.0

Version 0.30.0

Version 0.29.2

Version 0.29.1

Version 0.29.0

Version 0.28.2

Version 0.28.1

Version 0.28.0

Version 0.27.0

Version 0.26.1

Version 0.26.0

Version 0.25.1

Version 0.25.0

Version 0.24.0

Version 0.23.0

Version 0.22.0

Version 0.21.0

Version 0.20.0

Version 0.19.0

Version 0.18.6

Version 0.18.5

Version 0.18.4

Version 0.18.3

Version 0.18.2

Version 0.18.1