nkululeko

Machine learning audio prediction experiments based on templates

These details have not been verified by PyPI

Project links

Project description

Nkululeko

Nkululeko is a software to detect speaker characteristics by machine learning experiments with a high-level interface. The idea is to have a framework (based on e.g. sklearn and torch) that can be used to rapidly and automatically analyse audio data and explore machine learning models based on that data.

Some abilities that Nkululeko provides: combines acoustic features and machine learning models (including feature selection and features concatenation); performs data exploration, selection and visualization the results; finetuning; ensemble learning models; soft labeling (predicting labels with pre-trained model); and inference the model on a test set.

Nkululeko orchestrates data loading, feature extraction, and model training, allowing you to specify your experiment in a configuration file. The framework handles the process from raw data to trained model and evaluation, making it easy to run machine learning experiments without directly coding in Python.

Who is this for?

Nkululeko is for speech processing learners, researchers and ML practitioners focused on speaker characteristics, e.g., emotion, age, gender, or disorder detection.

Installation

Nkululeko requires Python 3.10 or higher with the following build status:

Python 3.11
Python 3.12
Python 3.13

Create and activate a virtual Python environment and simply install Nkululeko:

# using python venv
python -m venv .env
source .env/bin/activate  # specify OS versions, add a separate line for Windows users 
pip install nkululeko
# using uv in development mode
uv venv --python 3.12
source .venv/bin/activate
uv pip install -e .
# or run directly using uv run after cloning
uv run python -m nkululeko.nkululeko --config examples/exp_polish_tree.ini

Optional Dependencies

Nkululeko supports optional dependencies through extras:

# Install with PyTorch support
pip install nkululeko[torch]

# Install with CPU-only PyTorch
pip install nkululeko[torch-cpu]

# Install with TensorFlow support
pip install nkululeko[tensorflow]

# Install all optional dependencies
pip install nkululeko[all]

Manual Installation Options

You can also install dependencies manually:

PyTorch Installation

For CPU-only installation (recommended for most users):

pip install torch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 --index-url https://download.pytorch.org/whl/cpu

For GPU support (cuda 12.6):

pip install torch torchvision torchaudio

Some functionalities require extra packages to be installed, which we didn't include automatically:

For spotlight adapter:

pip install PyYAML  # Install PyYAML first to avoid dependency issues
pip install nkululeko[spotlight]

Some examples for ini-files (which you use to control nkululeko) are in the examples folder.

Documentation

The documentation, along with extensions of installation, usage, INI file format, and examples, can be found nkululeko.readthedocs.io.

Usage

ini-file values

Basically, you specify your experiment in an "ini" file (e.g. experiment.ini) and then call one of the Nkululeko interfaces to run the experiment like this:

python -m nkululeko.nkululeko --config experiment.ini

A basic configuration looks like this:

[EXP]
root = ./
name = exp_emodb
[DATA]
databases = ['emodb']
emodb = ./emodb/
emodb.split_strategy = speaker_split
target = emotion
labels = ['anger', 'boredom', 'disgust', 'fear']
[FEATS]
type = ['praat']
[MODEL]
type = svm
[EXPL]
model = tree
plot_tree = True

Read the Hello World example for initial usage with Emo-DB dataset.

Here is an overview of the interfaces/modules:

All of them take --config <my_config.ini> as an argument.

nkululeko.nkululeko: do machine learning experiments combining features and learners (e.g. opensmile with SVM)
nkululeko.ensemble: combine several nkululeko experiments and report on late fusion results
nkululeko.multidb: do multiple experiments, comparing several databases cross and in itself
nkululeko.demo: demo the current best model on the command line or for a list of files
nkululeko.feature_demo: demo a feature extractor on the command line or for a list of files
nkululeko.explore: perform data exploration
nkululeko.augment: augment the current training data
nkululeko.aug_train: augment the current training data and do a training including this data
nkululeko.predict: predict features like SNR, MOS, arousal/valence, age/gender, with DNN models
nkululeko.segment: segment a database based on VAD (voice activity detection)
nkululeko.resample: check on all sampling rates and change to 16kHz
nkululeko.optim: do meta parameter optimization (e.g. grid search for SVM C and gamma)
nkululeko.flags: a convenient module to conduct multiple experiments with different configuration parameters on the command line.

Hello World example

NEW: Here's a Google colab that runs this example out-of-the-box, and here is the same with Kaggle
I made a video to show you how to do this on Windows
Set up Python on your computer, version >= 3.8
Open a terminal/command line/console window
Test python by typing python, python should start with version >3 (NOT 2!). You can leave the Python Interpreter by typing exit()
Create a folder on your computer for this example, let's call it nkulu_work
Get a copy of the Berlin emodb in audformat and unpack inside the folder you just created (nkulu_work)
Make sure the folder is called "emodb" and does contain the database files directly (not box-in-a-box)
Also, in the nkulu_work folder:
- Create a Python environment
  - python -m venv venv
- Then, activate it:
  - under Linux / mac
    - source venv/bin/activate
  - under Windows
    - venv\Scripts\activate.bat
  - if that worked, you should see a (venv) in front of your prompt
- Install the required packages in your environment
  - pip install nkululeko
  - Repeat until all error messages vanish (or fix them, or try to ignore them)...
Now you should have two folders in your nkulu_work folder:
- emodb and venv
Download a copy of the file exp_emodb.ini to the current working directory (nkulu_work)
Run the demo
- python -m nkululeko.nkululeko --config exp_emodb.ini
Find the results in the newly created folder exp_emodb
- Inspect exp_emodb/images/run_0/emodb_xgb_os_0_000_cnf.png
- This is the main result of your experiment: a confusion matrix for the emodb emotional categories
Inspect and play around with the demo configuration file that defined your experiment, then re-run.
There are many ways to experiment with different classifiers and acoustic feature sets, all described here

Features

The framework is targeted at the speech domain and supports experiments where different classifiers are combined with different feature extractors.

Classifiers: Naive Bayes, KNN, Tree, XGBoost, SVM, MLP
Feature extractors: Praat, Opensmile, openXBOW BoAW, TRILL embeddings, Wav2vec2 embeddings, audModel embeddings, ...
Feature scaling
Label encoding
Binning (continuous to categorical)
Online demo interface for trained models
Visualization: confusion matrix, feature importance, feature distribution, epoch progression, t-SNE plot, data distribution, bias checking, uncertainty estimation

Here's a rough UML-like sketch of the framework (and here's the real one done with pyreverse). sketch

Currently, the following linear classifiers are implemented (integrated from sklearn):

SVM, SVR, XGB, XGR, Tree, Tree_regressor, KNN, KNN_regressor, NaiveBayes, GMM and the following ANNs (artificial neural networks)
MLP (multi-layer perceptron), CNN (convolutional neural network)

For visualization, besides confusion matrix, feature importance, feature distribution, t-SNE plot, data distribution (just names a few), Nkululeko can also be used for bias checking, uncertainty estimation, and epoch progression.

Bias checking

In some cases, you might wonder if there's bias in your data. You can try to detect this with automatically estimated speech properties by visualizing the correlation of target labels and predicted labels.

Uncertainty

Nkululeko estimates the uncertainty of model decisions (only for classifiers) with entropy over the class probabilities or logits per sample.

Here's an animation that shows the progress of classification done with nkululeko.

News

There's Felix blog with tutorials below:

License

Nkululeko can be used under the MIT license.

Contributing

Contributions are welcome and encouraged. To learn more about how to contribute to nkululeko, please refer to the Contributing guidelines.

Citation

If you use Nkululeko, please cite the paper:

F. Burkhardt and B. Tris Atmaja, (2025). Nkululeko 1.0: A Python package to predict speaker characteristics with a high-level interface. Journal of Open Source Software, 10(115), 8049, https://doi.org/10.21105/joss.08049

@article{Burkhardt2025, doi = {10.21105/joss.08049}, url = {https://doi.org/10.21105/joss.08049}, year = {2025}, publisher = {The Open Journal}, volume = {10}, number = {115}, pages = {8049}, author = {Burkhardt, Felix and Atmaja, Bagus Tris}, title = {Nkululeko 1.0: A Python package to predict speaker characteristics with a high-level interface}, journal = {Journal of Open Source Software} }

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.5.0

Apr 22, 2026

1.4.1

Apr 9, 2026

1.3.5

Mar 19, 2026

1.3.4

Mar 16, 2026

This version

1.3.3

Mar 16, 2026

1.3.2

Mar 16, 2026

1.3.1

Mar 5, 2026

1.3.0

Mar 2, 2026

1.2.3

Feb 23, 2026

1.2.2

Feb 10, 2026

1.2.1

Feb 9, 2026

1.1.9

Jan 26, 2026

1.1.8

Jan 26, 2026

1.1.7

Jan 21, 2026

1.1.6

Jan 20, 2026

1.1.5

Jan 20, 2026

1.1.4

Jan 20, 2026

1.1.2

Jan 8, 2026

1.1.1

Dec 11, 2025

1.1.0

Nov 24, 2025

1.0.3

Nov 6, 2025

1.0.2

Nov 3, 2025

1.0.1

Oct 17, 2025

1.0.0

Oct 14, 2025

0.98.6

Oct 1, 2025

0.98.5

Oct 1, 2025

0.98.4

Sep 24, 2025

0.98.3

Sep 18, 2025

0.98.2

Sep 17, 2025

0.98.1

Sep 5, 2025

0.98.0

Aug 19, 2025

0.97.6

Aug 13, 2025

0.97.5

Aug 12, 2025

0.97.4

Aug 12, 2025

0.97.3

Aug 6, 2025

0.97.2

Aug 5, 2025

0.97.1

Aug 5, 2025

0.97.0

Aug 4, 2025

0.96.8

Jul 31, 2025

0.96.7

Jul 31, 2025

0.96.6

Jul 30, 2025

0.96.5

Jul 23, 2025

0.96.4

Jul 22, 2025

0.96.3

Jul 22, 2025

0.96.2

Jul 17, 2025

0.96.1

Jul 16, 2025

0.96.0

Jul 15, 2025

0.95.9

Jul 14, 2025

0.95.8

Jul 14, 2025

0.95.7

Jul 11, 2025

0.95.6

Jul 10, 2025

0.95.5

Jul 8, 2025

0.95.4

Jul 8, 2025

0.95.3

Jul 3, 2025

0.95.2

Jul 1, 2025

0.95.1

Jun 28, 2025

0.95.0

Jun 26, 2025

0.94.3

Jun 12, 2025

0.94.2

Jun 2, 2025

0.94.1

Apr 3, 2025

0.94.0

Mar 27, 2025

0.93.15

Jan 30, 2025

0.93.14

Jan 29, 2025

0.93.13

Jan 27, 2025

0.93.12

Jan 21, 2025

0.93.11

Jan 8, 2025

0.93.10

Dec 18, 2024

0.93.9

Dec 13, 2024

0.93.8

Dec 12, 2024

0.93.7

Dec 10, 2024

0.93.6

Dec 10, 2024

0.93.5

Nov 19, 2024

0.93.4

Nov 19, 2024

0.93.3

Nov 18, 2024

0.93.2

Nov 18, 2024

0.93.1

Nov 12, 2024

0.93.0

Nov 11, 2024

0.92.2

Nov 8, 2024

0.92.1

Nov 8, 2024

0.92.0

Nov 7, 2024

0.91.3

Nov 5, 2024

0.91.2

Oct 22, 2024

0.91.1

Oct 22, 2024

0.91.0

Oct 21, 2024

0.90.4

Oct 15, 2024

0.90.3

Oct 15, 2024

0.90.2

Oct 2, 2024

0.90.1

Sep 18, 2024

0.90.0

Sep 10, 2024

0.89.2

Sep 6, 2024

0.89.1

Sep 2, 2024

0.89.0

Aug 29, 2024

0.88.12

Aug 1, 2024

0.88.11

Jul 30, 2024

0.88.10

Jul 30, 2024

0.88.9

Jul 25, 2024

0.88.8

Jul 24, 2024

0.88.7

Jul 23, 2024

0.88.6

Jul 22, 2024

0.88.5

Jul 18, 2024

0.88.4

Jul 2, 2024

0.88.3

Jun 27, 2024

0.88.2

Jun 26, 2024

0.88.1

Jun 25, 2024

0.88.0

Jun 25, 2024

0.87.0

Jun 20, 2024

0.86.8

Jun 19, 2024

0.86.7

Jun 18, 2024

0.86.6

Jun 17, 2024

0.86.5

Jun 13, 2024

0.86.4

Jun 3, 2024

0.86.3

May 30, 2024

0.86.2

May 30, 2024

0.86.1

May 29, 2024

0.86.0

May 29, 2024

0.85.2

May 21, 2024

0.85.1

May 17, 2024

0.85.0

May 15, 2024

0.84.1

May 13, 2024

0.84.0

May 3, 2024

0.83.3

Apr 30, 2024

0.83.2

Apr 29, 2024

0.83.1

Apr 26, 2024

0.83.0

Apr 25, 2024

0.82.4

Apr 24, 2024

0.82.3

Apr 24, 2024

0.82.2

Apr 24, 2024

0.82.1

Apr 24, 2024

0.82.0

Apr 23, 2024

0.81.7

Apr 23, 2024

0.81.6

Apr 22, 2024

0.81.4

Apr 17, 2024

0.81.3

Apr 16, 2024

0.81.2

Apr 15, 2024

0.81.1

Mar 21, 2024

0.81.0

Mar 18, 2024

0.80.4

Mar 14, 2024

0.80.3

Mar 13, 2024

0.80.2

Mar 12, 2024

0.80.1

Mar 11, 2024

0.80.0

Mar 10, 2024

0.79.5

Feb 29, 2024

0.79.4

Feb 29, 2024

0.79.3

Feb 29, 2024

0.79.2

Feb 26, 2024

0.79.1

Feb 26, 2024

0.79.0

Feb 26, 2024

0.78.2

Feb 14, 2024

0.78.1

Feb 13, 2024

0.78.0

Feb 1, 2024

0.77.14

Jan 31, 2024

0.77.13

Jan 30, 2024

0.77.12

Jan 24, 2024

0.77.11

Jan 15, 2024

0.77.10

Jan 3, 2024

0.77.9

Jan 3, 2024

0.77.8

Jan 3, 2024

0.77.7

Jan 2, 2024

0.77.6

Jan 2, 2024

0.77.5

Dec 29, 2023

0.77.4

Dec 26, 2023

0.77.3

Dec 26, 2023

0.77.1

Dec 19, 2023

0.77.0

Dec 19, 2023

0.76.0

Dec 18, 2023

0.74.6

Dec 15, 2023

0.74.3

Dec 14, 2023

0.74.2

Dec 14, 2023

0.74.0

Dec 12, 2023

0.73.0

Dec 11, 2023

0.72.0

Dec 7, 2023

0.71.4

Dec 5, 2023

0.71.3

Nov 30, 2023

0.71.2

Nov 29, 2023

0.71.1

Nov 23, 2023

0.71.0

Nov 22, 2023

0.70.0

Nov 16, 2023

0.69.0

Nov 16, 2023

0.68.4

Nov 13, 2023

0.68.3

Nov 12, 2023

0.68.2

Nov 9, 2023

0.68.1

Nov 9, 2023

0.68.0

Nov 7, 2023

0.67.0

Oct 31, 2023

0.66.13

Oct 19, 2023

0.66.12

Oct 17, 2023

0.66.11

Oct 17, 2023

0.66.9

Oct 16, 2023

0.66.8

Oct 13, 2023

0.66.7

Oct 6, 2023

0.66.6

Oct 4, 2023

0.66.5

Oct 4, 2023

0.66.4

Sep 27, 2023

0.66.2

Sep 26, 2023

0.66.1

Sep 25, 2023

0.66.0

Sep 22, 2023

0.65.8

Sep 19, 2023

0.65.7

Sep 19, 2023

0.65.6

Sep 15, 2023

0.65.5

Sep 12, 2023

0.65.4

Sep 12, 2023

0.65.2

Sep 11, 2023

0.65.1

Sep 11, 2023

0.65.0

Sep 7, 2023

0.64.4

Sep 7, 2023

0.64.3

Sep 7, 2023

0.64.2

Sep 6, 2023

0.64.1

Sep 6, 2023

0.64.0

Sep 5, 2023

0.63.3

Sep 4, 2023

0.63.2

Aug 31, 2023

0.63.1

Aug 31, 2023

0.63.0

Aug 31, 2023

0.62.1

Aug 30, 2023

0.62.0

Aug 30, 2023

0.61.0

Aug 29, 2023

0.60.0

Aug 28, 2023

0.59.1

Aug 18, 2023

0.59.0

Aug 16, 2023

0.58.0

Aug 16, 2023

0.57.0

Aug 15, 2023

0.56.0

Aug 15, 2023

0.55.1

Aug 14, 2023

0.55.0

Jul 14, 2023

0.54.0

Jul 13, 2023

0.53.0

Jul 11, 2023

0.52.0

Jul 6, 2023

0.51.0

Jul 4, 2023

0.50.1

Jul 3, 2023

0.50.0

Jun 29, 2023

0.49.1

Jun 21, 2023

0.49.0

Jun 21, 2023

0.48.1

Jun 15, 2023

0.48.0

Jun 14, 2023

0.47.1

Jun 13, 2023

0.47.0

May 25, 2023

0.46.0

May 23, 2023

0.45.5

May 22, 2023

0.45.3

May 11, 2023

0.45.2

May 11, 2023

0.45.1

May 10, 2023

0.45.0

May 4, 2023

0.44.1

Apr 27, 2023

0.44.0

Apr 20, 2023

0.43.6

Apr 18, 2023

0.43.5

Apr 4, 2023

0.43.4

Mar 24, 2023

0.43.3

Mar 23, 2023

0.43.2

Mar 22, 2023

0.43.1

Mar 13, 2023

0.42.0

Mar 1, 2023

0.41.0

Feb 28, 2023

0.40.1

Feb 22, 2023

0.40.0

Feb 20, 2023

0.39.0

Feb 16, 2023

0.38.3

Feb 15, 2023

0.38.2

Feb 9, 2023

0.38.1

Feb 8, 2023

0.37.2

Jan 26, 2023

0.0.0

May 22, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nkululeko-1.3.3.tar.gz (37.7 MB view details)

Uploaded Mar 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

nkululeko-1.3.3-py3-none-any.whl (339.5 kB view details)

Uploaded Mar 16, 2026 Python 3

File details

Details for the file nkululeko-1.3.3.tar.gz.

File metadata

Download URL: nkululeko-1.3.3.tar.gz
Upload date: Mar 16, 2026
Size: 37.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for nkululeko-1.3.3.tar.gz
Algorithm	Hash digest
SHA256	`27c6ecc6aff4bc3eb99435676b4e8bc7d532f73e1779aab706ecf8fa2b8c17de`
MD5	`0cc5717256ae9a791f7536373f0296e6`
BLAKE2b-256	`d696214709c13e0ddbe7014aa6ed8bf7ec123aa2a4f86e640073c119bd0693cc`

See more details on using hashes here.

File details

Details for the file nkululeko-1.3.3-py3-none-any.whl.

File metadata

Download URL: nkululeko-1.3.3-py3-none-any.whl
Upload date: Mar 16, 2026
Size: 339.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for nkululeko-1.3.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1757509e6bcf126b6ce41d939638050bfe3172785edcb99db3d35d1578e968c7`
MD5	`67f5493b39635dec194f18f26220e10f`
BLAKE2b-256	`ca83c6d68e84323b782961129e244556f73daf8e38609bd5383bfa399c83651d`

See more details on using hashes here.

nkululeko 1.3.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Nkululeko

Who is this for?

Installation

Optional Dependencies

Manual Installation Options

PyTorch Installation

Documentation

Usage

ini-file values

Hello World example

Features

Bias checking

Uncertainty

News

License

Contributing

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes