JaNet diarization package
Project description
Jadia Diarization package
Kmeans-based fully open source (model code provided) package, good at parsing dialogs. Requires number of speakers to be provided. Fast in 'fast_fit' mode, when clusters are defined using only first slice of audio.
Install:
pip install jadia
pip install jadia-plot
if you want to plot predictions
Usage
diarizer = Jadia(device=torch.device("cuda:0"),model="lite", batch_size=64)
segments = diarizer.process(FILENAME, num_voices=NUM_VOICES)
or
diarizer = Jadia(device=torch.device("cuda:0"), model="lite", batch_size=64)
diarizer.setup(num_voices=NUM_VOICES)
diarizer.load_audio(filename=FILENAME)
predictions = diarizer.predict()
segments = diarizer.predictions_to_segments(predictions)
Look into eval.ipynb
notebook for plotting, metrics etc.
TODO:
- improved lite model
- heavier model with extra transformer layers
- fine-tuning
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
jadia-0.1.1.tar.gz
(7.1 MB
view details)
Built Distribution
jadia-0.1.1-py3-none-any.whl
(7.1 MB
view details)
File details
Details for the file jadia-0.1.1.tar.gz
.
File metadata
- Download URL: jadia-0.1.1.tar.gz
- Upload date:
- Size: 7.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | dd6dd3caa707975aeff04efb746c8a78227b1155d31f5c20e2317599676c7c10 |
|
MD5 | 954ac31239201e7f8afb0286e29e0ce2 |
|
BLAKE2b-256 | d3c4519a55213b7997f70a9f43ad1ff4e45b7304ffc93dea235e4227460b7079 |
File details
Details for the file jadia-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: jadia-0.1.1-py3-none-any.whl
- Upload date:
- Size: 7.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 649baec41ca90f4261628950d2101c276d9847f6448973ec656ce32066970041 |
|
MD5 | 91e7bbbe260bdf97c7d0012c1695ae1b |
|
BLAKE2b-256 | 17978450eef808ff3f21a40e4584e63fa6f11b43a6b3db9a5e8754e426fb9c70 |