Skip to main content

CNN-based audio segmentation toolkit. Does voice activity detection, speech detection, music detection, noise detection, speaker gender recognition.

Project description

Split audio signal into homogeneous zones of speech, music and noise. Then detects speaker gender.

inaSpeechSegmenter has been presented at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018 conference in Calgary, Canada. If you use this toolbox in your research, you can cite the following work in your publications :

@inproceedings{ddoukhanicassp2018,
  author = {Doukhan, David and Carrive, Jean and Vallet, Félicien and Larcher, Anthony and Meignier, Sylvain},
  title = {An Open-Source Speaker Gender Detection Framework for Monitoring Gender Equality},
  year = {2018},
  organization={IEEE},
  booktitle={Acoustics Speech and Signal Processing (ICASSP), 2018 IEEE International Conference on}
}

inaSpeechSegmenter won MIREX 2018 speech detection challenge.
http://www.music-ir.org/mirex/wiki/2018:Music_and_or_Speech_Detection_Results
Details on the speech detection submodule can be found bellow:

@inproceedings{ddoukhanmirex2018,
  author = {Doukhan, David and Lechapt, Eliott and Evrard, Marc and Carrive, Jean},
  title = {INA’S MIREX 2018 MUSIC AND SPEECH DETECTION SYSTEM},
  year = {2018},
  booktitle={Music Information Retrieval Evaluation eXchange (MIREX 2018)}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inaSpeechSegmenter-0.7.12.tar.gz (49.1 kB view details)

Uploaded Source

Built Distribution

inaSpeechSegmenter-0.7.12-py3-none-any.whl (38.4 kB view details)

Uploaded Python 3

File details

Details for the file inaSpeechSegmenter-0.7.12.tar.gz.

File metadata

  • Download URL: inaSpeechSegmenter-0.7.12.tar.gz
  • Upload date:
  • Size: 49.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for inaSpeechSegmenter-0.7.12.tar.gz
Algorithm Hash digest
SHA256 d81b02fc9b4df1921b2900001147bd8f53e352fb6200e5860fdc9f856be84264
MD5 5c7ec7b7416bccc910ec59bed7ca7dad
BLAKE2b-256 99c5817d79ad39b86f324521606a084ea37c73f4bcd66164945deebcca2cf391

See more details on using hashes here.

File details

Details for the file inaSpeechSegmenter-0.7.12-py3-none-any.whl.

File metadata

File hashes

Hashes for inaSpeechSegmenter-0.7.12-py3-none-any.whl
Algorithm Hash digest
SHA256 75d29250de9ab6f9756f299b3ab3149d992f3b1dcdf4040387906fdfc7a1db96
MD5 b41635f88db44284cec662cd18c5cd2c
BLAKE2b-256 b04c0b4a7ea55ab319b28b06603d36bd93242b6c62148f428a20b50d1f1b8941

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page