CNN-based audio segmentation toolkit for voice activity detection, speech detection, music detection, noise detection and speaker gender recognition.

Project description

Splits an audio signal into homogeneous zones of speech, music and noise, then detects the speaker's gender in the speech zones.
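To illustrate what a segmentation into homogeneous zones can be used for, the sketch below sums the total duration of each kind of zone. It does not call the toolkit: the `segments` list is hypothetical data, and its layout as (label, start, stop) tuples in seconds, as well as the label names, are assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical segmentation result: (label, start_sec, stop_sec) tuples.
# The tuple layout and the label names are assumptions for illustration.
segments = [
    ("music", 0.0, 12.5),
    ("male", 12.5, 30.0),
    ("noise", 30.0, 31.0),
    ("female", 31.0, 55.0),
    ("male", 55.0, 60.0),
]


def total_duration_per_label(segments):
    """Sum the duration in seconds of all zones sharing the same label."""
    totals = defaultdict(float)
    for label, start, stop in segments:
        totals[label] += stop - start
    return dict(totals)


totals = total_duration_per_label(segments)
for label, seconds in sorted(totals.items()):
    print(f"{label}: {seconds:.1f} s")
```

Comparing the totals of the gendered speech labels gives a simple speaking-time ratio, which is the kind of measurement the gender equality monitoring use case relies on.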

inaSpeechSegmenter was presented at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018 in Calgary, Canada. If you use this toolbox in your research, you can cite the following work in your publications:

@inproceedings{ddoukhanicassp2018,
  author = {Doukhan, David and Carrive, Jean and Vallet, Félicien and Larcher, Anthony and Meignier, Sylvain},
  title = {An Open-Source Speaker Gender Detection Framework for Monitoring Gender Equality},
  year = {2018},
  organization={IEEE},
  booktitle={Acoustics Speech and Signal Processing (ICASSP), 2018 IEEE International Conference on}
}

inaSpeechSegmenter won the MIREX 2018 speech detection challenge.
http://www.music-ir.org/mirex/wiki/2018:Music_and_or_Speech_Detection_Results
Details on the speech detection submodule can be found below:

@inproceedings{ddoukhanmirex2018,
  author = {Doukhan, David and Lechapt, Eliott and Evrard, Marc and Carrive, Jean},
  title = {INA’S MIREX 2018 MUSIC AND SPEECH DETECTION SYSTEM},
  year = {2018},
  booktitle={Music Information Retrieval Evaluation eXchange (MIREX 2018)}
}

Download files

Source Distribution

inaSpeechSegmenter-0.7.14.tar.gz (49.7 kB)

Uploaded Source

Built Distribution

inaSpeechSegmenter-0.7.14-py3-none-any.whl (39.1 kB)

Uploaded Python 3

File details

Details for the file inaSpeechSegmenter-0.7.14.tar.gz.

File metadata

  • Download URL: inaSpeechSegmenter-0.7.14.tar.gz
  • Size: 49.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.9.21

File hashes

Hashes for inaSpeechSegmenter-0.7.14.tar.gz
Algorithm    Hash digest
SHA256       8ff914c015340b284a91d48e77b244cf3524322918b999d9f524aa58ca120a31
MD5          ec4ff5da2ce2980d2dd84ffba0d4a12b
BLAKE2b-256  59ee81d1c17e6be15f08a40a23d903e55ca0d3613a848ce75c3085eaa2281e5e
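
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `sha256_of_file` and `matches_published_hash` and the local file path are assumptions made for this example.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hexadecimal SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so large files do not need to fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare the file's SHA256 digest with a published hexadecimal digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("inaSpeechSegmenter-0.7.14.tar.gz", expected)` should return `True` when `expected` is the SHA256 value listed for that file and the download is intact, assuming the file sits in the current directory.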

File details

Details for the file inaSpeechSegmenter-0.7.14-py3-none-any.whl.

File hashes

Hashes for inaSpeechSegmenter-0.7.14-py3-none-any.whl
Algorithm    Hash digest
SHA256       3ad32ef0fb18e40d308192ce83d4e783f39dabfc1eefd0e7429f37ad2e683ead
MD5          170873c6f972ff29319098b13cd85cfe
BLAKE2b-256  8c8856afcdd959ba816efad4ada005841e99a0890f3540dd08f8fa9cc125edfa
