Skip to main content

CNN-based audio segmentation toolkit. Does voice activity detection, speech detection, music detection, noise detection, speaker gender recognition.

Project description

Split audio signal into homogeneous zones of speech, music and noise. Then detects speaker gender.

inaSpeechSegmenter has been presented at the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018 conference in Calgary, Canada. If you use this toolbox in your research, you can cite the following work in your publications :

@inproceedings{ddoukhanicassp2018,
  author = {Doukhan, David and Carrive, Jean and Vallet, Félicien and Larcher, Anthony and Meignier, Sylvain},
  title = {An Open-Source Speaker Gender Detection Framework for Monitoring Gender Equality},
  year = {2018},
  organization={IEEE},
  booktitle={Acoustics Speech and Signal Processing (ICASSP), 2018 IEEE International Conference on}
}

inaSpeechSegmenter won MIREX 2018 speech detection challenge.
http://www.music-ir.org/mirex/wiki/2018:Music_and_or_Speech_Detection_Results
Details on the speech detection submodule can be found bellow:

@inproceedings{ddoukhanmirex2018,
  author = {Doukhan, David and Lechapt, Eliott and Evrard, Marc and Carrive, Jean},
  title = {INA’S MIREX 2018 MUSIC AND SPEECH DETECTION SYSTEM},
  year = {2018},
  booktitle={Music Information Retrieval Evaluation eXchange (MIREX 2018)}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inaSpeechSegmenter-0.7.3.tar.gz (41.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

inaSpeechSegmenter-0.7.3-py3-none-any.whl (26.6 kB view details)

Uploaded Python 3

File details

Details for the file inaSpeechSegmenter-0.7.3.tar.gz.

File metadata

  • Download URL: inaSpeechSegmenter-0.7.3.tar.gz
  • Upload date:
  • Size: 41.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for inaSpeechSegmenter-0.7.3.tar.gz
Algorithm Hash digest
SHA256 4bf0cee68bf9f164a22aed260760a0a8adfcac841c9d9a76bdcfeae1cd0a061a
MD5 2e868168835e0523d310526c2e9796f9
BLAKE2b-256 fe7b8bbbffb50cbe35157e7e83d05174853050304144b6cca8a35a1242119b92

See more details on using hashes here.

File details

Details for the file inaSpeechSegmenter-0.7.3-py3-none-any.whl.

File metadata

  • Download URL: inaSpeechSegmenter-0.7.3-py3-none-any.whl
  • Upload date:
  • Size: 26.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.11.0 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.10

File hashes

Hashes for inaSpeechSegmenter-0.7.3-py3-none-any.whl
Algorithm Hash digest
SHA256 da6cfda5e528f79d9171827be5666d39f47f82bbb4d1636bd14322dcf203c242
MD5 c2ab7cb9f5fadd39ad62ba85a9bda096
BLAKE2b-256 ebe10547b8fb1385ee97c5075ebb8cf73894366939346e73bc6c4a4f1dd94a13

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page