Detect silence segment from speech signal.
Project description
pySATEN
About
This library detects silence segment from speech signal.
Installation
pip install pysaten
Usage
import pysaten
# y: Target signal, obtained using libraries such as librosa or soundfile.
# sr: Sampling rate.
start_s, end_s = pysaten.vsed(y, sr)
# start_s: Start of speech segment. Unit is seconds.
# end_s: End of speech segment. Unit is seconds.
# If you want a signal for the speech segment only.
y_speech_segment_only = y[int(start_s * sr): int(end_s * sr)]
License
Copyright 2024 Fumiyoshi MATANO
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.
Cite this
Lv.1 / Library version 1.X
Japanese
俣野 文義,小口 純矢,森勢 将雅,``音声コーパス構築のための仮定を追加した発話区間検出法の提案と基礎評価,'' 日本音響学会第 152 回 (2024 年秋季) 研究発表会, pp.xxx--xxx (2024.09).
English
F. Matano, J. Koguchi, M. Morise, ``Proposal and basic evaluation of a voice activity detection with additional assumptions for speech corpus construction,'' Proceedings of the 2024 Autumn meeting of the Acoustical Society of Japan, pp.xxx--xxx (2024.09) (in Japanese).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pysaten-1.0.0.post2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 60c91c1b06650593b2b4df9deb9d86365c73dfb43d799bce6709ce6a04af5392 |
|
MD5 | da12879bdf811537af08f7080347e23a |
|
BLAKE2b-256 | 20e3a71d09e6f93bfce5549f274405b16eacf8f939ea6a2ee7176d7d103dc97b |