Detect silent segments in a speech signal.

pySATEN

About

This library detects silent segments in a speech signal.

(alt: Image of Voice Start-End Detection)

Installation

pip install pysaten

Usage

Command line

pysaten_trim input.wav trimmed.wav
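
To trim many files at once, the command above can be driven from a short script. The following is a minimal sketch, not part of pysaten itself: the helper names `build_trim_commands` and `trim_all` and the directory names are hypothetical, and it assumes only the two-argument form `pysaten_trim <input> <output>` shown above.

```python
import subprocess
from pathlib import Path


def build_trim_commands(input_dir, output_dir):
    """Build one `pysaten_trim <input> <output>` command per .wav file.

    Hypothetical helper: it only assembles argument lists and runs nothing.
    """
    in_dir, out_dir = Path(input_dir), Path(output_dir)
    return [
        ["pysaten_trim", str(wav), str(out_dir / wav.name)]
        for wav in sorted(in_dir.glob("*.wav"))
    ]


def trim_all(input_dir, output_dir):
    """Run the commands one by one; requires pysaten_trim on the PATH."""
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    for cmd in build_trim_commands(input_dir, output_dir):
        # check=True raises CalledProcessError if a file fails to trim.
        subprocess.run(cmd, check=True)


# Example call (directory names are placeholders):
# trim_all("recordings", "recordings_trimmed")
```

Keeping command construction separate from execution makes it easy to print the commands first and inspect them before any file is written.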

Python

import numpy
import pysaten

# y: Target signal (numpy.ndarray), loaded with a library such as librosa or soundfile.
# sr: Sampling rate of y.

# Get the trimmed signal containing only the speech segment.
y_trimmed: numpy.ndarray = pysaten.trim(y, sr)

# To trim manually, or to get the start/end times in seconds:
start_sec, end_sec = pysaten.vsed(y, sr)
y_trimmed = y[int(start_sec * sr) : int(end_sec * sr)]
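
When trimming manually, it can help to clamp the computed sample indices to the signal length, for example if padding is added around the detected segment. The sketch below is illustrative only: `seconds_to_slice` is a hypothetical helper, not a pysaten function, and the list `y` is a stand-in for a real signal. In real use, `start_sec` and `end_sec` would come from `pysaten.vsed(y, sr)`.

```python
def seconds_to_slice(start_sec, end_sec, sr, n_samples):
    """Convert a (start, end) pair in seconds to a clamped sample slice.

    Hypothetical helper. It uses int() truncation, like the
    int(start_sec * sr) expression in the usage example.
    """
    if sr <= 0:
        raise ValueError("sr must be positive")
    # Keep both indices inside [0, n_samples] and ensure end >= start.
    start = max(0, min(int(start_sec * sr), n_samples))
    end = max(start, min(int(end_sec * sr), n_samples))
    return slice(start, end)


# Stand-in data: ten dummy samples at a made-up sampling rate of 4 Hz.
y = list(range(10))
sr = 4
segment = y[seconds_to_slice(0.5, 2.0, sr, len(y))]
print(segment)  # prints [2, 3, 4, 5, 6, 7]
```

The same slice object can index a `numpy.ndarray`, so the helper works unchanged on signals loaded with librosa or soundfile.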

For development

git clone https://gitlab.com/f-matano44/pysaten.git
cd pysaten
uv sync

License

Copyright 2024 Fumiyoshi MATANO

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.

Acknowledgements 🙏

Cite this

Library version 2.x (Non-peer-reviewed)

Japanese

俣野 文義,森勢 将雅,"音声コーパス構築のための仮定を追加した発話区間検出法の改良," 日本音響学会第 154 回 (2025 年秋季) 研究発表会, pp. 1223-1224 (2025.09).

English

F. Matano, M. Morise, "An improved method for voice activity detection with additional assumptions for speech corpus construction," Proceedings of the 2025 Autumn Meeting of the Acoustical Society of Japan, pp. 1223-1224 (2025.09) (in Japanese).
