Skip to main content

Detect silence segment from speech signal.

Project description

pySATEN

PyPI - Version Downloads

About

This library detects silence segment from speech signal.

(alt: Image of voice segment detection)

Installation

pip install pysaten

Usage

Command line

Supported formats for reading with pysoundfile. The audio file that can be loaded is mono only.

pysaten_trim input.wav trimmed.wav

Python

import pysaten

# y: Target signal, obtained using libraries such as librosa or soundfile.
# sr: Sampling rate.

# Get trimmed signal for the speech segment only.
y_trimmed = pysaten.trim(y, sr)

# If you trim manually or want to get start/end time...
start_s, end_s = pysaten.vsed(y, sr)
y_trimmed = y[start_s * sr : end_s * sr]
# start_s: Start of speech segment. Unit is seconds.
# end_s: End of speech segment. Unit is seconds.

License

Copyright 2024 Fumiyoshi MATANO

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.

Acknowledgements

The following programs were used to test the performance of pysaten. We would like to take this opportunity to express our gratitude.

Cite this

Lv.1 / Library version 1.X

Japanese

俣野 文義,小口 純矢,森勢 将雅,``音声コーパス構築のための仮定を追加した発話区間検出法の提案と基礎評価,'' 日本音響学会第 152 回 (2024 年秋季) 研究発表会, pp.1161--1162 (2024.09).

English

F. Matano, J. Koguchi, M. Morise, ``Proposal and basic evaluation of a voice activity detection with additional assumptions for speech corpus construction,'' Proceedings of the 2024 Autumn meeting of the Acoustical Society of Japan, pp.1161--1162 (2024.09) (in Japanese).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pysaten-1.2.1.tar.gz (870.3 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pysaten-1.2.1-py3-none-any.whl (19.6 kB view details)

Uploaded Python 3

pysaten-1.2.1-py2.py3-none-any.whl (19.5 kB view details)

Uploaded Python 2Python 3

File details

Details for the file pysaten-1.2.1.tar.gz.

File metadata

  • Download URL: pysaten-1.2.1.tar.gz
  • Upload date:
  • Size: 870.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for pysaten-1.2.1.tar.gz
Algorithm Hash digest
SHA256 64aa78e2ec2b4cc30a96037d26d230801273daacf5c160760d1ed433a5576aa9
MD5 a31dfb12efbf6bd745fae3264d0ccf7c
BLAKE2b-256 3b2318a607fd2daf946d853c9bb51e12037ceaef2e0c745c6c390d6649cb9a9a

See more details on using hashes here.

File details

Details for the file pysaten-1.2.1-py3-none-any.whl.

File metadata

  • Download URL: pysaten-1.2.1-py3-none-any.whl
  • Upload date:
  • Size: 19.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for pysaten-1.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9b9b0bb01bece04915b5254cd64270f34c14a7fa0e765764d0b3153e7e7e28aa
MD5 93cf9de2537afb5944f06be253c4c971
BLAKE2b-256 ac3fa317f0b6e5c111ea6cee53042a558c29aa7f5d42d6550725ccc7a3b210af

See more details on using hashes here.

File details

Details for the file pysaten-1.2.1-py2.py3-none-any.whl.

File metadata

  • Download URL: pysaten-1.2.1-py2.py3-none-any.whl
  • Upload date:
  • Size: 19.5 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for pysaten-1.2.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 282d7fd7c2e5f22045d89d49a22a60c2d05b9cffbf68caf0f6e88073b900c057
MD5 8f48f1e6d60a0162065a85ae324e1e45
BLAKE2b-256 960a319487a9e393f4881bb58ab66eb26e0e7222b310de0ae3031627755588ae

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page