Detect silence segment from speech signal.

These details have been verified by PyPI

Project links

Repository

GitLab Statistics

Maintainers

f-matano44

These details have not been verified by PyPI

Project description

pySATEN

[Repository] [Mirror]

About

This library detects silence segment from speech signal.

(alt: Image of voice segment detection)

Installation

$ pip install pysaten

Usage

Command line

Supported formats for reading with pysoundfile. The audio file that can be loaded is mono only.

$ pysaten_trim input.wav trimmed.wav

Python

import pysaten

# y: Target signal, obtained using libraries such as librosa or soundfile.
# sr: Sampling rate.

# Get trimmed signal for the speech segment only.
y_trimmed = pysaten.trim(y, sr)

# If you trim manually or want to get start/end time...
start_s, end_s = pysaten.vsed(y, sr)
y_trimmed = y[int(start_s * sr) : int(end_s * sr)]
# start_s: Start of speech segment. Unit is seconds.
# end_s: End of speech segment. Unit is seconds.

For development

$ git clone https://gitlab.com/f-matano44/pysaten.git
$ poetry install

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see https://www.gnu.org/licenses/.

Acknowledgements

The following programs were used to [evaluate the performance of pysaten]. We would like to take this opportunity to express our gratitude.

test/marblenet: Apache License Version 2.0
- https://github.com/NVIDIA/NeMo
test/rvad: MIT License
- https://github.com/zhenghuatan/rVAD
inaSpeechSegmenter
- https://github.com/ina-foss/inaSpeechSegmenter

Cite this

Library version 1.X (Non-peer-reviewed)

Japanese

俣野文義，小口純矢，森勢将雅，``音声コーパス構築のための仮定を追加した発話区間検出法の提案と基礎評価,'' 日本音響学会第 152 回 (2024 年秋季) 研究発表会, pp.1161--1162 (2024.09).

English

F. Matano, J. Koguchi, M. Morise, ``Proposal and basic evaluation of a voice activity detection with additional assumptions for speech corpus construction,'' Proceedings of the 2024 Autumn meeting of the Acoustical Society of Japan, pp.1161--1162 (2024.09) (in Japanese).

Project details

These details have been verified by PyPI

Project links

Repository

GitLab Statistics

Maintainers

f-matano44

These details have not been verified by PyPI

Release history Release notifications | RSS feed

2.2.2.post1

Dec 2, 2025

2.2.2

Nov 5, 2025

2.2.1.post1

Oct 28, 2025

2.2.1

Oct 23, 2025

2.2.0.post3

Oct 15, 2025

2.2.0.post2

Oct 15, 2025

2.2.0.post1

Oct 15, 2025

2.2.0

Oct 14, 2025

2.1.1.post1

Oct 6, 2025

2.1.1

Oct 1, 2025

2.1.0

Sep 30, 2025

2.0.3

Sep 23, 2025

2.0.2.post2

Sep 17, 2025

2.0.2.post1

Sep 17, 2025

2.0.2

Sep 11, 2025

2.0.1

Sep 9, 2025

2.0.0

Aug 31, 2025

1.4.4

Aug 28, 2025

1.4.3

Jun 18, 2025

1.4.2

Jun 8, 2025

1.4.1.post1

Jun 7, 2025

1.4.1

Jun 7, 2025

1.4.0.post2

May 30, 2025

1.4.0.post1

May 27, 2025

This version

1.4.0

May 27, 2025

1.3.1

May 14, 2025

1.3.0

May 11, 2025

1.2.1.post3

May 6, 2025

1.2.1.post2

May 6, 2025

1.2.1.post1

May 1, 2025

1.2.1

May 1, 2025

1.2.0

Jan 26, 2025

1.1.6.post1

Sep 28, 2024

1.1.6

Sep 28, 2024

1.1.5

Sep 19, 2024

1.1.4.post1

Sep 8, 2024

1.1.4

Aug 31, 2024

1.1.3

Aug 31, 2024

1.1.2

Aug 10, 2024

1.1.1

Aug 6, 2024

1.1.0

Aug 5, 2024

1.0.0.post4

Aug 5, 2024

1.0.0.post3

Aug 5, 2024

1.0.0.post2

Aug 4, 2024

1.0.0.post1

Aug 4, 2024

1.0.0

Aug 4, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pysaten-1.4.0.tar.gz (18.3 kB view details)

Uploaded May 27, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pysaten-1.4.0-py3-none-any.whl (20.7 kB view details)

Uploaded May 27, 2025 Python 3

File details

Details for the file pysaten-1.4.0.tar.gz.

File metadata

Download URL: pysaten-1.4.0.tar.gz
Upload date: May 27, 2025
Size: 18.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for pysaten-1.4.0.tar.gz
Algorithm	Hash digest
SHA256	`33ad06f0ba493572d8eaa44afdc0c9c1a60867bed8f57792c4da638eefeff20f`
MD5	`7c43432e57aebd91e7fc805b46df9931`
BLAKE2b-256	`ced1de288ce6fc892dc2a171bd4ac0558420c558615fbd8ef447b735bfb9de13`

See more details on using hashes here.

File details

Details for the file pysaten-1.4.0-py3-none-any.whl.

File metadata

Download URL: pysaten-1.4.0-py3-none-any.whl
Upload date: May 27, 2025
Size: 20.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for pysaten-1.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e7a13c6f843ccf6c6e7d9e6b1bd70930a07ef90bd77bd741d59d56eb6348d36c`
MD5	`ecb7c6bb0caf8067a24b49cbf8af3eab`
BLAKE2b-256	`cb816643cc2648fffad504902f5e755af9ad5d168206bc413fc4cb86ce2b0278`

See more details on using hashes here.

pysaten 1.4.0

Navigation

Verified details

Project links

GitLab Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

pySATEN

About

Installation

Usage

Command line

Python

For development

License

Acknowledgements

Cite this

Library version 1.X (Non-peer-reviewed)

Japanese

English

Project details

Verified details

Project links

GitLab Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes