Sound Event Activity Detection (SEAD) - speech, music, others
Project description
SEAD - Sound Event Activity Detection
Sound Event Activity Detection (SEAD). Detects speech, music, and other sounds in audio with offline and streaming support.
Installation
pip install sead
For microphone streaming, also install the optional sounddevice dependency:
pip install sead[dev]
For GPU inference (optional):
pip install sead[onnx-gpu]
Usage
CLI
# Offline: process a WAV file
sead --audio audio_16khz.wav
# Or via Python module
python -m sead.cli --audio audio_16khz.wav
# Synthetic streaming: feed file as streaming input (emits start/end events)
sead --stream-file audio_16khz.wav
# Compare streaming vs offline
sead --stream-file audio_16khz.wav --compare
# Live microphone streaming (requires sounddevice)
sead --stream
# Save debug chunks for each segment
sead --audio audio_16khz.wav --debug-dir debug_segments
Python API
from sead import DEFAULT_MODEL_PATH, SEADDetector, SEADIterator, Segment
from pathlib import Path
# Offline
detector = SEADDetector(DEFAULT_MODEL_PATH)
segments = detector.process_file(Path("audio.wav"))
for s in segments:
print(s) # [start_time, end_time, label, confidence]
# Streaming (incremental events)
iterator = SEADIterator(detector)
for chunk in audio_chunks:
for e in iterator(chunk):
print(e) # {'start': t, 'label': str} or {'end': t, 'label': str, 'confidence': float}
for e in iterator.flush():
print(e)
Output
- Segments:
[start_time, end_time, label, confidence]with labelsspeech,music,others - Events (streaming):
{'start': t, 'label': str}on onset,{'end': t, 'label': str, 'confidence': float}on offset
Acknowledgement
SEAD uses YamNet by Qualcomm, an audio event classifier trained on the AudioSet dataset.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sead-0.1.2.tar.gz.
File metadata
- Download URL: sead-0.1.2.tar.gz
- Upload date:
- Size: 14.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1635bc364917d087a9f5d530dd7a01736d7082b9bf06234c63fd3161144f672b
|
|
| MD5 |
e79baab0380d807cbf223c0cd4373833
|
|
| BLAKE2b-256 |
d4dacf9d8e2c11fe53dd41ed6b3f7fd5374bf0db847fa07bda542310ae1bad64
|
File details
Details for the file sead-0.1.2-py3-none-any.whl.
File metadata
- Download URL: sead-0.1.2-py3-none-any.whl
- Upload date:
- Size: 14.0 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ed503049352d0a6367f1c6a32d76b27f9548ba5688b15c272e3604799b6a57c6
|
|
| MD5 |
bc525f8bdb0978ce38255f331f733fc4
|
|
| BLAKE2b-256 |
1dc26b571b707eb6a884bf8a1dab284080a5a2ec621d3d6d8d4c72b428eb0a04
|