Real-time streaming automatic speech recognition (ASR) for Chinese, English, Cantonese, Japanese, and Korean, using the SenseVoiceSmall model.
Reason this release was yanked:
Pip installation is broken due to the package size limit. Please install from source: clone the repository and follow the Development section of the README at https://github.com/nikoseven/sense-voice-streaming-asr/.
🎙️ SenseVoice Streaming ASR
A lightweight, real-time streaming speech recognition engine powered by SenseVoiceSmall.
✨ Features
- Streaming ASR: Lightweight streaming speech recognition engine that runs on CPU with real-time recognition.
- Integrated VAD: Built-in Voice Activity Detection (VAD) to detect speech segments.
- Multilingual: Supports Chinese, English, Japanese, Korean, and Cantonese out of the box.
- Self-contained: Models bundled inside the package; no external downloads needed.
- Pure Python + ONNX: No heavy dependencies; runs on CPU.
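The streaming idea in the feature list can be illustrated with a minimal sketch: audio arrives as a sequence of samples and is handed on in fixed-size chunks rather than as one complete recording. This does not use the package's API, which this page does not document. The helper `iter_chunks`, the 16 kHz sample rate, and the 100 ms chunk length are all assumptions chosen for illustration.

```python
from typing import Iterator, List, Sequence

SAMPLE_RATE = 16000  # assumed sample rate in Hz (hypothetical, not taken from the package)
CHUNK_MS = 100       # assumed chunk length in milliseconds (hypothetical)


def iter_chunks(samples: Sequence[float], chunk_size: int) -> Iterator[List[float]]:
    """Yield consecutive chunks of at most chunk_size samples (hypothetical helper)."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    for start in range(0, len(samples), chunk_size):
        yield list(samples[start:start + chunk_size])


# Number of samples in one chunk at the assumed rate and duration.
chunk_size = SAMPLE_RATE * CHUNK_MS // 1000

# Placeholder input: 4000 zero-valued samples standing in for captured audio.
audio = [0.0] * 4000
chunks = list(iter_chunks(audio, chunk_size))
```

In a real pipeline each chunk would be passed to a VAD and recognizer as it arrives; the final chunk may be shorter than the others, so downstream code should not assume a fixed length.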
🚀 Installation
pip install sense-voice-streaming-asr
Requires Python ≥ 3.8.
🛠️ Development
To install in development mode:
git clone https://github.com/nikoseven/sense-voice-streaming-asr.git
cd sense-voice-streaming-asr
git submodule update --init # fetch the SenseVoice models
pip install -e .
📄 License
Apache 2.0 License.
This project incorporates code from SenseVoice which is licensed under Apache 2.0. The project as a whole is therefore distributed under the Apache 2.0 license. See the LICENSE file for full license text.
🙏 Acknowledgements
- SenseVoice by FunAudioLLM
- SenseVoice ONNX Models
- FSMN VAD
- Kaldi-native-fbank
- ONNX Runtime
File details
Details for the file sense_voice_streaming_asr-0.1.1.tar.gz.
File metadata
- Download URL: sense_voice_streaming_asr-0.1.1.tar.gz
- Upload date:
- Size: 111.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 321e2f8c5cabd165c7796a1ee3e20d7448c554abfb367ea50d0d2efa24c5e135 |
| MD5 | eba278fdd2b80d59af92a6de4ac25884 |
| BLAKE2b-256 | 7420baeae93891a1694ac59a89dbd6b01439de4c15de1fa3122eea614a31058b |
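A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library `hashlib` module; the helper names `sha256_of_file` and `matches` are hypothetical and not part of this package.

```python
import hashlib


def sha256_of_file(path: str, block_size: int = 65536) -> str:
    """Return the hex SHA256 digest of the file at path, reading it in blocks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size blocks until read() returns an empty bytes object.
        for block in iter(lambda: fh.read(block_size), b""):
            digest.update(block)
    return digest.hexdigest()


def matches(path: str, expected_hex: str) -> bool:
    """Compare the file's digest with an expected hex string, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches("sense_voice_streaming_asr-0.1.1.tar.gz", "321e2f8c5cabd165c7796a1ee3e20d7448c554abfb367ea50d0d2efa24c5e135")` should return `True` for an intact copy of the source distribution, assuming the file sits in the current directory.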
Provenance
The following attestation bundles were made for sense_voice_streaming_asr-0.1.1.tar.gz:
Publisher: python-publish.yml on nikoseven/sense-voice-streaming-asr
- Statement:
  - Statement type: https://in-toto.io/Statement/v1
  - Predicate type: https://docs.pypi.org/attestations/publish/v1
  - Subject name: sense_voice_streaming_asr-0.1.1.tar.gz
  - Subject digest: 321e2f8c5cabd165c7796a1ee3e20d7448c554abfb367ea50d0d2efa24c5e135
- Sigstore transparency entry: 592839707
- Sigstore integration time:
- Permalink: nikoseven/sense-voice-streaming-asr@7f2f77cc1617ed910d66eb820e61e8a437a260a7
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/nikoseven
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@7f2f77cc1617ed910d66eb820e61e8a437a260a7
- Trigger Event: release
File details
Details for the file sense_voice_streaming_asr-0.1.1-py3-none-any.whl.
File metadata
- Download URL: sense_voice_streaming_asr-0.1.1-py3-none-any.whl
- Upload date:
- Size: 116.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 8078c7446bba5161a2f6e924bdec3d58cabab7a30b077558a2ed97b1ffa0b3a2 |
| MD5 | 445bddf006eca5f8bd87bbf2749e5b90 |
| BLAKE2b-256 | efdf5576f1a88399e6b4d39ad5f18da9b27f1c39671ccbfed864f03cccc8a3b3 |
Provenance
The following attestation bundles were made for sense_voice_streaming_asr-0.1.1-py3-none-any.whl:
Publisher: python-publish.yml on nikoseven/sense-voice-streaming-asr
- Statement:
  - Statement type: https://in-toto.io/Statement/v1
  - Predicate type: https://docs.pypi.org/attestations/publish/v1
  - Subject name: sense_voice_streaming_asr-0.1.1-py3-none-any.whl
  - Subject digest: 8078c7446bba5161a2f6e924bdec3d58cabab7a30b077558a2ed97b1ffa0b3a2
- Sigstore transparency entry: 592839802
- Sigstore integration time:
- Permalink: nikoseven/sense-voice-streaming-asr@7f2f77cc1617ed910d66eb820e61e8a437a260a7
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/nikoseven
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@7f2f77cc1617ed910d66eb820e61e8a437a260a7
- Trigger Event: release