Skip to main content

Real-time streaming automatic speech recognition (ASR) with support for Chinese, English, Cantonese, Japanese, and Korean languages using SenseVoiceSmall model.

Reason this release was yanked:

Pip installation is broken due to package size limit. Please install from source: Clone the repository and follow the Development section of the README at https://github.com/nikoseven/sense-voice-streaming-asr/.

Project description

🎙️ SenseVoice Streaming ASR

PyPI Python License

A lightweight, real-time streaming speech recognition engine powered by SenseVoiceSmall.

✨ Features

  • Streaming ASR: lightweight, real-time streaming speech recognition engine, runs on CPU with realtime recongnition.
  • Integrated VAD: Built-in Voice Activity Detection (VAD) to detect speech segments.
  • Multilingual: Supports Chinese, English, Japanese, Korean, Cantonese out of the box.
  • Self-contained: Models bundled inside the package — no external downloads needed.
  • Pure Python + ONNX: No heavy dependencies; runs on CPU.

🚀 Installation

pip install sense-voice-streaming-asr

Requires Python ≥ 3.8.

🛠️ Development

To install in development mode:

git clone https://github.com/yourname/sense-voice-streaming-asr.git
cd sense-voice-streaming-asr
git submodule init # for SenseVoice models
pip install -e .

📄 License

Apache 2.0 License.

This project incorporates code from SenseVoice which is licensed under Apache 2.0. The project as a whole is therefore distributed under the Apache 2.0 license. See the LICENSE file for full license text.

🙏 Acknowledgements

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sense_voice_streaming_asr-0.1.1.tar.gz (111.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sense_voice_streaming_asr-0.1.1-py3-none-any.whl (116.5 kB view details)

Uploaded Python 3

File details

Details for the file sense_voice_streaming_asr-0.1.1.tar.gz.

File metadata

File hashes

Hashes for sense_voice_streaming_asr-0.1.1.tar.gz
Algorithm Hash digest
SHA256 321e2f8c5cabd165c7796a1ee3e20d7448c554abfb367ea50d0d2efa24c5e135
MD5 eba278fdd2b80d59af92a6de4ac25884
BLAKE2b-256 7420baeae93891a1694ac59a89dbd6b01439de4c15de1fa3122eea614a31058b

See more details on using hashes here.

Provenance

The following attestation bundles were made for sense_voice_streaming_asr-0.1.1.tar.gz:

Publisher: python-publish.yml on nikoseven/sense-voice-streaming-asr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file sense_voice_streaming_asr-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for sense_voice_streaming_asr-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8078c7446bba5161a2f6e924bdec3d58cabab7a30b077558a2ed97b1ffa0b3a2
MD5 445bddf006eca5f8bd87bbf2749e5b90
BLAKE2b-256 efdf5576f1a88399e6b4d39ad5f18da9b27f1c39671ccbfed864f03cccc8a3b3

See more details on using hashes here.

Provenance

The following attestation bundles were made for sense_voice_streaming_asr-0.1.1-py3-none-any.whl:

Publisher: python-publish.yml on nikoseven/sense-voice-streaming-asr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page