Easy to use implementation of Vosk local speech recognition

Project description

Noah's Local Speech Recognition

This project provides a local, privacy-preserving speech recognition tool using the Vosk speech recognition toolkit and microphone input. It supports keyword detection, timestamped transcription, logging, pause/resume functionality, and model auto-downloading.

Features

Offline speech recognition (no internet needed)
Works with USB microphones (on Linux with ALSA)
Auto-download of Vosk models if not present
Real-time transcription with timestamps
Logging to disk
Support for pause/resume listening

Requirements

Python 3.9+
vosk
pyaudio
tqdm
requests

Setup:

pip install noahs_local_speech_recognition

On Linux (including Raspberry Pi), pyaudio depends on the system-level PortAudio library. Install portaudio19-dev and confirm that pyaudio works before running pip install noahs_local_speech_recognition, which relies on pyaudio. Doing this first avoids installation errors.

Linux Setup Instructions

If you're using Linux, the script attempts to install the required PortAudio libraries automatically. However, you may still need to:

Run the script that imports noahs_local_speech_recognition with sudo.

Manually install dependencies if issues arise:

sudo apt-get update
sudo apt-get install -y portaudio19-dev python3-pyaudio
pip uninstall pyaudio
pip install pyaudio

These steps ensure that pyaudio is compiled correctly against the system's PortAudio library.
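Before importing the package, it can help to confirm that its Python dependencies are visible to the interpreter. The following is a minimal sketch that uses only the standard library. The helper name missing_modules is hypothetical and is not part of noahs_local_speech_recognition. It checks only that each module can be located, not that pyaudio can actually open an audio device.

```python
import importlib.util

# Top-level module names taken from the Requirements list.
REQUIRED_MODULES = ["vosk", "pyaudio", "tqdm", "requests"]


def missing_modules(names):
    """Return the names that cannot be located in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

If pyaudio is reported as missing on Linux, the apt-get and pip steps above are the place to start.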

Demo

To test the speech recognizer locally, simply run this script:

from noahs_local_speech_recognition import (
    get_text_after_keyword,
    list_microphones,
    start_speech_listening,
    stop_speech_listening,
    get_speech_log,  # called in the loop below, so it must be imported
    get_speech_log_entry,
    set_speech_log_response,
    remove_speech_log_entry,
    pause_speech_listening,
    resume_speech_listening,
)

import time

print("Listing available microphone devices:")
list_microphones()

print("\nStarting speech recognition with default input device...")
start_speech_listening(name="robot", stop_talking_delay=2, device_index=None, model_name="vosk-model-small-en-us-0.15")

try:
    while True:
        time.sleep(2)
        print("_________________________")
        entry = get_speech_log_entry()
        if entry and entry["response"] is None:
            heard = entry["content"]
            print(f"I HEARD: {heard}")
            speech_log1 = get_speech_log()
            print(f"SPEECH LOG BEFORE SET RESPONSE: {speech_log1}")
            set_speech_log_response(heard)
            speech_log2 = get_speech_log()
            print(f"SPEECH LOG AFTER SET RESPONSE: {speech_log2}")
        if entry and entry["content"] in ["goodbye", "good bye", "bye", "quit", "end", "exit"]:
            break
except KeyboardInterrupt:
    pass
finally:
    stop_speech_listening()
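The demo passes name="robot" as the keyword that triggers transcription. The package exports get_text_after_keyword, but its implementation is not shown here, so the snippet below is only a hypothetical illustration of the general idea: find the keyword in a transcript and keep what follows it. The function name extract_after_keyword and its matching rules (case-insensitive, whole words) are assumptions, not the package's behavior.

```python
def extract_after_keyword(transcript, keyword):
    """Return the text following the first occurrence of keyword, or None.

    Matching is case-insensitive and compares whole whitespace-separated words.
    """
    words = transcript.split()
    lowered = [word.lower() for word in words]
    try:
        index = lowered.index(keyword.lower())
    except ValueError:
        return None  # the keyword was not spoken
    return " ".join(words[index + 1:])


print(extract_after_keyword("hey robot turn on the lights", "robot"))
# prints: turn on the lights
```

Returning None when the keyword is absent, rather than an empty string, lets a caller tell "keyword not heard" apart from "keyword heard with nothing after it".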

This will:

Auto-download the Vosk "vosk-model-small-en-us-0.15" model if missing.
Show available audio devices.
Look for the default microphone, since device_index=None.
Begin listening and transcribing speech in real time once the name="robot" keyword is heard.
Log results with timestamps to a file named like transcript_YYYY-MM-DD_HH-MM-SS.txt.
Print the speech log to the console before and after the response is set.
Use set_speech_log_response() to flag entries as having been responded to.
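The transcript file name pattern mentioned above, transcript_YYYY-MM-DD_HH-MM-SS.txt, can be produced with the standard datetime module. This is a sketch of the naming scheme only. The helper transcript_filename is hypothetical, and the package's own logging code may build the name differently.

```python
from datetime import datetime


def transcript_filename(now=None):
    """Build a name such as transcript_2025-05-26_14-03-09.txt."""
    if now is None:
        now = datetime.now()  # default to the current local time
    return now.strftime("transcript_%Y-%m-%d_%H-%M-%S.txt")


print(transcript_filename(datetime(2025, 5, 26, 14, 3, 9)))
# prints: transcript_2025-05-26_14-03-09.txt
```

Because every field is zero-padded and ordered from year to second, names in this format sort chronologically when sorted as plain strings.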

Changing Models

To use a different model, change the model_name argument passed to start_speech_listening():

start_speech_listening(name="robot", stop_talking_delay=2, device_index=None, model_name="vosk-model-small-en-us-0.15")

Available models:

vosk-model-small-en-us-0.15
vosk-model-en-us-0.22
vosk-model-en-us-0.22-lgraph
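Since only these three model names are supported for automatic download, it may be useful to validate a model name before starting the listener. The sketch below is an assumption-laden illustration: the helper model_url is hypothetical, and the URL layout assumes that models are fetched as zip archives from the public Vosk model host. The package's actual hardcoded links are not shown in this document and may differ.

```python
# The three model names listed in this section.
KNOWN_MODELS = (
    "vosk-model-small-en-us-0.15",
    "vosk-model-en-us-0.22",
    "vosk-model-en-us-0.22-lgraph",
)

# Assumption: archives live under this base URL as <model_name>.zip.
BASE_URL = "https://alphacephei.com/vosk/models"


def model_url(model_name):
    """Return the assumed download URL, rejecting unknown model names."""
    if model_name not in KNOWN_MODELS:
        raise ValueError(
            f"Unknown model {model_name!r}; expected one of {', '.join(KNOWN_MODELS)}"
        )
    return f"{BASE_URL}/{model_name}.zip"


print(model_url("vosk-model-small-en-us-0.15"))
```

Failing early with a ValueError gives a clearer message than a download error for a misspelled model name.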

Known Issues

Use on Debian Linux may require manually setting up pyaudio before import.
Automatic model download links are hardcoded and limited to three options.

License

MIT License

Acknowledgements

Vosk Speech Recognition Toolkit
TQDM for progress bars

Project details

Release history

0.1.4: Sep 25, 2025
0.1.3: May 26, 2025
0.1.2: May 26, 2025
0.1.1: May 26, 2025
0.1.0: May 26, 2025 (this version)

Download files

Source Distribution

noahs_local_speech_recognition-0.1.0.tar.gz (5.8 kB)

Uploaded May 26, 2025 Source

Built Distribution

noahs_local_speech_recognition-0.1.0-py3-none-any.whl (6.6 kB)

Uploaded May 26, 2025 Python 3

File details

Details for the file noahs_local_speech_recognition-0.1.0.tar.gz.

File metadata

Download URL: noahs_local_speech_recognition-0.1.0.tar.gz
Upload date: May 26, 2025
Size: 5.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.4

File hashes

Hashes for noahs_local_speech_recognition-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`ca5a15fd9b020cf74dbd0dfc372cd5ce14a89f88a30a4f77202fb6d504e79eff`
MD5	`e3a373fe73238cca4c268c02c342ef36`
BLAKE2b-256	`db8ddc94545a6400eceb3a9803642e4ed9ab6e54b420eba2c5b62ce74e48528e`

File details

Details for the file noahs_local_speech_recognition-0.1.0-py3-none-any.whl.

File metadata

Download URL: noahs_local_speech_recognition-0.1.0-py3-none-any.whl
Upload date: May 26, 2025
Size: 6.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.4

File hashes

Hashes for noahs_local_speech_recognition-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a9ade79839542d135f25dc77faa6cbb47cb1e61607274e465b36042f13bd9769`
MD5	`d327f39fab482ef294565bbf9b3389dd`
BLAKE2b-256	`32421968e58ca32e1b1632ab93f46c00c56d25affce28cb030ffe6d8547c83c1`
