Real-time microphone transcription with Deepgram using Python.
Project description
livetranscriber
A single-file helper with minimal external dependencies that streams microphone audio to Deepgram for real-time speech-to-text. This is available as a package on PyPI.
Features
- Simple API - single
LiveTranscriberclass. - Configurable - every Deepgram LiveOptions parameter can be overridden via keyword arguments; sensible Nova-3 defaults are provided.
- Mandatory callback - forces the calling code to supply a function that will be invoked for every final transcript chunk (empty / interim chunks are ignored).
- Output capture - optional
output_pathwrites each final transcript line to disk. - Pause / resume - you may call
pauseorresumefrom your callback. - Graceful shutdown - Ctrl-C or
stopshuts everything down and releases resources.
Installation
Install the package directly from PyPI using pip:
pip install livetranscriber
Alternatively, if you are working with the source code or a specific requirements file, you can install the dependencies listed in requirements.txt:
deepgram-sdk>=4,<5
numpy>=1.24 # build-time requirement of sounddevice
sounddevice>=0.4
Install with uv (preferred) or plain pip:
uv venv .venv && source .venv/bin/activate
uv pip install -r requirements.txt
or
pip install -r requirements.txt
-
Python Version:
Python 3.11 is required.
Environment Setup
Export your Deepgram API key (see https://console.deepgram.com). For persistent access, add the following line to your shell profile file (e.g., ~/.zshrc, ~/.bashrc, or ~/.profile) and restart your terminal or source the file:
export DEEPGRAM_API_KEY="dg_…"
Example Usage
Here are examples demonstrating how to use the livetranscriber package.
Minimal Example
A basic example showing the essential setup:
from livetranscriber import LiveTranscriber
def simple_callback(text: str):
print("NEW >", text)
tr = LiveTranscriber(callback=simple_callback)
tr.run()
Comprehensive Example
A more detailed example demonstrating various features like output to file and pause/resume:
import time
from livetranscriber import LiveTranscriber
def comprehensive_callback(text: str):
print("Transcript received:", text)
# Example: Pause transcription if a specific phrase is detected
if "pause recording" in text.lower():
print("Status: PAUSING...")
transcriber.pause()
print("Status: RECORDING PAUSED. Say 'resume recording' to continue.")
# Example: Resume transcription if another phrase is detected
if "resume recording" in text.lower():
print("Status: RESUMING...")
transcriber.resume()
print("Status: RECORDING RESUMED.")
# Example: Stop transcription if a stop phrase is detected
if "stop recording" in text.lower():
print("Status: STOPPING...")
transcriber.stop()
# Instantiate with various options
output_file = "transcript_output.txt"
transcriber = LiveTranscriber(
callback=comprehensive_callback,
output_path=output_file, # Output transcript to a file
model="nova-3-general", # Specify a model
language="en-US", # Specify a language
punctuate=True, # Enable punctuation
smart_format=True # Enable smart formatting (like numbers)
)
try:
print(f"Starting transcription. Transcript will also be saved to {output_file}")
print("Instructions: Press Ctrl+C to stop, or say 'pause recording', 'resume recording', or 'stop recording'.")
transcriber.run() # Blocks until stop() is called or Ctrl-C is pressed
except KeyboardInterrupt:
print("\nInterrupted by user. Stopping.")
finally:
print("Transcription session ended.")
API
LiveTranscriber Class
High-level wrapper around Deepgram live transcription.
Parameters:
callback: A function that will be invoked for every final transcript. Must accept a singlestrargument. May be sync or async.output_path(Optional): Path to a text file that will receive each final transcript line (UTF-8).api_key(Optional): Your Deepgram API key. If omitted, theDEEPGRAM_API_KEYenvironment variable is used; failing both raisesRuntimeError.keepalive(Optional): IfTrue(default) the WebSocket client sends keepalive pings.**live_options_overrides(Optional): Any keyword argument that matches a LiveOptions field overrides the built-in defaults. For example,punctuate=False.
Methods:
run(): Run until.stop()or Ctrl-C.stop(): Public request to shut down; may be called from any thread.pause(): Pause writing transcripts tooutput_path. Note that the callback function will continue to receive transcription data while paused.resume(): Resume writing transcripts tooutput_path.
Development Standards
This section outlines the standards and practices for contributing to livetranscriber.
This project is distributed under the MIT License.
Versioning
livetranscriber follows Semantic Versioning. The version number is managed in the following locations:
pyproject.tomluv.lock- The package docstring in
livetranscriber.py
When making changes that require a version bump:
- Update the version number in all three locations according to Semantic Versioning principles.
- Commit the changes using a Conventional Commit message.
- Push the commit to the remote repository.
- Create a Git tag corresponding to the new version number (e.g.,
v0.2.2). - Push the tag to the remote repository.
Tagging
After pushing a new version commit, always create a Git tag for that version and push the tag. For version x.y.z, the tag name should be vx.y.z.
Dependencies
deepgram-sdknumpysounddevice
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file livetranscriber-0.3.11.tar.gz.
File metadata
- Download URL: livetranscriber-0.3.11.tar.gz
- Upload date:
- Size: 8.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
67b3c70521bf9441a9d6ce56edc957e84ad5e3c5773230809f251e167b0d29af
|
|
| MD5 |
c40960cfdbabcd30f0eb4eb09edc43ef
|
|
| BLAKE2b-256 |
aa611acaa6955063578d4d8d2ad3687c5df018c24a2832f4ac3b23bd6eb89dbd
|
File details
Details for the file livetranscriber-0.3.11-py3-none-any.whl.
File metadata
- Download URL: livetranscriber-0.3.11-py3-none-any.whl
- Upload date:
- Size: 9.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.7.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
51adad3c7bf1900a3957d9244b59a6495babb1b4d43fe34ce2e7cd5ed70fd59a
|
|
| MD5 |
0bc49314793dad2fab0e80208be2dffe
|
|
| BLAKE2b-256 |
7b75e95fe3fdb5f0b1921921fd332ef00466215ee5d87f4a2eee9243594d2532
|