Real-time microphone transcription with Deepgram using Python.

These details have not been verified by PyPI

Project links

Project description

livetranscriber

A single-file helper with minimal external dependencies that streams microphone audio to Deepgram for real-time speech-to-text. This is available as a package on PyPI. It also includes a WhisperTranscriber for fully offline transcription using the open-source OpenAI Whisper model.

Features

Simple API - single LiveTranscriber class.
Configurable - every Deepgram LiveOptions parameter can be overridden via keyword arguments; sensible Nova-3 defaults are provided.
Mandatory callback - forces the calling code to supply a function that will be invoked for every final transcript chunk (empty / interim chunks are ignored).
Output capture - optional output_path writes each final transcript line to disk.
Pause / resume - you may call pause or resume from your callback.
Offline mode - pause() switches to OpenAI Whisper locally until resume().
Graceful shutdown - Ctrl-C or stop shuts everything down and releases resources.

Installation

Install the package directly from PyPI using pip:

pip install livetranscriber

Alternatively, if you are working with the source code or a specific requirements file, you can install the dependencies listed in requirements.txt:

deepgram-sdk>=4,<5
numpy>=1.24  # build-time requirement of sounddevice
sounddevice>=0.4
openai-whisper>=20240930  # optional: required for offline mode

Whisper downloads its model the first time you use offline mode. For manual downloads see the Whisper README.

Install with uv (preferred) or plain pip:

uv venv .venv && source .venv/bin/activate
uv pip install -r requirements.txt

pip install -r requirements.txt

Python Version:

Python 3.11 is required.

Environment Setup

Export your Deepgram API key (see https://console.deepgram.com). For persistent access, add the following line to your shell profile file (e.g., ~/.zshrc, ~/.bashrc, or ~/.profile) and restart your terminal or source the file:

export DEEPGRAM_API_KEY="dg_…"

Command Line Usage

After installation you can invoke livetranscriber directly from the command line. This command starts listening to your microphone and prints final transcripts to the terminal.

livetranscriber --output transcript.txt --model nova-3-general --language en-US

The --output option writes each transcript line to the given file, while --model and --language control the Deepgram model and language.

Example Usage

Here are examples demonstrating how to use the livetranscriber package.

Minimal Example

A basic example showing the essential setup:

from livetranscriber import LiveTranscriber

def simple_callback(text: str):
    print("NEW >", text)

tr = LiveTranscriber(callback=simple_callback)
tr.run()

Comprehensive Example

A more detailed example demonstrating various features like output to file and pause/resume:

import time
from livetranscriber import LiveTranscriber

def comprehensive_callback(text: str):
    print("Transcript received:", text)

    # Example: Pause transcription if a specific phrase is detected
    if "pause recording" in text.lower():
        print("Status: PAUSING...")
        transcriber.pause()
        print("Status: RECORDING PAUSED. Say 'resume recording' to continue.")

    # Example: Resume transcription if another phrase is detected
    if "resume recording" in text.lower():
        print("Status: RESUMING...")
        transcriber.resume()
        print("Status: RECORDING RESUMED.")

    # Example: Stop transcription if a stop phrase is detected
    if "stop recording" in text.lower():
        print("Status: STOPPING...")
        transcriber.stop()

# Instantiate with various options
output_file = "transcript_output.txt"
transcriber = LiveTranscriber(
    callback=comprehensive_callback,
    output_path=output_file, # Output transcript to a file
    model="nova-3-general", # Specify a model
    language="en-US",     # Specify a language
    punctuate=True,         # Enable punctuation
    smart_format=True       # Enable smart formatting (like numbers)
)

try:
    print(f"Starting transcription. Transcript will also be saved to {output_file}")
    print("Instructions: Press Ctrl+C to stop, or say 'pause recording', 'resume recording', or 'stop recording'.")
    transcriber.run() # Blocks until stop() is called or Ctrl-C is pressed
except KeyboardInterrupt:
    print("\nInterrupted by user. Stopping.")
finally:
    print("Transcription session ended.")

Pausing Activates Local Mode

Calling pause() disconnects Deepgram and transcribes audio locally with Whisper until resume():

transcriber.pause()   # Whisper now processes audio offline
# ... speak offline ...
transcriber.resume()  # Deepgram connection restored

API

`LiveTranscriber` Class

High-level wrapper around Deepgram live transcription.

Parameters:

callback: A function that will be invoked for every final transcript. Must accept a single str argument. May be sync or async.
output_path (Optional): Path to a text file that will receive each final transcript line (UTF-8).
api_key (Optional): Your Deepgram API key. If omitted, the DEEPGRAM_API_KEY environment variable is used; failing both raises RuntimeError.
keepalive (Optional): If True (default) the WebSocket client sends keepalive pings.
**live_options_overrides (Optional): Any keyword argument that matches a LiveOptions field overrides the built-in defaults. For example, punctuate=False.

Methods:

run(): Run until .stop() or Ctrl-C.
stop(): Public request to shut down; may be called from any thread.
pause(): Pause writing transcripts to output_path. Note that the callback function will continue to receive transcription data while paused.
resume(): Resume writing transcripts to output_path.

Development Standards

This section outlines the standards and practices for contributing to livetranscriber.

This project is distributed under the MIT License.

Versioning

livetranscriber follows Semantic Versioning. The version number is managed in the following locations:

pyproject.toml
uv.lock
The package docstring in livetranscriber.py

When making changes that require a version bump:

Update the version number in all three locations according to Semantic Versioning principles.
Commit the changes using a Conventional Commit message.
Push the commit to the remote repository.
Create a Git tag corresponding to the new version number (e.g., v0.2.2).
Push the tag to the remote repository.

Tagging

After pushing a new version commit, always create a Git tag for that version and push the tag. For version x.y.z, the tag name should be vx.y.z.

Changelog

v0.3.3

Enable local transcription via pause.

Dependencies

deepgram-sdk
numpy
sounddevice
openai-whisper (offline mode)

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.13

May 22, 2025

0.3.12

May 20, 2025

0.3.11

May 20, 2025

0.3.9

May 20, 2025

0.3.6

May 20, 2025

0.3.5

May 20, 2025

0.3.4

May 20, 2025

This version

0.3.3

May 19, 2025

0.3.2

May 19, 2025

0.3.1

May 19, 2025

0.2.5

May 18, 2025

0.2.4

May 17, 2025

0.2.3

May 17, 2025

0.2.2

May 15, 2025

0.2.1

May 15, 2025

0.2.0

May 15, 2025

0.1.1

May 15, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

livetranscriber-0.3.3.tar.gz (14.1 kB view details)

Uploaded May 19, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

livetranscriber-0.3.3-py3-none-any.whl (11.9 kB view details)

Uploaded May 19, 2025 Python 3

File details

Details for the file livetranscriber-0.3.3.tar.gz.

File metadata

Download URL: livetranscriber-0.3.3.tar.gz
Upload date: May 19, 2025
Size: 14.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.5

File hashes

Hashes for livetranscriber-0.3.3.tar.gz
Algorithm	Hash digest
SHA256	`65619ca7b5e8ab8be060715b6498342df003ba7c71c5bcf6fc794ca49adff703`
MD5	`3f95f0c742647b80308dad4b8ecd69f9`
BLAKE2b-256	`79c5274e626c04080eb6736a3cd6780e20610da5a229aaaf41928813e571cac9`

See more details on using hashes here.

File details

Details for the file livetranscriber-0.3.3-py3-none-any.whl.

File metadata

Download URL: livetranscriber-0.3.3-py3-none-any.whl
Upload date: May 19, 2025
Size: 11.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.5

File hashes

Hashes for livetranscriber-0.3.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f4903ac84b01226c2af228760258c685fa8ff294a449f138471dc6f7178f87ea`
MD5	`f174d9973bb4abdb113d87f7a31633a5`
BLAKE2b-256	`b7124748d6ace25e464c789a9673a9961c74fe54a6be6d7fb60680ed7b0f1d41`

See more details on using hashes here.

livetranscriber 0.3.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

livetranscriber

Features

Installation

Environment Setup

Command Line Usage

Example Usage

Minimal Example

Comprehensive Example

Pausing Activates Local Mode

API

LiveTranscriber Class

Development Standards

Versioning

Tagging

Changelog

v0.3.3

Dependencies

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`LiveTranscriber` Class