Real-time audio transcription. Runs OpenAI's Whisper locally.
Live Transcribe
Live Transcribe is a Python package that transcribes audio in real time using OpenAI's Whisper model, which runs locally.
Currently, Live Transcribe supports only PulseAudio as an audio backend.
Installation
Optional, but highly recommended for low latency: install PyTorch with CUDA support. Refer to the PyTorch documentation for installation instructions.
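To confirm that a CUDA-enabled PyTorch is actually in place before starting, a quick check along these lines may help. This is a minimal sketch, not part of Live Transcribe: the helper name describe_torch_device is hypothetical, and the only PyTorch call used is torch.cuda.is_available().

```python
def describe_torch_device() -> str:
    """Return "cuda", "cpu", or "torch-not-installed" (hypothetical helper)."""
    try:
        # Imported lazily so the check also works when PyTorch is absent.
        import torch
    except ImportError:
        return "torch-not-installed"
    # True only if PyTorch was built with CUDA and a usable GPU is visible.
    return "cuda" if torch.cuda.is_available() else "cpu"


print(describe_torch_device())
```

If this prints cpu, transcription should still work, but latency will likely be higher.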
pip install live-transcribe
Usage
Just run:
python -m live_transcribe
or
live_transcribe
To transcribe from an audio device other than the default, list the available devices and pass the name of the one you want with the --device option, e.g.:
live_transcribe --list-devices
live_transcribe --device "alsa_input.usb-046d_HD_Pro_Webcam_C920_8C0B5B0F-02.analog-stereo"
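The device name in the example looks like a PulseAudio source name. Assuming that --device accepts such names (not confirmed here), the sketch below reads them from pactl list short sources and builds the matching command line. The helper names parse_source_names, list_source_names and build_command are hypothetical and not part of Live Transcribe.

```python
import shutil
import subprocess
from typing import List


def parse_source_names(pactl_output: str) -> List[str]:
    """Extract source names from `pactl list short sources` output.

    Each line is tab-separated: index, name, driver, sample spec, state.
    """
    names = []
    for line in pactl_output.splitlines():
        fields = line.split("\t")
        if len(fields) >= 2 and fields[1]:
            names.append(fields[1])
    return names


def list_source_names() -> List[str]:
    """Return PulseAudio source names, or an empty list if pactl is unavailable."""
    if shutil.which("pactl") is None:
        return []
    result = subprocess.run(
        ["pactl", "list", "short", "sources"],
        capture_output=True,
        text=True,
        check=False,
    )
    return parse_source_names(result.stdout)


def build_command(device: str) -> List[str]:
    """Build the argument list for the --device invocation shown above."""
    return ["live_transcribe", "--device", device]
```

Passing the result of build_command to subprocess.run as a list avoids shell quoting problems with long device names.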
On first use, OpenAI's Whisper model is downloaded and cached.
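To see which models are already cached, the following sketch lists the model files in the default cache directory of the openai-whisper package (~/.cache/whisper, or under XDG_CACHE_HOME if that is set). It assumes Live Transcribe does not override the download location, which is not confirmed here. The helper names are hypothetical.

```python
import os
from pathlib import Path
from typing import List


def default_whisper_cache_dir() -> Path:
    """Default download directory used by the openai-whisper package."""
    fallback = os.path.join(os.path.expanduser("~"), ".cache")
    return Path(os.getenv("XDG_CACHE_HOME", fallback)) / "whisper"


def cached_models(cache_dir: Path) -> List[str]:
    """Return the file names of cached model checkpoints (*.pt), sorted."""
    if not cache_dir.is_dir():
        return []
    return sorted(path.name for path in cache_dir.glob("*.pt"))


cache_dir = default_whisper_cache_dir()
print(cache_dir, cached_models(cache_dir))
```

An empty list simply means nothing has been downloaded to that directory yet.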
See live_transcribe --help for all options.
Dependencies
Live Transcribe has the following dependencies:
Python 3.8 or higher
OpenAI-Whisper
PulseAudio
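The dependencies above can be checked with a short script such as the one below. It is a sketch under two assumptions: that the presence of the pactl command indicates a PulseAudio installation, and that the hypothetical helper check_dependencies is run in the same Python environment as Live Transcribe. The import name of the OpenAI-Whisper package is whisper.

```python
import importlib.util
import shutil
import sys
from typing import Dict


def check_dependencies() -> Dict[str, bool]:
    """Report whether each listed dependency appears to be available."""
    return {
        # Python 3.8 or higher
        "python>=3.8": sys.version_info >= (3, 8),
        # The openai-whisper package is imported as `whisper`.
        "openai-whisper": importlib.util.find_spec("whisper") is not None,
        # Assumption: pactl on PATH means PulseAudio tooling is installed.
        "pulseaudio (pactl)": shutil.which("pactl") is not None,
    }


for name, ok in check_dependencies().items():
    print(f"{name}: {'ok' if ok else 'missing'}")
```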