Skip to main content

A GUI application for Whisper speech recognition

Project description

whisper-gui

Basic GUI for openai-whisper

A simple graphical user interface for OpenAI's Whisper speech recognition system.

Features

  • Convert video files to audio
  • Transcribe audio files using Whisper
  • Support for multiple languages
  • Drag & drop support
  • Save transcription settings

Installation

  1. Install ffmpeg (required for audio conversion):

    # Ubuntu/Debian
    sudo apt install ffmpeg
    
    # macOS
    brew install ffmpeg
    
    # Windows
    # Download from https://ffmpeg.org/download.html or use winget / choco
    winget install ffmpeg
    
  2. Install whisper-gui:

    pip install openai-whisper 
    # or pip install git+https://github.com/openai/whisper.git 
    pip install PySide6
    pip install whisper-gui
    

Usage

  1. Launch the application:

    whisper-gui
    
  2. Either:

    • Open a video file and convert it to audio
    • Open an audio file directly
  3. Select language and model size

  4. Click "Transcribe" to generate text from speech

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_gui-0.2.0.tar.gz (11.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

whisper_gui-0.2.0-py3-none-any.whl (9.9 kB view details)

Uploaded Python 3

File details

Details for the file whisper_gui-0.2.0.tar.gz.

File metadata

  • Download URL: whisper_gui-0.2.0.tar.gz
  • Upload date:
  • Size: 11.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for whisper_gui-0.2.0.tar.gz
Algorithm Hash digest
SHA256 98af6d9af7f85d92a9bbde6a5805636d1a6236711f8f96176b34414d6b7f71e3
MD5 715fb5e5058841e51c69ffcbac39c2ca
BLAKE2b-256 2656b68e80650e52157aecfa68eaa36a1a7736def235506ebd3781b271bab0f3

See more details on using hashes here.

Provenance

The following attestation bundles were made for whisper_gui-0.2.0.tar.gz:

Publisher: python-publish.yml on fbergmann/whisper-gui

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file whisper_gui-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: whisper_gui-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 9.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for whisper_gui-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 89953227713a943cb45aefc73a98b59cc38164cb3a6bfb23216569fe7e39070f
MD5 e7518b6c62e8a5dfb092e795e79002f4
BLAKE2b-256 a938a166c775527c66fc5ce27883b63b8fb91f8936bdb0cdf75928b7e7b97ae7

See more details on using hashes here.

Provenance

The following attestation bundles were made for whisper_gui-0.2.0-py3-none-any.whl:

Publisher: python-publish.yml on fbergmann/whisper-gui

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page