A GUI application for Whisper speech recognition

whisper-gui

Basic GUI for openai-whisper

A simple graphical user interface for OpenAI's Whisper speech recognition system.

Features

  • Convert video files to audio
  • Transcribe audio files using Whisper
  • Support for multiple languages
  • Drag & drop support
  • Save transcription settings

Installation

  1. Install ffmpeg (required for audio conversion):

    # Ubuntu/Debian
    sudo apt install ffmpeg
    
    # macOS
    brew install ffmpeg
    
    # Windows
    # Download from https://ffmpeg.org/download.html or use winget / choco
    winget install ffmpeg
    
  2. Install whisper-gui:

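    # Whisper itself, from PyPI
    pip install openai-whisper
    # Alternative: install Whisper from the GitHub repository instead
    # pip install git+https://github.com/openai/whisper.git
    # Qt bindings for the GUI
    pip install PySide6
    pip install whisper-gui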
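
To confirm that the installation steps above took effect, a short check along these lines can help. It is only a sketch, not part of whisper-gui: the function name `missing_requirements` is hypothetical, and the import names `whisper` (for openai-whisper) and `PySide6` are assumed to be the modules those packages provide.

```python
import importlib.util
import shutil

# Assumed import names: openai-whisper provides "whisper", PySide6 provides "PySide6".
REQUIRED_MODULES = ("whisper", "PySide6")


def missing_requirements(modules=REQUIRED_MODULES, executable="ffmpeg"):
    """Return a description of every requirement that cannot be found."""
    missing = []
    # shutil.which returns None when the executable is not on PATH.
    if shutil.which(executable) is None:
        missing.append(f"{executable} (not on PATH)")
    for name in modules:
        # find_spec returns None when a top-level module cannot be imported.
        if importlib.util.find_spec(name) is None:
            missing.append(f"{name} (Python module not importable)")
    return missing


if __name__ == "__main__":
    problems = missing_requirements()
    if problems:
        print("Missing:", ", ".join(problems))
    else:
        print("ffmpeg, whisper and PySide6 were all found.")
```

An empty result means ffmpeg is on the PATH and both modules can be located by the current Python interpreter.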
    

Usage

  1. Launch the application:

    whisper-gui
    
  2. Either:

    • Open a video file and convert it to audio
    • Open an audio file directly

  3. Select language and model size

  4. Click "Transcribe" to generate text from speech
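
For readers who want to script the same workflow, the steps above correspond roughly to the sketch below. It is not whisper-gui's own code: the helper names `ffmpeg_extract_command` and `transcribe` are hypothetical, and the ffmpeg options (mono, 16 kHz WAV) are an assumption chosen because Whisper resamples audio to 16 kHz internally. Only `whisper.load_model` and `model.transcribe` come from the openai-whisper package.

```python
import subprocess
from pathlib import Path


def ffmpeg_extract_command(video, audio):
    """Build an ffmpeg command that writes the audio track of `video` to `audio`."""
    return [
        "ffmpeg",
        "-y",            # overwrite the output file if it already exists
        "-i", str(video),
        "-vn",           # drop the video stream
        "-ac", "1",      # mono
        "-ar", "16000",  # 16 kHz sample rate
        str(audio),
    ]


def transcribe(media, model_size="base", language=None):
    """Convert `media` to WAV if needed, then transcribe it with Whisper."""
    import whisper  # imported lazily so the helper above works without it

    media = Path(media)
    audio = media.with_suffix(".wav")
    if media.suffix.lower() != ".wav":
        subprocess.run(ffmpeg_extract_command(media, audio), check=True)
    model = whisper.load_model(model_size)  # e.g. "tiny", "base", "small"
    # language=None lets Whisper detect the spoken language itself.
    result = model.transcribe(str(audio), language=language)
    return result["text"]


# Example (requires ffmpeg and openai-whisper to be installed):
# print(transcribe("talk.mp4", model_size="small", language="en"))
```

Larger model sizes are generally more accurate but slower and need more memory, which is the trade-off behind the model size selection in step 3.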

Download files

Source Distribution

whisper_gui-0.2.1.tar.gz (55.1 kB)

Uploaded Source

Built Distribution

whisper_gui-0.2.1-py3-none-any.whl (52.6 kB)

Uploaded Python 3

File details

Details for the file whisper_gui-0.2.1.tar.gz.

File metadata

  • Download URL: whisper_gui-0.2.1.tar.gz
  • Upload date:
  • Size: 55.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for whisper_gui-0.2.1.tar.gz
Algorithm    Hash digest
SHA256       db2a407cd69e74d3b10df4a632f6b2c7d70403e4ac06e091132c2e7ffac5554b
MD5          6cc20ac99346d1bd91382a2eeaff79d2
BLAKE2b-256  c1896c6a22765fd39183740d353946e3754a4d41cd26e30224d5f8a8e6417ba9
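
A downloaded file can be checked against the SHA256 digest listed above with a few lines of standard-library Python. This is a minimal sketch: the function names `sha256_of_file` and `matches` are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest published for whisper_gui-0.2.1.tar.gz
EXPECTED_SHA256 = "db2a407cd69e74d3b10df4a632f6b2c7d70403e4ac06e091132c2e7ffac5554b"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read until handle.read() returns the empty bytes sentinel.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected=EXPECTED_SHA256):
    """True when the file's digest equals `expected` (case-insensitive)."""
    return sha256_of_file(path) == expected.lower()


# Example, assuming the archive is in the current directory:
# print(matches("whisper_gui-0.2.1.tar.gz"))
```

A mismatch means the file differs from the one that was uploaded and should not be installed.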

Provenance

The following attestation bundles were made for whisper_gui-0.2.1.tar.gz:

Publisher: python-publish.yml on fbergmann/whisper-gui

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file whisper_gui-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: whisper_gui-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 52.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for whisper_gui-0.2.1-py3-none-any.whl
Algorithm    Hash digest
SHA256       8d8d73f6e74b4b8413e41078d3dbf26a76e8c954f53b353154e4dba11e144e28
MD5          b7190e8bae0205d2d1b6aa166ec9a27b
BLAKE2b-256  6f7d10a684db3a508cb8f00b916e53ce054e9b4a4ecc46886dd92fe4ce653812

Provenance

The following attestation bundles were made for whisper_gui-0.2.1-py3-none-any.whl:

Publisher: python-publish.yml on fbergmann/whisper-gui

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
