Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.2.3.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.3.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.3.tar.gz
Algorithm Hash digest
SHA256 aa114c6c942b009cb828522ae1763427d7db5594dd8c4294463385ffec79e5bf
MD5 d9677056bdc445f9268aeb8d4efbf3c5
BLAKE2b-256 2a31277a228ddd84b6f2df8ab78aa8215dbb95b162afd9f2d61817c28584f0b8

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.3-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 c04fdfd31be29b9904cc33b3f5aa701053084e1c5fd549da71298a11e6ad2bde
MD5 1cbfe20fe1d04c1c8abe2926faf3329a
BLAKE2b-256 87d9ec6e4b2a2d428349e1ed4679d8f10d9f90e358631b828077b85be63a4490

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page