Fast Whisper STT integration for Vision Agents

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)
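The options in the list above correspond to constructor and transcription parameters of the underlying faster-whisper library. The following is a minimal sketch, assuming direct use of faster-whisper's `WhisperModel` rather than this plugin's own wrapper, whose constructor is not documented here. `WhisperSettings`, `load_model`, and `transcribe_file` are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Values copied from the feature list above.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large", "large-v2", "large-v3")
COMPUTE_TYPES = ("int8", "float16", "float32")
# Device strings accepted by faster-whisper; "cuda" selects the GPU.
DEVICES = ("cpu", "cuda", "auto")


@dataclass(frozen=True)
class WhisperSettings:
    """Hypothetical settings holder; not part of the plugin's API."""

    model_size: str = "base"
    device: str = "cpu"
    compute_type: str = "int8"
    language: Optional[str] = None  # None asks the model to detect the language

    def __post_init__(self) -> None:
        # Fail early on values the feature list does not mention.
        if self.model_size not in MODEL_SIZES:
            raise ValueError(f"unknown model size: {self.model_size!r}")
        if self.device not in DEVICES:
            raise ValueError(f"unknown device: {self.device!r}")
        if self.compute_type not in COMPUTE_TYPES:
            raise ValueError(f"unknown compute type: {self.compute_type!r}")


def load_model(settings: WhisperSettings):
    """Build a faster-whisper model from the settings.

    The import is deferred so the settings can be validated
    without faster-whisper being installed.
    """
    from faster_whisper import WhisperModel

    return WhisperModel(
        settings.model_size,
        device=settings.device,
        compute_type=settings.compute_type,
    )


def transcribe_file(model, path: str, settings: WhisperSettings) -> str:
    """Transcribe one audio file and return the joined segment text."""
    segments, _info = model.transcribe(path, language=settings.language)
    # `segments` is a lazy generator; iterating it runs the transcription.
    return " ".join(segment.text.strip() for segment in segments)
```

On a machine without a GPU, `WhisperSettings(model_size="small", device="cpu", compute_type="int8")` is a reasonable starting point; `float16` is generally intended for GPU inference.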

Installation

uv add "vision-agents[fast-whisper]"

Download files

Source Distribution

vision_agents_plugins_fast_whisper-0.3.8.tar.gz (4.3 kB view details)

Built Distribution

vision_agents_plugins_fast_whisper-0.3.8-py3-none-any.whl

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.8.tar.gz.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.8.tar.gz

Algorithm    Hash digest
SHA256       bddde52b24d7b8ad988e2f428bc358db2a087804d326cd168ff7cf12362cfb0e
MD5          1a464ba66471dd0c1e56b8773af7583a
BLAKE2b-256  c05b38e8e2e4abd26f1049e639f445324babbc58dc61457f7cda659e71b87c01

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.8-py3-none-any.whl.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.8-py3-none-any.whl

Algorithm    Hash digest
SHA256       3ce64ddc7395d6e6b57d445993862bd7105eac1c2032202a7cb9da997cf82ad0
MD5          56eb2cd015f05187a9dfa54f97ca0a04
BLAKE2b-256  532e39bde9e5a3faedc712797a08593b77ab90771d23925cd82baba9b42d7369
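
A downloaded file can be checked against the SHA256 digests listed above before it is installed. The sketch below uses only Python's standard `hashlib` module; `sha256_of` and `matches_published_hash` are hypothetical helper names, and the local file path is whatever location the file was saved to.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("vision_agents_plugins_fast_whisper-0.3.8.tar.gz", "bddde52b24d7b8ad988e2f428bc358db2a087804d326cd168ff7cf12362cfb0e")` should return `True` for an intact copy of the source distribution.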
