Fast Whisper STT integration for Vision Agents

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using a CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)
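The options in the feature list map onto parameters of the underlying faster-whisper library. The sketch below calls faster-whisper directly rather than the plugin, whose own API is not shown here. The helper names (`pick_compute_type`, `transcribe_file`) are hypothetical, and the choice of int8 on CPU and float16 on GPU is an assumed default, not something the plugin is documented to do.

```python
from typing import Optional, Tuple

# Model sizes and quantization modes taken from the feature list.
MODEL_SIZES = {"tiny", "base", "small", "medium", "large", "large-v2", "large-v3"}
COMPUTE_TYPES = {"int8", "float16", "float32"}


def pick_compute_type(device: str) -> str:
    """Assumed default (hypothetical helper): float16 on GPU, int8 elsewhere."""
    return "float16" if device == "cuda" else "int8"


def transcribe_file(
    path: str,
    model_size: str = "base",
    device: str = "cpu",
    language: Optional[str] = None,
    compute_type: Optional[str] = None,
) -> Tuple[str, str]:
    """Transcribe an audio file and return (language code, transcript)."""
    if model_size not in MODEL_SIZES:
        raise ValueError(f"unknown model size: {model_size!r}")
    compute_type = compute_type or pick_compute_type(device)
    if compute_type not in COMPUTE_TYPES:
        raise ValueError(f"unknown compute type: {compute_type!r}")

    # Imported lazily so the validation above works without faster-whisper.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device=device, compute_type=compute_type)
    # language=None lets faster-whisper detect the language itself.
    segments, info = model.transcribe(path, language=language)
    # segments is a generator; decoding happens while it is consumed.
    text = " ".join(segment.text.strip() for segment in segments)
    return info.language, text
```

Calling `transcribe_file("speech.wav")` would download the `base` model on first use and let faster-whisper detect the language; passing `language="en"` skips detection.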

Installation

uv add "vision-agents[fast-whisper]"

Source Distribution

vision_agents_plugins_fast_whisper-0.3.7.tar.gz (4.3 kB view details)

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.7.tar.gz.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.7.tar.gz

Algorithm    Hash digest
SHA256       47ddc274b70e703220448a4eb7d191851ac1c9f5e0008ddb7a6f4a738c679994
MD5          e2d16fb8d18981a57daff5f543b2a7e3
BLAKE2b-256  ababc1892e6a6ddf28cae7d158bc5cc96ea3ac3286644ef0ce08941e5d067ad6
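A downloaded file can be checked against the SHA256 digest listed for it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `matches_sha256` are hypothetical, and the file is assumed to have already been downloaded to a local path.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_sha256(path, expected_hex):
    """Compare a file's digest with a published hex digest, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For the source distribution, `matches_sha256("vision_agents_plugins_fast_whisper-0.3.7.tar.gz", "47ddc274b70e703220448a4eb7d191851ac1c9f5e0008ddb7a6f4a738c679994")` should return `True` for an intact download.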

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.7-py3-none-any.whl.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.7-py3-none-any.whl

Algorithm    Hash digest
SHA256       43f01ad70b7f904d16708575b60b73ced82640a5ddcdb285befc4968cfeb3d51
MD5          8035636e98286cafd5346ce8c5731bed
BLAKE2b-256  03bb1e9ec95ab5701e660aa6eca7fa22a5c2f44d8680a7cfca5b06b29c062956
