Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.2.10.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.10.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.10.tar.gz
Algorithm Hash digest
SHA256 ec88fa3a345e7332fac67ea3522f097c5024a1b9f6a76353743ff8a2f809e9ce
MD5 e9bee1e700440fd64bc7afec09cc7123
BLAKE2b-256 a8843a8df48a314bf2ccdb37741e98d10920cd3dd5b00f537cfd7e64fe6619ef

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.10-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.10-py3-none-any.whl
Algorithm Hash digest
SHA256 c18de9f672a647b7ddbc97e74b08961fdc15f1a0be89c1acd9530a6e4b42484d
MD5 0e9c864628a9349dfcb6c7accd31a2ad
BLAKE2b-256 574e7944465dc1e49be265591502dd5d6bad3d4a1796b105427d7917f3e9db9b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page