Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.3.3.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.3.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.3.tar.gz
Algorithm Hash digest
SHA256 0608e413177eaffbd963db70b416af58e102bf6e1638850d73e44a72b637a489
MD5 81b517b57ffdf1663d0c47539b8473fb
BLAKE2b-256 de4f3772d9eb809ebd7b0faa6bcf277513732277aab0e03c56e4424635510f2d

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.3-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.3-py3-none-any.whl
Algorithm Hash digest
SHA256 1ce2d58349ab3bafece1d8c7fa094ae170a6d63733c1f16f82cc5a51ba26a2fa
MD5 53bf84715bd0d5b39628c45f5b6995f6
BLAKE2b-256 042ac5bb1bfdfeaa4108f9cd057688275d80f915e8ffe8cda117b903e773b77b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page