Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.2.4.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.4.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.4.tar.gz
Algorithm Hash digest
SHA256 c581aa63769ab3254dcc8105ab966902326b10723e5267fbdb8f72101a4b50aa
MD5 5c24f2913cf642c19a2e802985395bd0
BLAKE2b-256 e142cbbbd681b36a04d9eca5ec4e2f0ce814e02f133271af4ef190667ddb596f

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.4-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 bad6adb48b74dbc5295c2eae29d7948806d6005749f4befdd15f18119e451369
MD5 cbd5c0052fdb1ff35e279147597707be
BLAKE2b-256 b6aa7d1954f8ed13fec331fc96efa47df5e2d1010e24bf2a04d6de29f4cf0733

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page