Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.3.0.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.0.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.0.tar.gz
Algorithm Hash digest
SHA256 e9523586e1d15894b27efa2c7ceb4d24e948994c5362cfc71f14678f037a278d
MD5 f5da35e09528bc30b3bde4397340df96
BLAKE2b-256 db6b4456dbe6e2cfdcce457ff9e839410254386b226a2a8327038afe8db518e9

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 192f8c0bd710520ff601c80160cb82cba6deb255cbaa39b295655cacb3b99e3d
MD5 ff778664c3a38ca7ae0e6984dbf79bb3
BLAKE2b-256 92ddef2e7fc060574a08442061c244fdb15d4641d8ce430c1df2c6f6c0242fe1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page