Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.2.5.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.5.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.5.tar.gz
Algorithm Hash digest
SHA256 fe3c1abd3c3b0501659d78332295f5c017327e1368d2e45ef00fae025de5aef5
MD5 96520544c998afb7bb228e280a00cafc
BLAKE2b-256 c6b9ec28ccabde0c15b183845f6855855a7e08fafd592c48f2632cf986ba2e15

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.5-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e8421938a80d103ede58802ee91644789b0206114ca31f5efd9bd5831529edc2
MD5 d4c7986c03cea5b2634e5a138581035d
BLAKE2b-256 409feb43fbab3457b1daa3d43b46ef1adfae0e4004623b984f16add77be4befe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page