Fast Whisper STT integration for Vision Agents

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using a CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)
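
The options above correspond closely to the constructor and `transcribe` arguments of the faster-whisper library itself. The following is a minimal sketch that calls faster-whisper directly rather than going through the plugin, since the plugin's own class and parameter names are not shown in this description. `WhisperSettings` and `transcribe_file` are hypothetical helper names introduced only for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Values taken from the feature list above.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large", "large-v2", "large-v3")
COMPUTE_TYPES = ("int8", "float16", "float32")
# Device names accepted by faster-whisper's WhisperModel.
DEVICES = ("cpu", "cuda", "auto")


@dataclass(frozen=True)
class WhisperSettings:
    """Hypothetical settings holder; not part of the plugin's API."""

    model_size: str = "base"
    device: str = "cpu"
    compute_type: str = "int8"
    language: Optional[str] = None  # None lets the model detect the language

    def __post_init__(self) -> None:
        # Reject values outside the documented options early.
        if self.model_size not in MODEL_SIZES:
            raise ValueError(f"unknown model size: {self.model_size!r}")
        if self.device not in DEVICES:
            raise ValueError(f"unknown device: {self.device!r}")
        if self.compute_type not in COMPUTE_TYPES:
            raise ValueError(f"unknown compute type: {self.compute_type!r}")


def transcribe_file(path: str, settings: WhisperSettings) -> Tuple[str, str]:
    """Transcribe an audio file and return (language, text)."""
    # Imported lazily so the settings class works without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(
        settings.model_size,
        device=settings.device,
        compute_type=settings.compute_type,
    )
    # transcribe() returns a lazy generator of segments plus an info object.
    segments, info = model.transcribe(path, language=settings.language)
    text = " ".join(segment.text.strip() for segment in segments)
    return info.language, text
```

For example, `transcribe_file("speech.wav", WhisperSettings(model_size="small"))` would run the `small` model on the CPU with int8 quantization and detect the language automatically; `speech.wav` is a placeholder path.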

Installation

uv add "vision-agents[fast-whisper]"
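
To confirm that the plugin landed in the environment, the installed version can be queried from Python. This is a small sketch using only the standard library. The distribution name `vision-agents-plugins-fast-whisper` is an assumption inferred from the release file names, and `installed_version` is a hypothetical helper.

```python
from importlib import metadata
from typing import Optional

# Assumed distribution name, inferred from the release file names.
DIST_NAME = "vision-agents-plugins-fast-whisper"


def installed_version(dist_name: str = DIST_NAME) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    print(version if version else f"{DIST_NAME} is not installed")
```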

Download files

Source Distribution

vision_agents_plugins_fast_whisper-0.3.4.tar.gz (4.4 kB view details)

Built Distribution

vision_agents_plugins_fast_whisper-0.3.4-py3-none-any.whl

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.4.tar.gz.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.4.tar.gz
Algorithm    Hash digest
SHA256       b5ee19b3a15586e53d39da8539ab57e156765accbe6edccb2858565f54e6db70
MD5          3b3ad0534e63da4614eee5d313608fbe
BLAKE2b-256  3f0e0cc342a895d2b15310e6ee4ed1028f988ac31363dd608b9a06bb3b174edb
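
A downloaded file can be checked against the SHA256 digest listed above before installing it. The following is a minimal sketch using only the standard library. `sha256_of` and `matches_expected` are hypothetical helper names, and the local file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest listed above for the 0.3.4 source distribution.
EXPECTED_SHA256 = "b5ee19b3a15586e53d39da8539ab57e156765accbe6edccb2858565f54e6db70"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Compare a file's digest with the expected value, ignoring hex case."""
    return sha256_of(path) == expected.strip().lower()
```

Calling `matches_expected("vision_agents_plugins_fast_whisper-0.3.4.tar.gz")` on an intact download should return `True`; any other result suggests a corrupted or altered file.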

File details

Details for the file vision_agents_plugins_fast_whisper-0.3.4-py3-none-any.whl.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.3.4-py3-none-any.whl
Algorithm    Hash digest
SHA256       36bc883acff6c0460f27b7d72332f13b64424a4cbc6e364ca84e0936316b0072
MD5          8d093838752f6c2f4948634ef5edbf84
BLAKE2b-256  38139a1a8da6fb05e893fe98969ff29453c05820d8ed74d1a38993d07ae08fab
