Fast Whisper STT integration for Vision Agents

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using a CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)
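
The options listed above can be sketched as a small settings object. This is only an illustration under stated assumptions: `WhisperOptions` and `transcribe_file` are hypothetical names, not part of this plugin, and the sketch calls the underlying faster-whisper library directly (`WhisperModel` and its `transcribe` method) rather than the plugin's own STT class, which this description does not show.

```python
from dataclasses import dataclass
from typing import Optional

# Values taken from the feature list above.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large", "large-v2", "large-v3")
COMPUTE_TYPES = ("int8", "float16", "float32")
# Assumption: faster-whisper's device names ("cuda" selects the GPU).
DEVICES = ("cpu", "cuda", "auto")


@dataclass(frozen=True)
class WhisperOptions:
    """Hypothetical container for the settings named in the feature list."""

    model_size: str = "base"
    device: str = "cpu"
    compute_type: str = "int8"
    language: Optional[str] = None  # None requests automatic language detection

    def __post_init__(self) -> None:
        # Reject values outside the documented choices before loading a model.
        if self.model_size not in MODEL_SIZES:
            raise ValueError(f"unknown model size: {self.model_size!r}")
        if self.device not in DEVICES:
            raise ValueError(f"unknown device: {self.device!r}")
        if self.compute_type not in COMPUTE_TYPES:
            raise ValueError(f"unknown compute type: {self.compute_type!r}")


def transcribe_file(path: str, opts: WhisperOptions) -> str:
    """Transcribe one audio file with faster-whisper and return the joined text."""
    # Imported lazily so the options class can be used without the dependency.
    from faster_whisper import WhisperModel

    model = WhisperModel(
        opts.model_size, device=opts.device, compute_type=opts.compute_type
    )
    # transcribe() returns a lazy iterator of segments plus detection info.
    segments, _info = model.transcribe(path, language=opts.language)
    return " ".join(segment.text.strip() for segment in segments)
```

For example, `transcribe_file("speech.wav", WhisperOptions(model_size="small", language="en"))` would load the small model on CPU with int8 weights, where `speech.wav` is a placeholder path. In practice `float16` is typically paired with a GPU device, so `int8` or `float32` is the safer choice on CPU.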

Installation

uv add "vision-agents[fast-whisper]"
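
After installing, one way to confirm that the packages landed in the active environment is to query their installed versions. The sketch below uses only the standard library; `installed_version` is a hypothetical helper, and the distribution names are assumptions (the first is inferred from the file names listed on this page, the second is the faster-whisper dependency).

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Assumed distribution names; adjust them if your environment differs.
    for name in ("vision-agents-plugins-fast-whisper", "faster-whisper"):
        print(f"{name}: {installed_version(name) or 'not installed'}")
```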

Source Distribution

vision_agents_plugins_fast_whisper-0.4.1.tar.gz (4.3 kB view details)

File details

Details for the file vision_agents_plugins_fast_whisper-0.4.1.tar.gz.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.4.1.tar.gz
Algorithm Hash digest
SHA256 c85bc3353152452bd7bec5d2d4a1fb5f1b8b8ca8a2caefd4ed8865fd6211308b
MD5 fc87455653a50d2b2d980ff0bc0d47bc
BLAKE2b-256 dcff17c64356cc7f1915594b1b844f48e26a643f42ae09803470805f6be9fd00

File details

Details for the file vision_agents_plugins_fast_whisper-0.4.1-py3-none-any.whl.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.4.1-py3-none-any.whl
Algorithm Hash digest
SHA256 25dbf5a522f67723b17e71971d8763ee860fabb82c5efca4df20102fd5c420c1
MD5 9b74657591cbded1887f65e0ee7c9b12
BLAKE2b-256 82a8ffcacedc4fa2f12a5269e03c2621c02243be93278d5e491cbf0ee7548132
