Fast Whisper STT integration for Vision Agents

Fast Whisper STT Plugin

Fast Whisper speech-to-text (STT) plugin for Vision Agents. It provides real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Configurable compute precision, including quantized inference (int8, float16, float32)

Installation

uv add "vision-agents[fast-whisper]"
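
Example

The plugin's own Python interface is not documented here, so the following is only a minimal sketch of how the options in the feature list (model size, device, compute precision, language detection) map onto the underlying faster-whisper library. The helper names build_model_options and transcribe_file are hypothetical and are not part of the plugin; WhisperModel and its transcribe method come from faster-whisper.

```python
# Option values taken from the feature list above.
MODEL_SIZES = {"tiny", "base", "small", "medium", "large", "large-v2", "large-v3"}
DEVICES = {"cpu", "cuda", "auto"}
COMPUTE_TYPES = {"int8", "float16", "float32"}


def build_model_options(model_size="base", device="cpu", compute_type="int8"):
    """Validate the options and return keyword arguments for WhisperModel.

    Hypothetical helper for illustration; it is not part of the plugin.
    """
    if model_size not in MODEL_SIZES:
        raise ValueError(f"unknown model size: {model_size!r}")
    if device not in DEVICES:
        raise ValueError(f"unknown device: {device!r}")
    if compute_type not in COMPUTE_TYPES:
        raise ValueError(f"unknown compute type: {compute_type!r}")
    return {
        "model_size_or_path": model_size,
        "device": device,
        "compute_type": compute_type,
    }


def transcribe_file(path, language=None, **options):
    """Transcribe one audio file and return (detected_language, text).

    Hypothetical helper; it calls faster-whisper directly, not the plugin.
    """
    # Imported lazily so build_model_options works without faster-whisper.
    from faster_whisper import WhisperModel

    model = WhisperModel(**build_model_options(**options))
    # language=None lets faster-whisper detect the language;
    # pass a code such as "en" to set it manually.
    segments, info = model.transcribe(path, language=language)
    # segments is a generator; transcription runs as it is consumed.
    text = " ".join(segment.text.strip() for segment in segments)
    return info.language, text
```

Assuming faster-whisper is installed and a local file named audio.wav exists (both are assumptions), transcribe_file("audio.wav", model_size="small") would load the small model on the CPU with int8 compute and return the detected language code together with the joined transcript. Passing device="cuda" and compute_type="float16" is the usual pairing for GPU inference.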

Download files

Source Distribution

vision_agents_plugins_fast_whisper-0.3.1.tar.gz (4.3 kB)

Built Distribution

vision_agents_plugins_fast_whisper-0.3.1-py3-none-any.whl

Hashes for vision_agents_plugins_fast_whisper-0.3.1.tar.gz

Algorithm    Hash digest
SHA256       a0382252a3d610dc14108f0239d0dc42036b5f182d455f46ff0c64725d1b4d72
MD5          78ba300d805d3137bdc4fcb7e83aae45
BLAKE2b-256  6310f5692bef8c999a99ac31b4d4a6e11aea564c6c4a6ee315a039fc9dcd4bd2

Hashes for vision_agents_plugins_fast_whisper-0.3.1-py3-none-any.whl

Algorithm    Hash digest
SHA256       8768df647604b8ac52a1c54a870d0af56532d2475a8fc596e41ebf3f8371cd43
MD5          f7e99a09b7d42e2fb7787f194b0a5b36
BLAKE2b-256  10aac9b82f2d0f8d56fc4a38266e26c965def1419a99ab822049823d88dd2f1c
