Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.2.1.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.1.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.1.tar.gz
Algorithm Hash digest
SHA256 6fb4c765ed4ecd7abf414bc60c404a14c0f4ba6c4f01f993789dae218629d827
MD5 19869c46f67c545876622e8975be65c0
BLAKE2b-256 f1b1e0c59b2453817efd78cb6f3031d9c763f03ae6f399d8da493f60ded9f2f0

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a260377ffcd913903bbb2f77707f448585a933013bce261545f8ebfcc093eb96
MD5 89fe37eb7e7f67981618a86721d703ca
BLAKE2b-256 bc88bfd2a837e82eb03d4f3520f4ce0b39f6c40ce0649ed9a657ba2416ac05fc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page