Fast Whisper STT integration for Vision Agents

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)
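
The options listed above map onto arguments of the underlying faster-whisper library. The sketch below is illustrative only: `WhisperSettings` and `transcribe_file` are hypothetical helpers, not part of this plugin's API, and the plugin's own constructor may use different parameter names. Only `WhisperModel` and its `transcribe` method come from faster-whisper itself, and the device check is deliberately limited to "cpu" and "cuda" for simplicity.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Values taken from the feature list above.
MODEL_SIZES = ("tiny", "base", "small", "medium", "large", "large-v2", "large-v3")
COMPUTE_TYPES = ("int8", "float16", "float32")
DEVICES = ("cpu", "cuda")


@dataclass(frozen=True)
class WhisperSettings:
    """Hypothetical settings holder; not part of the plugin's API."""

    model_size: str = "base"
    device: str = "cpu"
    compute_type: str = "int8"
    language: Optional[str] = None  # None lets faster-whisper detect the language

    def __post_init__(self) -> None:
        # Reject values outside the documented options early.
        if self.model_size not in MODEL_SIZES:
            raise ValueError(f"unknown model size: {self.model_size!r}")
        if self.compute_type not in COMPUTE_TYPES:
            raise ValueError(f"unknown compute type: {self.compute_type!r}")
        if self.device not in DEVICES:
            raise ValueError(f"unknown device: {self.device!r}")


def transcribe_file(path: str, settings: WhisperSettings) -> Tuple[str, str]:
    """Return (language, text) for an audio file using faster-whisper directly."""
    # Imported lazily so the settings class works without the dependency installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(
        settings.model_size,
        device=settings.device,
        compute_type=settings.compute_type,
    )
    # transcribe() returns a lazy iterator of segments plus transcription info.
    segments, info = model.transcribe(path, language=settings.language)
    text = " ".join(segment.text.strip() for segment in segments)
    return info.language, text
```

For example, `transcribe_file("speech.wav", WhisperSettings(model_size="small", language="en"))` would skip language detection and transcribe with the small model on CPU using int8 quantization; the file name here is a placeholder.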

Installation

uv add "vision-agents[fast-whisper]"

Download files

Source Distribution

vision_agents_plugins_fast_whisper-0.2.9.tar.gz (4.3 kB)

Built Distribution

vision_agents_plugins_fast_whisper-0.2.9-py3-none-any.whl

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.9.tar.gz.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.9.tar.gz
Algorithm Hash digest
SHA256 d3f560daa8df52347c285ee2a34d78b76a8402283255ca9eb30fd211bf58fe35
MD5 3b7debaf4d0a4337c69c6bd13099cde8
BLAKE2b-256 f3203501730c02e333719975b9ce600cc61cf2f4fd06e49711d8aacd8a1e0054
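
A downloaded file can be checked against the published SHA256 digest before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_published_digest` are illustrative and not part of any packaging tool.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes at EOF.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 with a published digest, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

With the source distribution saved locally, calling `matches_published_digest("vision_agents_plugins_fast_whisper-0.2.9.tar.gz", "d3f560daa8df52347c285ee2a34d78b76a8402283255ca9eb30fd211bf58fe35")` should return True if the download is intact.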

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.9-py3-none-any.whl.

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.9-py3-none-any.whl
Algorithm Hash digest
SHA256 f78a18e871550da4e287eb653e3150eea203424ee9ac2eee9f9e11c0b5ae23f6
MD5 a1c9814776a665cb113e4e11c4a90c22
BLAKE2b-256 25d5f3ecce3438361572d6134d3fc4d628a28c2f354fd4583cdb180bbddaeb30
