Skip to main content

Fast Whisper STT integration for Vision Agents

Project description

Fast Whisper STT Plugin

Fast Whisper STT plugin for Vision Agents, providing real-time audio transcription using faster-whisper.

Features

  • Fast inference using CTranslate2-based Whisper implementation
  • Support for multiple model sizes (tiny, base, small, medium, large, large-v2, large-v3)
  • Automatic language detection or manual language specification
  • CPU and GPU support
  • Quantization support (int8, float16, float32)

Installation

uv add vision-agents[fast-whisper]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_fast_whisper-0.2.0.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.0.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.0.tar.gz
Algorithm Hash digest
SHA256 a7702bfb47dd697e8929e4bb43a0a14bc3fc11920680994c3021ce82b3e5905c
MD5 030c067e7fa0c7595ccfd4e2e68da939
BLAKE2b-256 2261016017be78cb70b6728331faef8bbe14b1dc096dadddbf81553b6f5654df

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_fast_whisper-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_fast_whisper-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b738e27e3388410394fc01df4ac97bfa641e23d73900a9902715b4442958342d
MD5 fd802a032e735de1e742d8414ca2640d
BLAKE2b-256 fe2c66ada969df113caf0ce890b1b58f9bdb85478f49f5e86344e2844abfeba5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page