Deepgram STT and TTS integration for Vision Agents
Project description
Deepgram Plugin
Speech-to-Text (STT) and Text-to-Speech (TTS) plugins for Vision Agents using the Deepgram API.
Installation
uv add "vision-agents[deepgram]"
# or directly
uv add vision-agents-plugins-deepgram
Speech-to-Text (STT)
High-quality speech recognition using Deepgram's Flux model with built-in turn detection.
from vision_agents.plugins import deepgram
stt = deepgram.STT(
model="flux-general-en", # Default model
eager_turn_detection=True, # Enable eager end-of-turn detection
)
STT Docs
- https://developers.deepgram.com/docs/flux/quickstart
- https://github.com/deepgram/deepgram-python-sdk/blob/main/examples/listen/v2/connect/async.py
Text-to-Speech (TTS)
Low-latency text-to-speech using Deepgram's Aura model via WebSocket streaming.
from vision_agents.plugins import deepgram
tts = deepgram.TTS(
model="aura-2-thalia-en", # Default voice
sample_rate=16000, # Audio sample rate
)
Available Voices
Deepgram offers various Aura voice models:
aura-2-thalia-en- Default female voiceaura-2-orion-en- Male voice- See TTS Models for all options
TTS Docs
- https://developers.deepgram.com/docs/tts-websocket
- https://developers.deepgram.com/docs/streaming-text-to-speech
Environment Variables
Set DEEPGRAM_API_KEY in your environment or pass api_key to the constructor.
Example
See the example directory for a complete working example using both STT and TTS.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vision_agents_plugins_deepgram-0.5.0.tar.gz.
File metadata
- Download URL: vision_agents_plugins_deepgram-0.5.0.tar.gz
- Upload date:
- Size: 7.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
14cb8d2d400a1f86bf9088e70098da8e853c035c118403f8b1dfe12dea884663
|
|
| MD5 |
09981fb5a633c4b35f370eafe8b324b1
|
|
| BLAKE2b-256 |
95d8970db39c64764c8d730502e9e0eb815a1eeb47c86a82bd14f80f0a276f80
|
File details
Details for the file vision_agents_plugins_deepgram-0.5.0-py3-none-any.whl.
File metadata
- Download URL: vision_agents_plugins_deepgram-0.5.0-py3-none-any.whl
- Upload date:
- Size: 17.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
10a714b980fce1c338c65b0cec20f87ecbafdfcc397f88a7f5b22d435c07f8c5
|
|
| MD5 |
32f789cf6ac0c98aaf94ab685b5f35c8
|
|
| BLAKE2b-256 |
e7000a68152c6a93e8808c9d775443b6568018ca02dfd2e6b5f9d119c2f93f3e
|