Deepgram STT and TTS integration for Vision Agents

Deepgram Plugin

Speech-to-Text (STT) and Text-to-Speech (TTS) plugins for Vision Agents using the Deepgram API.

Installation

uv add "vision-agents[deepgram]"
# or directly
uv add vision-agents-plugins-deepgram

Speech-to-Text (STT)

High-quality speech recognition using Deepgram's Flux model with built-in turn detection.

from vision_agents.plugins import deepgram

stt = deepgram.STT(
    model="flux-general-en",  # Default model
    eager_turn_detection=True,  # Enable eager end-of-turn detection
)

STT Docs

Text-to-Speech (TTS)

Low-latency text-to-speech using Deepgram's Aura model via WebSocket streaming.

from vision_agents.plugins import deepgram

tts = deepgram.TTS(
    model="aura-2-thalia-en",  # Default voice
    sample_rate=16000,  # Audio sample rate
)
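To get a feel for what sample_rate=16000 means for downstream buffers, here is a rough arithmetic sketch. It assumes the audio is 16-bit mono linear PCM, which this README does not state, so treat the byte widths as assumptions. The helper names are hypothetical and not part of the plugin.

```python
# Hypothetical helpers, not part of vision_agents or the Deepgram SDK.
# Assumption: 16-bit (2-byte) mono linear PCM; adjust if your output differs.

def pcm_bytes_per_second(sample_rate, bytes_per_sample=2, channels=1):
    """Raw PCM throughput in bytes per second."""
    return sample_rate * bytes_per_sample * channels


def pcm_frame_bytes(sample_rate, frame_ms, bytes_per_sample=2, channels=1):
    """Size in bytes of one audio frame lasting frame_ms milliseconds."""
    samples = sample_rate * frame_ms // 1000
    return samples * bytes_per_sample * channels


if __name__ == "__main__":
    print(pcm_bytes_per_second(16000))  # throughput at 16 kHz
    print(pcm_frame_bytes(16000, 20))   # one 20 ms frame at 16 kHz
```

Under these assumptions, 16 kHz audio amounts to 32,000 bytes per second, and a 20 ms frame is 640 bytes.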

Available Voices

Deepgram offers several Aura voice models, including:

  • aura-2-thalia-en - Default female voice
  • aura-2-orion-en - Male voice
  • See TTS Models for all options

TTS Docs

Environment Variables

Set DEEPGRAM_API_KEY in your environment or pass api_key to the constructor.
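The two options can be combined in a small helper, sketched below. The function name resolve_deepgram_api_key is hypothetical, and letting an explicit key take precedence over the environment is an assumption of this sketch, not documented plugin behavior. The constructor calls appear only as comments because they need the plugin installed.

```python
import os

# Hypothetical helper, not part of the plugin.
# Assumption: an explicitly supplied key wins over DEEPGRAM_API_KEY.
def resolve_deepgram_api_key(explicit_key=None, env=os.environ):
    """Return the explicit key if given, otherwise DEEPGRAM_API_KEY from env."""
    key = explicit_key or env.get("DEEPGRAM_API_KEY")
    if not key:
        raise RuntimeError(
            "Set DEEPGRAM_API_KEY in the environment or pass api_key explicitly"
        )
    return key


# Usage with the plugin installed (api_key is the constructor argument
# mentioned above):
#
#   from vision_agents.plugins import deepgram
#   api_key = resolve_deepgram_api_key()
#   stt = deepgram.STT(api_key=api_key)
#   tts = deepgram.TTS(api_key=api_key)
```

Failing early with a clear message can be easier to debug than an authentication error raised later when the connection opens.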

Example

See the example directory for a complete working example using both STT and TTS.
