Skip to main content

Deepgram STT and TTS integration for Vision Agents

Project description

Deepgram Plugin

Speech-to-Text (STT) and Text-to-Speech (TTS) plugins for Vision Agents using the Deepgram API.

Installation

uv add "vision-agents[deepgram]"
# or directly
uv add vision-agents-plugins-deepgram

Speech-to-Text (STT)

High-quality speech recognition using Deepgram's Flux model with built-in turn detection.

from vision_agents.plugins import deepgram

stt = deepgram.STT(
    model="flux-general-en",  # Default model
    eager_turn_detection=True,  # Enable eager end-of-turn detection
)

STT Docs

Text-to-Speech (TTS)

Low-latency text-to-speech using Deepgram's Aura model via WebSocket streaming.

from vision_agents.plugins import deepgram

tts = deepgram.TTS(
    model="aura-2-thalia-en",  # Default voice
    sample_rate=16000,  # Audio sample rate
)

Available Voices

Deepgram offers various Aura voice models:

  • aura-2-thalia-en - Default female voice
  • aura-2-orion-en - Male voice
  • See TTS Models for all options

TTS Docs

Environment Variables

Set DEEPGRAM_API_KEY in your environment or pass api_key to the constructor.

Example

See the example directory for a complete working example using both STT and TTS.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_deepgram-0.5.2.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vision_agents_plugins_deepgram-0.5.2-py3-none-any.whl (17.4 kB view details)

Uploaded Python 3

File details

Details for the file vision_agents_plugins_deepgram-0.5.2.tar.gz.

File metadata

  • Download URL: vision_agents_plugins_deepgram-0.5.2.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_deepgram-0.5.2.tar.gz
Algorithm Hash digest
SHA256 117ed44e1ddd03e8bf09431414f1c3ba7fff7c9ca7eb5e47b8bd45316adad381
MD5 ff39fff5392ddac1dd3d448ecb19db45
BLAKE2b-256 663727977eb0cfcf1112673091a8150440a9a7117d5f744d0e19ac7c1a46d273

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_deepgram-0.5.2-py3-none-any.whl.

File metadata

  • Download URL: vision_agents_plugins_deepgram-0.5.2-py3-none-any.whl
  • Upload date:
  • Size: 17.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_deepgram-0.5.2-py3-none-any.whl
Algorithm Hash digest
SHA256 ad08caed9e47bb1e170e77e5ea58ba937f386b19378e96292ca38fb33285b1bc
MD5 96e8cf3591dcd040b7922adbfc4bdb0e
BLAKE2b-256 243d611d9ed59bed59f937702f28f1a3db5495d0e1737d1fbfe9b9d8ca4ecdb7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page