Skip to main content

Deepgram STT and TTS integration for Vision Agents

Project description

Deepgram Plugin

Speech-to-Text (STT) and Text-to-Speech (TTS) plugins for Vision Agents using the Deepgram API.

Installation

uv add "vision-agents[deepgram]"
# or directly
uv add vision-agents-plugins-deepgram

Speech-to-Text (STT)

High-quality speech recognition using Deepgram's Flux model with built-in turn detection.

from vision_agents.plugins import deepgram

stt = deepgram.STT(
    model="flux-general-en",  # Default model
    eager_turn_detection=True,  # Enable eager end-of-turn detection
)

STT Docs

Text-to-Speech (TTS)

Low-latency text-to-speech using Deepgram's Aura model via WebSocket streaming.

from vision_agents.plugins import deepgram

tts = deepgram.TTS(
    model="aura-2-thalia-en",  # Default voice
    sample_rate=16000,  # Audio sample rate
)

Available Voices

Deepgram offers various Aura voice models:

  • aura-2-thalia-en - Default female voice
  • aura-2-orion-en - Male voice
  • See TTS Models for all options

TTS Docs

Environment Variables

Set DEEPGRAM_API_KEY in your environment or pass api_key to the constructor.

Example

See the example directory for a complete working example using both STT and TTS.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_deepgram-0.5.1.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vision_agents_plugins_deepgram-0.5.1.tar.gz.

File metadata

  • Download URL: vision_agents_plugins_deepgram-0.5.1.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_deepgram-0.5.1.tar.gz
Algorithm Hash digest
SHA256 af6f98b95853976cf550829f1d6c12de3ca74e5476cb58018d78358d95c3d5ea
MD5 5b67e7f01310dd0a00d066197cfdf970
BLAKE2b-256 616c95ed67cba131b3c395fb818bcd99bc76b0ec1e411c37fdaa1c7eaf56e3ea

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_deepgram-0.5.1-py3-none-any.whl.

File metadata

  • Download URL: vision_agents_plugins_deepgram-0.5.1-py3-none-any.whl
  • Upload date:
  • Size: 17.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_deepgram-0.5.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d62224aa446bc73441f89ae95c944f7953854e353b55dc47abaefd134808d031
MD5 adc38db92bd0225c69937ede7010b16d
BLAKE2b-256 92411f003b1d2ad4d99ff42239ad4c7b63d65a652c287282af58e47ae9991057

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page