Skip to main content

Sarvam AI STT, TTS, and LLM integration for Vision Agents

Project description

Sarvam AI Plugin

This plugin provides STT, TTS, and LLM capabilities using Sarvam AI, a suite of AI models built for Indian languages.

Features

  • STT: WebSocket streaming speech-to-text (Saarika / Saaras) with Voice Activity Detection for turn events.
  • TTS: WebSocket streaming text-to-speech (Bulbul) with configurable speaker, pace, and language.
  • LLM: OpenAI-compatible chat completions (Sarvam-30B / Sarvam-105B / Sarvam-M) via the existing ChatCompletionsLLM from the OpenAI plugin.

Installation

uv add vision-agents-plugins-sarvam

Usage

from vision_agents.core import Agent, User
from vision_agents.plugins import getstream, sarvam, smart_turn

agent = Agent(
    edge=getstream.Edge(),
    agent_user=User(name="Sarvam AI"),
    instructions="Reply in Hindi or English, whichever the user speaks",
    llm=sarvam.LLM(model="sarvam-30b"),
    stt=sarvam.STT(language="hi-IN"),
    tts=sarvam.TTS(speaker="shubh"),
    turn_detection=smart_turn.TurnDetection(),
)

All three services read the same SARVAM_API_KEY environment variable and send it via the api-subscription-key header.

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_sarvam-0.5.5.tar.gz (10.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vision_agents_plugins_sarvam-0.5.5-py3-none-any.whl (25.8 kB view details)

Uploaded Python 3

File details

Details for the file vision_agents_plugins_sarvam-0.5.5.tar.gz.

File metadata

  • Download URL: vision_agents_plugins_sarvam-0.5.5.tar.gz
  • Upload date:
  • Size: 10.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_sarvam-0.5.5.tar.gz
Algorithm Hash digest
SHA256 87434d3e0534274e3e276859f7d93db5f92a1bd285b120396fd8d0bf0837f34e
MD5 d91e7b16db9deed5435fb698b7e2dcb8
BLAKE2b-256 fed5935dc6ba6f030ab3cb0a0b54745c255e2c935c09959644d2b3d2c468a97b

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_sarvam-0.5.5-py3-none-any.whl.

File metadata

  • Download URL: vision_agents_plugins_sarvam-0.5.5-py3-none-any.whl
  • Upload date:
  • Size: 25.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_sarvam-0.5.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e20c3889bbbbff77fec224c3e793ce7fa4a70dd349b2d6d626f371187c1cb783
MD5 28658798d3167c77c4fe3d2bf1f48e4b
BLAKE2b-256 8eb99dd17cdea1555c11988774985f8b39e12a28b5daa3b8c73d43858e7caab5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page