Skip to main content

Inworld AI TTS integration for Vision Agents

Project description

Inworld AI Text-to-Speech Plugin

A high-quality Text-to-Speech (TTS) plugin for Vision Agents that uses the Inworld AI API with streaming support.

Installation

uv add vision-agents[inworld]

Usage

from vision_agents.plugins import inworld

# Initialize with API key from environment variable
tts = inworld.TTS()

# Or specify API key and other options directly
tts = inworld.TTS(
    api_key="your_inworld_api_key",
    voice_id="Dennis",
    model_id="inworld-tts-1.5-max",
    temperature=1.1
)

# Use with an Agent
from vision_agents.core import Agent
from vision_agents.plugins import getstream, gemini, smart_turn

agent = Agent(
    edge=getstream.Edge(),
    tts=inworld.TTS(),
    llm=gemini.LLM(),
    turn_detection=smart_turn.TurnDetection(),
)

Configuration Options

  • api_key: Inworld AI API key (default: reads from INWORLD_API_KEY environment variable)
  • voice_id: The voice ID to use for synthesis (default: "Dennis")
  • model_id: The model ID to use for synthesis. Options: "inworld-tts-1.5-max", "inworld-tts-1.5-min" "inworld-tts-1", "inworld-tts-1-max" (default: "inworld-tts-1.5-max")
  • temperature: Determines the degree of randomness when sampling audio tokens. Accepts values between 0 and 2 (default: 1.1)

Requirements

  • Python 3.10+
  • httpx>=0.27.0 "av>=10.0.0",

Getting Started

  1. Get your Inworld AI API key from the Inworld Portal
  2. Set the INWORLD_API_KEY environment variable:
    export INWORLD_API_KEY="your_api_key_here"
    
  3. Use the plugin in your Vision Agents application

API Reference

The plugin implements the standard Vision Agents TTS interface:

  • stream_audio(text: str): Convert text to speech and return an async iterator of PcmData chunks
  • stop_audio(): Stop audio playback (no-op for this plugin)
  • send(text: str): Send text to be converted to speech (inherited from base class)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_inworld-0.3.7.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vision_agents_plugins_inworld-0.3.7-py3-none-any.whl (10.6 kB view details)

Uploaded Python 3

File details

Details for the file vision_agents_plugins_inworld-0.3.7.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_inworld-0.3.7.tar.gz
Algorithm Hash digest
SHA256 5ab41d20d098a6295adb6da138db939b931965a90f6256d294f98ef29a5c1fc9
MD5 dc52368e06882695832adf74adc62c4c
BLAKE2b-256 6ea3fcc9e32272b355232bd990b915ceedc028734cb0d313d0f156f74b40532b

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_inworld-0.3.7-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_inworld-0.3.7-py3-none-any.whl
Algorithm Hash digest
SHA256 e993cf09ab568098435e5c2d0d328b627652010fb2d4a0fea19ae5ebfcb7a5c9
MD5 f8210d7367d66ff28f9443b0e38460f3
BLAKE2b-256 4e057fca2f66881d1a9a475b7b145889fda10da543f77530ce6b2ef0002ec137

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page