Inworld AI TTS integration for Vision Agents
Project description
Inworld AI Text-to-Speech Plugin
A high-quality Text-to-Speech (TTS) plugin for Vision Agents that uses the Inworld AI API with streaming support.
Installation
uv add vision-agents[inworld]
Usage
from vision_agents.plugins import inworld
# Initialize with API key from environment variable
tts = inworld.TTS()
# Or specify API key and other options directly
tts = inworld.TTS(
api_key="your_inworld_api_key",
voice_id="Dennis",
model_id="inworld-tts-1",
temperature=1.1
)
# Use with an Agent
from vision_agents.core import Agent
from vision_agents.plugins import getstream, gemini, smart_turn
agent = Agent(
edge=getstream.Edge(),
tts=inworld.TTS(),
llm=gemini.LLM("gemini-2.0-flash"),
turn_detection=smart_turn.TurnDetection(),
)
Configuration Options
api_key: Inworld AI API key (default: reads fromINWORLD_API_KEYenvironment variable)voice_id: The voice ID to use for synthesis (default: "Dennis")model_id: The model ID to use for synthesis. Options: "inworld-tts-1", "inworld-tts-1-max" (default: "inworld-tts-1")temperature: Determines the degree of randomness when sampling audio tokens. Accepts values between 0 and 2 (default: 1.1)
Requirements
- Python 3.10+
- httpx>=0.27.0 "av>=10.0.0",
Getting Started
- Get your Inworld AI API key from the Inworld Portal
- Set the
INWORLD_API_KEYenvironment variable:export INWORLD_API_KEY="your_api_key_here"
- Use the plugin in your Vision Agents application
API Reference
The plugin implements the standard Vision Agents TTS interface:
stream_audio(text: str): Convert text to speech and return an async iterator ofPcmDatachunksstop_audio(): Stop audio playback (no-op for this plugin)send(text: str): Send text to be converted to speech (inherited from base class)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vision_agents_plugins_inworld-0.2.10.tar.gz.
File metadata
- Download URL: vision_agents_plugins_inworld-0.2.10.tar.gz
- Upload date:
- Size: 4.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6027b9477a518b003ff3867de75bce0e5ab0cd54973c000578b4d9fac55b7b14
|
|
| MD5 |
aa6c812b7f6ec35d3f5348be3f003960
|
|
| BLAKE2b-256 |
73b199839c306327987092e4b73674263cf5287fa955e7d3e3fae954b95bbd72
|
File details
Details for the file vision_agents_plugins_inworld-0.2.10-py3-none-any.whl.
File metadata
- Download URL: vision_agents_plugins_inworld-0.2.10-py3-none-any.whl
- Upload date:
- Size: 10.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d353308d0998f6ff5b1fad6d83f5c646aff4e31edd85ebee4a8f63aa7b62a1cc
|
|
| MD5 |
bee956f7761e1460e4de20ad8003793b
|
|
| BLAKE2b-256 |
f5a74a7b5237d76af49da2adfa4afe27210b33cb29911f0546d69301f91ff677
|