XAI for stream agents

These details have not been verified by PyPI

Project links

Project description

XAI Plugin for Stream Agents

This package provides xAI (Grok) integration for the Stream Agents ecosystem, enabling you to use xAI's powerful language models in your conversational AI applications.

Features

Native xAI SDK Integration: Full access to xAI's chat completion and streaming APIs
Conversation Memory: Automatic conversation history management
Streaming Support: Real-time response streaming with standardized events
Multimodal Support: Handle text and image inputs
Event System: Subscribe to response events for custom handling
Easy Integration: Drop-in replacement for other LLM providers

Installation

uv add "vision-agents[xai]"
# or directly
uv add vision-agents-plugins-xai

Quick Start

import asyncio
from vision_agents.plugins import xai

async def main():
    # Initialize with your xAI API key
    llm = xai.LLM(
        model="grok-4",
        api_key="your_xai_api_key"  # or set XAI_API_KEY environment variable
    )

    # Simple response
    response = await llm.simple_response("Explain quantum computing in simple terms")

    print(f"\n\nComplete response: {response.text}")

if __name__ == "__main__":
    asyncio.run(main())

Advanced Usage

Conversation with Memory

from vision_agents.plugins import xai

llm = xai.LLM(model="grok-4", api_key="your_api_key")

# First message
await llm.simple_response("My name is Alice and I have 2 cats")

# Second message - the LLM remembers the context
response = await llm.simple_response("How many pets do I have?")
print(response.text)  # Will mention the 2 cats

Using Instructions

llm = LLM(
    model="grok-4",
    api_key="your_api_key"
)

# Create a response with system instructions
response = await llm.create_response(
    input="Tell me about the weather",
    instructions="You are a helpful weather assistant. Always be cheerful and optimistic.",
    stream=True
)

Multimodal Input

# Handle complex multimodal messages
advanced_message = [
    {
        "role": "user",
        "content": [
            {"type": "input_text", "text": "What do you see in this image?"},
            {"type": "input_image", "image_url": "https://example.com/image.jpg"},
        ],
    }
]

messages = LLM._normalize_message(advanced_message)
# Use with your conversation system

API Reference

XAILLM Class

Constructor

LLM(
    model: str = "grok-4",
    api_key: Optional[str] = None,
    client: Optional[AsyncClient] = None
)

Parameters:

model: xAI model to use (default: "grok-4")
api_key: Your xAI API key (default: reads from XAI_API_KEY environment variable)
client: Optional pre-configured xAI AsyncClient

Methods

`async simple_response(text: str, processors=None, participant=None)`

Generate a simple response to text input.

Parameters:

text: Input text to respond to
processors: Optional list of processors for video/voice AI context
participant: Optional participant object

Returns: LLMResponseEvent[Response] with the generated text

`async create_response(input: str, instructions: str = "", model: str = None, stream: bool = True)`

Create a response with full control over parameters.

Parameters:

input: Input text
instructions: System instructions for the model
model: Override the default model
stream: Whether to stream the response (default: True)

Returns: LLMResponseEvent[Response] with the generated text

Configuration

Environment Variables

XAI_API_KEY: Your xAI API key (required if not provided in constructor)

Text-to-Speech (TTS)

The plugin also ships an xai.TTS class powered by xAI's Grok Voice API. It provides five expressive voices with inline speech tags for fine-grained delivery control.

Usage

from vision_agents.plugins import xai

# Default voice (eve) — energetic, upbeat
tts = xai.TTS()

# Specify a voice
tts = xai.TTS(voice="ara")   # warm, friendly
tts = xai.TTS(voice="leo")   # authoritative, strong
tts = xai.TTS(voice="rex")   # confident, clear
tts = xai.TTS(voice="sal")   # smooth, balanced

# Custom output format
tts = xai.TTS(
    voice="rex",
    codec="mp3",
    sample_rate=44100,
    bit_rate=192000,
)

# Explicit API key (otherwise reads XAI_API_KEY env var)
tts = xai.TTS(api_key="xai-your-key-here")

Configuration

Parameter	Type	Default	Description
`api_key`	str	env var	xAI API key. Falls back to `XAI_API_KEY` environment variable.
`voice`	str	`"eve"`	Voice ID: `"eve"`, `"ara"`, `"leo"`, `"rex"`, or `"sal"`.
`language`	str	`"en"`	BCP-47 language code or `"auto"` for detection.
`codec`	str	`"pcm"`	Output codec: `"pcm"`, `"mp3"`, `"wav"`, `"mulaw"`, `"alaw"`.
`sample_rate`	int	`24000`	Sample rate: `8000`–`48000` Hz.
`bit_rate`	int	`None`	MP3 bit rate (only used with `codec="mp3"`).
`base_url`	str	`None`	Override the xAI TTS API endpoint.
`session`	object	`None`	Optional pre-existing `aiohttp.ClientSession`.

Voices

Voice	Tone	Best For
`eve`	Energetic, upbeat	Demos, announcements, upbeat content (default)
`ara`	Warm, friendly	Conversational interfaces, hospitality
`leo`	Authoritative, strong	Instructional, educational, healthcare
`rex`	Confident, clear	Business, corporate, customer support
`sal`	Smooth, balanced	Versatile — works for any context

Speech tags

Add expressiveness to synthesized speech with inline and wrapping tags:

Inline tags (placed where the expression should occur):

Pauses: [pause] [long-pause] [hum-tune]
Laughter: [laugh] [chuckle] [giggle] [cry]
Mouth sounds: [tsk] [tongue-click] [lip-smack]
Breathing: [breath] [inhale] [exhale] [sigh]

Wrapping tags (wrap text to change delivery):

Volume: <soft>text</soft> <loud>text</loud> <shout>text</shout>
Pitch/speed: <high-pitch>text</high-pitch> <low-pitch>text</low-pitch> <slow>text</slow> <fast>text</fast>
Style: <whisper>text</whisper> <sing>text</sing>

MP3 output

MP3 decoding requires pydub. Install it via the mp3 extra:

uv add "vision-agents-plugins-xai[mp3]"

Requirements

Python 3.10+
xai-sdk
vision-agents-core
Optional: pydub (for MP3 decoding via the mp3 extra)

License

Apache-2.0

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.5.9

May 15, 2026

This version

0.5.8

May 13, 2026

0.5.7

May 7, 2026

0.5.6

May 5, 2026

0.5.5

Apr 27, 2026

0.5.4

Apr 15, 2026

0.5.3

Apr 14, 2026

0.5.2

Apr 13, 2026

0.5.1

Apr 7, 2026

0.5.0

Apr 1, 2026

0.4.7

Mar 27, 2026

0.4.6

Mar 26, 2026

0.4.5

Mar 25, 2026

0.4.4

Mar 23, 2026

0.4.3

Mar 11, 2026

0.4.2

Mar 10, 2026

0.4.1

Mar 4, 2026

0.4.0

Mar 3, 2026

0.3.8

Feb 24, 2026

0.3.7

Feb 23, 2026

0.3.6

Feb 13, 2026

0.3.5

Feb 10, 2026

0.3.4

Feb 6, 2026

0.3.3

Feb 4, 2026

0.3.2

Jan 27, 2026

0.3.0

Jan 20, 2026

0.2.10

Jan 14, 2026

0.2.9

Jan 9, 2026

0.2.8

Jan 8, 2026

0.2.7

Jan 7, 2026

0.2.6

Dec 16, 2025

0.2.5

Dec 12, 2025

0.2.4

Dec 12, 2025

0.2.3

Dec 7, 2025

0.2.2

Nov 29, 2025

0.2.1

Nov 21, 2025

0.2.0

Nov 14, 2025

0.1.14

Nov 11, 2025

0.1.13

Nov 3, 2025

0.1.12

Oct 31, 2025

0.1.11

Oct 28, 2025

0.1.9

Oct 22, 2025

0.1.8

Oct 22, 2025

0.1.7

Oct 21, 2025

0.1.6

Oct 16, 2025

0.1.5

Oct 9, 2025

0.0.18

Oct 8, 2025

0.0.17

Oct 8, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_xai-0.5.8.tar.gz (18.5 kB view details)

Uploaded May 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vision_agents_plugins_xai-0.5.8-py3-none-any.whl (20.6 kB view details)

Uploaded May 13, 2026 Python 3

File details

Details for the file vision_agents_plugins_xai-0.5.8.tar.gz.

File metadata

Download URL: vision_agents_plugins_xai-0.5.8.tar.gz
Upload date: May 13, 2026
Size: 18.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_xai-0.5.8.tar.gz
Algorithm	Hash digest
SHA256	`514f26423241e6a2f96ef6f8607772b9b33e84bfb08ef67bae05f1a595c7dc20`
MD5	`d27b322871c849cb862872f9db57aeec`
BLAKE2b-256	`c7c9a61e1792349cea6cb03fc8f01a837b6f1750821aa3fdd2ba1e5568466f81`

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_xai-0.5.8-py3-none-any.whl.

File metadata

Download URL: vision_agents_plugins_xai-0.5.8-py3-none-any.whl
Upload date: May 13, 2026
Size: 20.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.6 {"installer":{"name":"uv","version":"0.10.6","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_xai-0.5.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`718d3489c4d3511d0ac7fb096692605a421547fc2da6928ec740a2675740cc47`
MD5	`dd88296800152210249d1ec4d0cd949e`
BLAKE2b-256	`139acc4104d9b6ae3daa3723eb3c6652d82cc98866183047a0b7bfb9da34baa9`

See more details on using hashes here.

vision-agents-plugins-xai 0.5.8

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

XAI Plugin for Stream Agents

Features

Installation

Quick Start

Advanced Usage

Conversation with Memory

Using Instructions

Multimodal Input

API Reference

XAILLM Class

Constructor

Methods

async simple_response(text: str, processors=None, participant=None)

async create_response(input: str, instructions: str = "", model: str = None, stream: bool = True)

Configuration

Environment Variables

Text-to-Speech (TTS)

Usage

Configuration

Voices

Speech tags

MP3 output

Requirements

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`async simple_response(text: str, processors=None, participant=None)`

`async create_response(input: str, instructions: str = "", model: str = None, stream: bool = True)`