Skip to main content

Python SDK for Akool Streaming Avatar WebSocket API

Project description

Akool Streaming Avatar SDK

A Python SDK for connecting to Akool's streaming avatar services via WebSocket. This SDK provides easy-to-use interfaces for real-time audio streaming, automatic load balancing, and robust error handling.

Features

  • 🔐 Token-based Authentication - Secure API key authentication
  • ⚖️ Automatic Load Balancing - Service discovery with random server selection
  • 🔄 Retry Logic - Automatic reconnection with configurable retry attempts
  • 🎵 Audio Streaming - Real-time PCM audio transmission
  • 💓 Built-in Heartbeat - Automatic connection health monitoring
  • 🛡️ Error Handling - Comprehensive exception handling with user-friendly messages
  • 📝 Event-driven - Callback-based architecture for real-time responses

Installation

pip install akool-streaming-avatar

Quick Start

Method 1: Manual Connection Management

import asyncio
from akool_streaming_avatar import StreamingAvatarClient

async def main():
    # Initialize client
    client = StreamingAvatarClient(
        api_key="your-api-key",
        discovery_url="https://api.akool.com/streamingAvatar/service_status",
        avatar_id="your-avatar-id"
    )
    
    try:
        # Connect to service
        await client.connect()
        
        # Set Agora parameters (required)
        await client.set_agora_params(
            agora_app_id="your-agora-app-id",
            agora_channel="your-channel",
            agora_token="your-rtc-token",
            agora_uid="12345"
        )
        
        # Send audio data
        audio_data = b"..."  # PCM 16-bit, 16kHz, mono
        await client.send_audio(audio_data)
        
    finally:
        # Always disconnect when done
        await client.disconnect()

# Run the client
asyncio.run(main())

Method 2: Using Async Context Manager (Recommended)

import asyncio
from akool_streaming_avatar import AsyncStreamingAvatarClient

async def main():
    # Using async context manager for automatic connection handling
    async with AsyncStreamingAvatarClient(
        api_key="your-api-key",
        discovery_url="https://api.akool.com/streamingAvatar/service_status",
        avatar_id="your-avatar-id"
    ) as client:
        # Client is automatically connected
        
        # Set Agora parameters (required)
        await client.set_agora_params(
            agora_app_id="your-agora-app-id",
            agora_channel="your-channel",
            agora_token="your-rtc-token",
            agora_uid="12345"
        )
        
        # Send audio data
        audio_data = b"..."  # PCM 16-bit, 16kHz, mono
        await client.send_audio(audio_data)
        
    # Client is automatically disconnected when exiting the context

# Run the client
asyncio.run(main())

Environment Configuration

The SDK supports different environment configurations through the env parameter:

Test Environment (Default)

client = StreamingAvatarClient(
    api_key="your-api-key",
    discovery_url="https://api.akool.com/streamingAvatar/service_status",
    avatar_id="your-avatar-id",
    env="test"  # Default value
)
  • Service Discovery URL: /streamingAvatar/service_status?group_name=WEBSOCKET_GROUP_TEST
  • Use Case: Development, testing, and staging environments

Production Environment

client = StreamingAvatarClient(
    api_key="your-api-key",
    discovery_url="https://api.akool.com/streamingAvatar/service_status",
    avatar_id="your-avatar-id",
    env="prod"
)
  • Service Discovery URL: /streamingAvatar/service_status
  • Use Case: Production deployments

Environment Parameter Details

  • Default: "test" (for backward compatibility)
  • Valid Values: "test" or "prod"
  • Test Environment: Adds group_name=WEBSOCKET_GROUP_TEST parameter to service discovery
  • Production Environment: Uses standard service discovery without additional parameters

API Reference

StreamingAvatarClient

Main client class for connecting to Akool streaming avatar services.

Constructor

StreamingAvatarClient(
    api_key: str,
    discovery_url: str,
    avatar_id: str,
    session_id: Optional[str] = None,
    max_retry_attempts: int = 3,
    heartbeat_interval: int = 30,
    connection_timeout: int = 30,
    discovery_timeout: int = 10,
    env: str = "test"
)

Parameters:

  • api_key: Your Akool API key
  • discovery_url: Service discovery endpoint URL
  • avatar_id: Target avatar identifier
  • session_id: Session ID (will be generated if not provided)
  • max_retry_attempts: Maximum retry attempts (default: 3)
  • heartbeat_interval: Heartbeat interval in seconds (default: 30)
  • connection_timeout: Connection timeout in seconds (default: 30)
  • discovery_timeout: Service discovery timeout in seconds (default: 10)
  • env: Environment parameter ("test" or "prod"), defaults to "test"

Methods

connect()
async def connect() -> None

Establish connection to the streaming avatar service with automatic service discovery and retry logic.

Raises:

  • AuthenticationError: Invalid API key
  • ServiceDiscoveryError: No available services
  • ConnectionError: Connection failed
  • RetryError: All retry attempts exhausted
disconnect()
async def disconnect() -> None

Close the WebSocket connection gracefully.

set_agora_params()
async def set_agora_params(
    agora_app_id: str,
    agora_channel: str,
    agora_token: str,
    agora_uid: str,
    voice_id: Optional[str] = None,
    language: str = "en",
    background_url: Optional[str] = None
) -> None

Configure Agora RTC parameters (required before sending audio).

Parameters:

  • agora_app_id: Agora application ID
  • agora_channel: Agora channel name
  • agora_token: Agora RTC token
  • agora_uid: Agora user ID
  • voice_id: Voice ID for the avatar (optional)
  • language: Language code (default: "en")
  • background_url: Background image/video URL (optional)

Raises:

  • ConnectionError: Not connected to server
  • AudioStreamError: If setting parameters fails
send_audio()
async def send_audio(audio_data: bytes) -> None

Send PCM audio data to the avatar.

Parameters:

  • audio_data: PCM audio bytes (16-bit, 16kHz, mono)

Raises:

  • ConnectionError: Not connected to server
  • AudioStreamError: Audio transmission failed
send_audio_stream()
async def send_audio_stream(audio_chunks: List[bytes]) -> None

Send multiple audio chunks in sequence.

Parameters:

  • audio_chunks: List of PCM audio data chunks

Raises:

  • ConnectionError: Not connected to server
  • AudioStreamError: If sending audio fails
interrupt()
async def interrupt() -> None

Send interrupt command to stop current avatar response.

Raises:

  • ConnectionError: If not connected
Event Handlers

The client supports event-driven programming through callback properties:

# Set event handlers
client.on_connected = lambda: print("Connected!")
client.on_disconnected = lambda: print("Disconnected!")
client.on_agora_connected = lambda channel: print(f"Agora connected: {channel}")
client.on_error = lambda error: print(f"Error: {error}")
client.on_message = lambda message: print(f"Message: {message}")

Available event handlers:

  • on_connected: Called when WebSocket connection is established
  • on_disconnected: Called when connection is lost
  • on_agora_connected: Called when Agora connection is established (receives channel ID)
  • on_error: Called when errors occur (receives Exception)
  • on_message: Called when receiving messages from server (receives Dict)
get_connection_info()
def get_connection_info() -> Dict[str, Any]

Get information about the current connection.

Returns:

  • Dictionary with connection information including status, session ID, service details
Connection Status Properties

The client provides several properties to check connection status:

# Check WebSocket connection status
if client.is_connected:
    print("✅ WebSocket connected")

# Check Agora connection status  
if client.is_agora_connected:
    print("✅ Agora connected")

# Get detailed connection information
connection_info = client.get_connection_info()
print(f"📊 Connection details: {connection_info}")

Connection Status Properties:

  • client.is_connected: Boolean indicating if WebSocket is connected
  • client.is_agora_connected: Boolean indicating if Agora connection is established
  • client.session_id: Current session ID string

Monitoring Connection Status

Real-time Connection Monitoring

import asyncio
from akool_streaming_avatar import StreamingAvatarClient

async def monitor_connection():
    client = StreamingAvatarClient(
        api_key="your-api-key",
        discovery_url="https://api.akool.com/streamingAvatar/service_status", 
        avatar_id="your-avatar-id"
    )
    
    # Set up event handlers for real-time monitoring
    def on_connected():
        print("🔗 WebSocket connected!")
        
    def on_agora_connected(channel_id: str):
        print(f"🎥 Agora connected to channel: {channel_id}")
        
    def on_message_received(message):
        msg_type = message.get("type")
        print(f"📨 Received message: {msg_type}")
        
        # Handle different message types
        if msg_type == "system":
            handle_system_message(message)
        elif msg_type == "ack":
            handle_acknowledgment(message)
        elif msg_type == "error":
            handle_error_message(message)
            
    def on_error_occurred(error):
        print(f"❌ Error: {error}")
        
    def on_disconnected():
        print("🔌 Connection lost!")
    
    # Register event handlers
    client.on_connected = on_connected
    client.on_agora_connected = on_agora_connected
    client.on_message = on_message_received
    client.on_error = on_error_occurred
    client.on_disconnected = on_disconnected
    
    try:
        # Connect to service
        await client.connect()
        print(f"✅ Initial connection: {client.is_connected}")
        
        # Set Agora parameters and wait for connection
        await client.set_agora_params(
            agora_app_id="your-agora-app-id",
            agora_channel="your-channel", 
            agora_token="your-rtc-token",
            agora_uid="12345"
        )
        
        # Check status after Agora setup
        print(f"🎥 Agora connected: {client.is_agora_connected}")
        
        # Monitor connection for 30 seconds
        for i in range(6):
            await asyncio.sleep(5)
            
            # Check connection health
            info = client.get_connection_info()
            print(f"📊 Status check {i+1}: WebSocket={info['connected']}, "
                  f"Agora={info['agora_connected']}")
            
    finally:
        await client.disconnect()

def handle_system_message(message):
    """Handle system messages from the server."""
    pld = message.get("pld", {})
    status = pld.get("status")
    
    if status == "websocket_connected":
        print("✅ WebSocket connection confirmed by server")
    elif status == "agora_connected": 
        channel_id = pld.get("channel_id", "")
        print(f"✅ Agora connection confirmed: {channel_id}")
    else:
        print(f"ℹ️ System status: {status}")

def handle_acknowledgment(message):
    """Handle acknowledgment messages."""
    pld = message.get("pld", {})
    original_mid = pld.get("original_mid")
    print(f"✅ Command acknowledged: {original_mid}")

def handle_error_message(message):
    """Handle error messages from server."""
    pld = message.get("pld", {})
    error_code = pld.get("error_code")
    error_message = pld.get("error_message", "Unknown error")
    print(f"❌ Server error {error_code}: {error_message}")

asyncio.run(monitor_connection())

Polling Connection Status

async def check_connection_status(client):
    """Periodically check connection status."""
    while True:
        try:
            # Get current status
            info = client.get_connection_info()
            
            print(f"📊 Connection Status:")
            print(f"   WebSocket: {'✅' if info['connected'] else '❌'}")
            print(f"   Agora: {'✅' if info['agora_connected'] else '❌'}")
            print(f"   Session: {info['session_id']}")
            print(f"   Service: {info['current_service']}")
            
            # Wait before next check
            await asyncio.sleep(10)
            
        except Exception as e:
            print(f"❌ Status check failed: {e}")
            break

WebSocket Message Types

The SDK receives various message types from the server. Here are the common message types and their meanings:

System Messages

{
    "v": 2,
    "type": "system", 
    "mid": "system-123",
    "pld": {
        "status": "websocket_connected" | "agora_connected",
        "channel_id": "channel-name"  // Only for agora_connected
    }
}

Acknowledgment Messages

{
    "v": 2,
    "type": "ack",
    "mid": "ack-123", 
    "pld": {
        "original_mid": "cmd-set-params-456",
        "status": "success"
    }
}

Error Messages

{
    "v": 2,
    "type": "error",
    "mid": "error-123",
    "pld": {
        "error_code": 4001,
        "error_message": "Authentication failed"
    }
}

Pong Messages (Heartbeat Response)

{
    "v": 2,
    "type": "pong",
    "mid": "pong-123",
    "pld": {}
}

Advanced Usage

Manual Connection Management

client = StreamingAvatarClient(api_key, discovery_url, avatar_id)

try:
    await client.connect()
    await client.set_agora_params(
        agora_app_id=agora_app_id,
        agora_channel=channel,
        agora_token=rtc_token,
        agora_uid=user_id
    )
    
    # Your audio streaming logic here
    while streaming:
        audio_data = get_audio_data()
        await client.send_audio(audio_data)
        
finally:
    await client.disconnect()

Event Callbacks

def on_message_received(message):
    print(f"Received: {message}")

def on_error_occurred(error):
    print(f"Error: {error}")

def on_connection_lost():
    print("Connection lost")

# Set event handlers
client.on_message = on_message_received
client.on_error = on_error_occurred
client.on_disconnected = on_connection_lost

Custom Retry Configuration

client = StreamingAvatarClient(
    api_key="your-api-key",
    discovery_url="https://api.akool.com/streamingAvatar/service_status",
    avatar_id="your-avatar-id",
    max_retry_attempts=5,  # Try up to 5 times
    heartbeat_interval=15  # Ping every 15 seconds
)

Error Handling

The SDK provides specific exception types for different error scenarios:

from akool_streaming_avatar.exceptions import (
    AuthenticationError,
    ServiceDiscoveryError,
    ConnectionError,
    AudioStreamError,
    RetryError,
    ConfigurationError
)

try:
    await client.connect()
except AuthenticationError:
    print("Invalid API key")
except ServiceDiscoveryError:
    print("No available services")
except ConnectionError:
    print("Failed to connect")
except RetryError:
    print("All retry attempts failed")

Audio Format Requirements

The SDK expects audio data in the following format:

  • Format: PCM (uncompressed)
  • Sample Rate: 16,000 Hz
  • Bit Depth: 16-bit
  • Channels: Mono (1 channel)
  • Encoding: Little-endian

Converting Audio

import wave
import numpy as np

# Load and convert audio file
with wave.open('input.wav', 'rb') as wav_file:
    frames = wav_file.readframes(-1)
    sound_info = np.frombuffer(frames, dtype=np.int16)
    
    # Convert to mono if stereo
    if wav_file.getnchannels() == 2:
        sound_info = sound_info.reshape(-1, 2).mean(axis=1).astype(np.int16)
    
    # Resample to 16kHz if needed
    # (use librosa or scipy for resampling)
    
    # Send to avatar
    await client.send_audio(sound_info.tobytes())

Service Discovery

The SDK automatically discovers available streaming avatar services by calling the service status endpoint. The discovery process:

  1. Makes HTTP GET request to discovery_url
  2. Parses the response to extract available services
  3. Randomly selects a service for load balancing
  4. Attempts WebSocket connection
  5. Falls back to other services if connection fails

Expected Service Response Format

{
    "success": true,
    "data": {
        "services": [
            {
                "host": "avatar1.akool.com",
                "port": 8080,
                "status": "active"
            },
            {
                "host": "avatar2.akool.com", 
                "port": 8080,
                "status": "active"
            }
        ]
    }
}

WebSocket Protocol

The SDK uses WebSocket protocol version 2 with the following message format:

Authentication

{
    "type": "auth",
    "token": "your-api-key",
    "avatar_id": "your-avatar-id"
}

Agora Configuration

{
    "type": "agora_params",
    "data": {
        "channel": "channel-name",
        "userId": 12345,
        "rtcToken": "agora-rtc-token"
    }
}

Audio Data

{
    "type": "audio_data",
    "data": "base64-encoded-pcm-audio"
}

Heartbeat

{
    "type": "ping"
}

Development

Setting up Development Environment

git clone https://github.com/akoolteam/akool-streaming-avatar-sdk.git
cd akool-streaming-avatar-sdk
pip install -e ".[dev]"

Running Tests

pytest tests/

Code Formatting

black akool_streaming_avatar/
flake8 akool_streaming_avatar/

Examples

See the examples/ directory for more usage examples:

  • basic_usage.py - Simple connection and audio streaming
  • error_handling.py - Comprehensive error handling patterns

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests
  5. Run the test suite
  6. Submit a pull request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

For support, please contact support@akool.com or visit our documentation.

Changelog

v0.1.0 (2024-01-XX)

  • Initial release
  • Basic WebSocket connection and authentication
  • Service discovery and load balancing
  • Audio streaming support
  • Retry logic and error handling
  • Event callback system

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

akool_streaming_avatar-0.1.8.tar.gz (25.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

akool_streaming_avatar-0.1.8-py3-none-any.whl (18.9 kB view details)

Uploaded Python 3

File details

Details for the file akool_streaming_avatar-0.1.8.tar.gz.

File metadata

  • Download URL: akool_streaming_avatar-0.1.8.tar.gz
  • Upload date:
  • Size: 25.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for akool_streaming_avatar-0.1.8.tar.gz
Algorithm Hash digest
SHA256 fd3ea2d8e64aa771d0c03e76a762d7a01d5d631bf7786148232ea94ee0cacdda
MD5 dcfdcb9c5eacce2bde58870f67852f25
BLAKE2b-256 e33dcec331d0e47299ca314873c7a24a93386922538695aa5d96cc6499ca5715

See more details on using hashes here.

File details

Details for the file akool_streaming_avatar-0.1.8-py3-none-any.whl.

File metadata

File hashes

Hashes for akool_streaming_avatar-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 7786180ab76013fa3884c309fe8b0911172fc74e9f3cd3aed99ee2d431faf774
MD5 2cbf0ce7a1aacd3c6965fe7a04e9eef3
BLAKE2b-256 843b3252294501e540c43748b9657311d8ed22f885e5faeae2aaf95821e9e76a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page