Skip to main content

Python SDK for avatar WebSocket services with audio streaming and animation frame reception

Project description

Avatar SDK Python

A Python SDK for connecting to avatar services via WebSocket, supporting audio streaming and receiving animation frames.

Quick Start

import asyncio
from datetime import datetime, timedelta, timezone
from avatarkit import new_avatar_session

async def main():
    # Create session
    session = new_avatar_session(
        api_key="your-api-key",
        app_id="your-app-id",
        console_endpoint_url="https://console.us-west.spatialwalk.cloud/v1/console",
        ingress_endpoint_url="https://api.us-west.spatialwalk.cloud/v2/driveningress",
        avatar_id="your-avatar-id",
        expire_at=datetime.now(timezone.utc) + timedelta(minutes=5),
        transport_frames=lambda frame, last: print(f"Received frame: {len(frame)} bytes"),
        on_error=lambda err: print(f"Error: {err}"),
        on_close=lambda: print("Session closed")
    )
    
    # Initialize and connect
    await session.init()
    connection_id = await session.start()
    print(f"Connected: {connection_id}")
    
    # Send audio
    audio_data = b"..."  # Your PCM audio data
    request_id = await session.send_audio(audio_data, end=True)
    print(f"Sent audio: {request_id}")
    
    # Wait for frames...
    await asyncio.sleep(10)
    
    # Close
    await session.close()

if __name__ == "__main__":
    asyncio.run(main())

Detailed Usage

Session Configuration

The SDK provides two ways to configure a session:

Option 1: Using new_avatar_session() (Recommended)

from avatarkit import new_avatar_session

session = new_avatar_session(
    avatar_id="avatar-123",
    api_key="your-api-key",
    app_id="your-app-id",
    # For web-style auth, set use_query_auth=True to put (appId, sessionKey)
    # in the websocket URL query params instead of headers.
    use_query_auth=False,
    expire_at=datetime.now(timezone.utc) + timedelta(minutes=5),
    console_endpoint_url="https://console.us-west.spatialwalk.cloud/v1/console",
    ingress_endpoint_url="https://api.us-west.spatialwalk.cloud/v2/driveningress",
    sample_rate=16000,  # Default: 16000 Hz
    transport_frames=on_frame_received,
    on_error=on_error,
    on_close=on_close
)

Option 2: Using Configuration Builder

from avatarkit import SessionConfigBuilder, AvatarSession

config = (SessionConfigBuilder()
    .with_avatar_id("avatar-123")
    .with_api_key("your-api-key")
    .with_app_id("your-app-id")
    .with_console_endpoint_url("https://console.us-west.spatialwalk.cloud/v1/console")
    .with_ingress_endpoint_url("https://api.us-west.spatialwalk.cloud/v2/driveningress")
    .with_expire_at(datetime.now(timezone.utc) + timedelta(minutes=5))
    .with_transport_frames(on_frame_received)
    .build())

session = AvatarSession(config)

Session Lifecycle

# 1. Initialize (get session token)
await session.init()

# 2. Start WebSocket connection
connection_id = await session.start()

# 3. Send audio data
request_id = await session.send_audio(audio_bytes, end=True)

# 4. Receive frames via callback
# (automatically handled in background)

# 5. Close session
await session.close()

Audio Format

The SDK currently supports mono 16-bit PCM (s16le) audio:

  • Sample Rate: one of [8000, 16000, 22050, 24000, 32000, 44100, 48000]
  • Channels: 1 (mono)
  • Bit Depth: 16-bit
  • Format: Raw PCM bytes
# Example: Load PCM audio file
with open("audio.pcm", "rb") as f:
    audio_data = f.read()

# Send in chunks or all at once
await session.send_audio(audio_data, end=True)

Callbacks

Transport Frames Callback

Receives animation frames from the server:

def on_frame_received(frame_data: bytes, is_last: bool):
    print(f"Received frame: {len(frame_data)} bytes")
    if is_last:
        print("This is the last frame")
    # Process frame_data (contains serialized Message protobuf)

Error Callback

Handles errors from the session:

def on_error(error: Exception):
    print(f"Session error: {error}")

Close Callback

Called when the session closes:

def on_close():
    print("Session has been closed")

API Reference

AvatarSession

Main class for managing avatar sessions.

Methods

  • async init() - Initialize session and obtain token
  • async start() -> str - Start WebSocket connection, returns connection ID
  • async send_audio(audio: bytes, end: bool = False) -> str - Send audio data, returns request ID
  • async close() - Close the session and clean up resources
  • config -> SessionConfig - Get session configuration (property)

SessionConfig

Configuration dataclass for avatar sessions.

Fields

  • avatar_id: str - Avatar identifier
  • api_key: str - API key for authentication
  • app_id: str - Application identifier
  • use_query_auth: bool - Send websocket auth via query params (web) instead of headers (mobile)
  • expire_at: datetime - Session expiration time
  • sample_rate: int - Audio sample rate (default: 16000)
  • bitrate: int - Audio bitrate (default: 0; PCM typically uses 0)
  • transport_frames: Callable[[bytes, bool], None] - Frame callback
  • on_error: Callable[[Exception], None] - Error callback
  • on_close: Callable[[], None] - Close callback
  • console_endpoint_url: str - Console API URL
  • ingress_endpoint_url: str - Ingress WebSocket URL

SessionConfigBuilder

Builder for constructing SessionConfig with fluent interface.

Methods

All methods return self for chaining:

  • with_avatar_id(avatar_id: str)
  • with_api_key(api_key: str)
  • with_app_id(app_id: str)
  • with_expire_at(expire_at: datetime)
  • with_sample_rate(sample_rate: int)
  • with_transport_frames(handler: Callable)
  • with_on_error(handler: Callable)
  • with_on_close(handler: Callable)
  • with_console_endpoint_url(url: str)
  • with_ingress_endpoint_url(url: str)
  • build() -> SessionConfig - Build the configuration

Utility Functions

  • generate_log_id() -> str - Generate unique log ID in format "YYYYMMDDHHMMSS_<nanoid>"

Exceptions

  • SessionTokenError - Raised when session token request fails

Examples

See the examples directory for complete working examples:

  • single_audio_clip - Basic usage with a single audio file
  • http_service - Simple HTTP API that returns PCM audio (by sample rate) and generated animation Message binaries

Protocol Buffers

The SDK uses Protocol Buffers for efficient serialization. The proto definitions are in proto/message.proto.

Generating Proto Code

Proto code is generated using buf:

cd proto
buf generate

The generated Python code is placed in src/avatarkit/proto/generated/.

Message Types

  • MESSAGE_CLIENT_CONFIGURE_SESSION (1) - Client session negotiation parameters
  • MESSAGE_SERVER_CONFIRM_SESSION (2) - Server confirms and returns connection_id
  • MESSAGE_CLIENT_AUDIO_INPUT (3) - Client audio input
  • MESSAGE_SERVER_ERROR (4) - Server-side error message
  • MESSAGE_SERVER_RESPONSE_ANIMATION (5) - Server animation response (end indicates final)

Development

Setup

# Install uv if not already installed
curl -LsSf https://astral.sh/uv/install.sh | sh

# Clone and setup
git clone <repository-url>
cd avatar-sdk-python
uv sync

License

See LICENSE for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

avatarkit-0.1.1.tar.gz (11.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

avatarkit-0.1.1-py3-none-any.whl (14.2 kB view details)

Uploaded Python 3

File details

Details for the file avatarkit-0.1.1.tar.gz.

File metadata

  • Download URL: avatarkit-0.1.1.tar.gz
  • Upload date:
  • Size: 11.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.26 {"installer":{"name":"uv","version":"0.9.26","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for avatarkit-0.1.1.tar.gz
Algorithm Hash digest
SHA256 76ef46132bea25e842cec679e00e8ffeb95d6d11a6ab2d40018aecdf7f9efc8b
MD5 ead8d3ecb630c35e6b60ab44d7cf7797
BLAKE2b-256 4719abcbc7a54ccde0d5e8555077602c727394d5bcbe90b61c7651a197108598

See more details on using hashes here.

File details

Details for the file avatarkit-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: avatarkit-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 14.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.26 {"installer":{"name":"uv","version":"0.9.26","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for avatarkit-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 97480df364f7c4508111c2f728e8306d5d4b8cd9a155c68af116eae8a8f541b1
MD5 0a2d962007bf443ec3e4acf5586e8545
BLAKE2b-256 8b3d843ce91c652d1e8b160bd207eaf80ecf6a6c4b08acee49a9f4c75c1e87f6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page