Python SDK for avatar WebSocket services with audio streaming and animation frame reception

These details have not been verified by PyPI

Project description

Avatar SDK Python

A Python SDK for connecting to avatar services via WebSocket, supporting audio streaming and receiving animation frames.

Quick Start

import asyncio
from datetime import datetime, timedelta, timezone
from avatarkit import new_avatar_session

async def main():
    # Create session
    session = new_avatar_session(
        api_key="your-api-key",
        app_id="your-app-id",
        console_endpoint_url="https://console.us-west.spatialwalk.cloud/v1/console",
        ingress_endpoint_url="https://api.us-west.spatialwalk.cloud/v2/driveningress",
        avatar_id="your-avatar-id",
        expire_at=datetime.now(timezone.utc) + timedelta(minutes=5),
        transport_frames=lambda frame, last: print(f"Received frame: {len(frame)} bytes"),
        on_error=lambda err: print(f"Error: {err}"),
        on_close=lambda: print("Session closed")
    )

    # Initialize and connect
    await session.init()
    connection_id = await session.start()
    print(f"Connected: {connection_id}")

    # Send audio
    audio_data = b"..."  # Your PCM audio data
    request_id = await session.send_audio(audio_data, end=True)
    print(f"Sent audio: {request_id}")

    # Wait for frames...
    await asyncio.sleep(10)

    # Close
    await session.close()

if __name__ == "__main__":
    asyncio.run(main())

Detailed Usage

Session Configuration

The SDK provides two ways to configure a session:

Option 1: Using `new_avatar_session()` (Recommended)

from avatarkit import new_avatar_session

session = new_avatar_session(
    avatar_id="avatar-123",
    api_key="your-api-key",
    app_id="your-app-id",
    # For web-style auth, set use_query_auth=True to put (appId, sessionKey)
    # in the websocket URL query params instead of headers.
    use_query_auth=False,
    expire_at=datetime.now(timezone.utc) + timedelta(minutes=5),
    console_endpoint_url="https://console.us-west.spatialwalk.cloud/v1/console",
    ingress_endpoint_url="https://api.us-west.spatialwalk.cloud/v2/driveningress",
    sample_rate=16000,  # Default: 16000 Hz
    transport_frames=on_frame_received,
    on_error=on_error,
    on_close=on_close
)

Option 2: Using Configuration Builder

from avatarkit import SessionConfigBuilder, AvatarSession

config = (SessionConfigBuilder()
    .with_avatar_id("avatar-123")
    .with_api_key("your-api-key")
    .with_app_id("your-app-id")
    .with_console_endpoint_url("https://console.us-west.spatialwalk.cloud/v1/console")
    .with_ingress_endpoint_url("https://api.us-west.spatialwalk.cloud/v2/driveningress")
    .with_expire_at(datetime.now(timezone.utc) + timedelta(minutes=5))
    .with_transport_frames(on_frame_received)
    .build())

session = AvatarSession(config)

Session Lifecycle

# 1. Initialize (get session token)
await session.init()

# 2. Start WebSocket connection
connection_id = await session.start()

# 3. Send audio data
request_id = await session.send_audio(audio_bytes, end=True)

# 4. Receive frames via callback
# (automatically handled in background)

# 5. Close session
await session.close()

Audio Format

The SDK currently supports mono 16-bit PCM (s16le) audio:

Sample Rate: one of [8000, 16000, 22050, 24000, 32000, 44100, 48000]
Channels: 1 (mono)
Bit Depth: 16-bit
Format: Raw PCM bytes

# Example: Load PCM audio file
with open("audio.pcm", "rb") as f:
    audio_data = f.read()

# Send in chunks or all at once
await session.send_audio(audio_data, end=True)

LiveKit Egress Mode

When configured with livekit_egress, audio and animation data are streamed to a LiveKit room via the egress service instead of being returned through the WebSocket connection.

from avatarkit import new_avatar_session, LiveKitEgressConfig

session = new_avatar_session(
    avatar_id="avatar-123",
    api_key="your-api-key",
    app_id="your-app-id",
    console_endpoint_url="https://console.example.com/v1/console",
    ingress_endpoint_url="https://api.example.com/v2/driveningress",
    expire_at=datetime.now(timezone.utc) + timedelta(minutes=5),
    livekit_egress=LiveKitEgressConfig(
        url="wss://livekit.example.com",
        api_key="livekit-api-key",
        api_secret="livekit-api-secret",
        room_name="my-room",
        publisher_id="avatar-publisher",
    ),
)

When LiveKit egress is enabled:

The server streams output to the specified LiveKit room
The transport_frames callback will not be invoked
Audio and animation data are published to the room under the specified publisher ID

Interrupt (LiveKit Egress Only)

The interrupt() method sends an interrupt signal to stop current audio processing. This is only available when using LiveKit egress mode.

# Send audio
request_id = await session.send_audio(audio_data, end=True)

# Later, if you need to interrupt (e.g., user wants to stop playback)
interrupted_id = await session.interrupt()
print(f"Interrupted request: {interrupted_id}")

The interrupt uses the most recent request ID, even after end=True was sent. This allows interrupting requests that have finished sending audio but are still being processed by the server.

Callbacks

Transport Frames Callback

Receives animation frames from the server:

def on_frame_received(frame_data: bytes, is_last: bool):
    print(f"Received frame: {len(frame_data)} bytes")
    if is_last:
        print("This is the last frame")
    # Process frame_data (contains serialized Message protobuf)

Error Callback

Handles errors from the session:

def on_error(error: Exception):
    print(f"Session error: {error}")

Close Callback

Called when the session closes:

def on_close():
    print("Session has been closed")

API Reference

AvatarSession

Main class for managing avatar sessions.

Methods

async init() - Initialize session and obtain token
async start() -> str - Start WebSocket connection, returns connection ID
async send_audio(audio: bytes, end: bool = False) -> str - Send audio data, returns request ID
async interrupt() -> str - Interrupt current audio processing (LiveKit egress mode only), returns interrupted request ID
async close() - Close the session and clean up resources
config -> SessionConfig - Get session configuration (property)

SessionConfig

Configuration dataclass for avatar sessions.

Fields

avatar_id: str - Avatar identifier
api_key: str - API key for authentication
app_id: str - Application identifier
use_query_auth: bool - Send websocket auth via query params (web) instead of headers (mobile)
expire_at: datetime - Session expiration time
sample_rate: int - Audio sample rate (default: 16000)
bitrate: int - Audio bitrate (default: 0; PCM typically uses 0)
transport_frames: Callable[[bytes, bool], None] - Frame callback
on_error: Callable[[Exception], None] - Error callback
on_close: Callable[[], None] - Close callback
console_endpoint_url: str - Console API URL
ingress_endpoint_url: str - Ingress WebSocket URL
livekit_egress: Optional[LiveKitEgressConfig] - LiveKit egress configuration

LiveKitEgressConfig

Configuration for streaming to a LiveKit room.

Fields

url: str - LiveKit server URL (e.g., wss://livekit.example.com)
api_key: str - LiveKit API key
api_secret: str - LiveKit API secret
room_name: str - LiveKit room name to join
publisher_id: str - Publisher identity in the room

SessionConfigBuilder

Builder for constructing SessionConfig with fluent interface.

Methods

All methods return self for chaining:

with_avatar_id(avatar_id: str)
with_api_key(api_key: str)
with_app_id(app_id: str)
with_use_query_auth(use_query_auth: bool)
with_expire_at(expire_at: datetime)
with_sample_rate(sample_rate: int)
with_bitrate(bitrate: int)
with_transport_frames(handler: Callable)
with_on_error(handler: Callable)
with_on_close(handler: Callable)
with_console_endpoint_url(url: str)
with_ingress_endpoint_url(url: str)
with_livekit_egress(config: LiveKitEgressConfig)
build() -> SessionConfig - Build the configuration

Utility Functions

generate_log_id() -> str - Generate unique log ID in format "YYYYMMDDHHMMSS_<nanoid>"

Exceptions

SessionTokenError - Raised when session token request fails

Examples

See the examples directory for complete working examples:

single_audio_clip - Basic usage with a single audio file
http_service - Simple HTTP API that returns PCM audio (by sample rate) and generated animation Message binaries

Protocol Buffers

The SDK uses Protocol Buffers for efficient serialization. The proto definitions are in proto/message.proto.

Generating Proto Code

Proto code is generated using buf:

cd proto
buf generate

The generated Python code is placed in src/avatarkit/proto/generated/.

Message Types

MESSAGE_CLIENT_CONFIGURE_SESSION (1) - Client session negotiation parameters
MESSAGE_SERVER_CONFIRM_SESSION (2) - Server confirms and returns connection_id
MESSAGE_CLIENT_AUDIO_INPUT (3) - Client audio input
MESSAGE_SERVER_ERROR (4) - Server-side error message
MESSAGE_SERVER_RESPONSE_ANIMATION (5) - Server animation response (end indicates final)
MESSAGE_CLIENT_INTERRUPT (7) - Client interrupt signal to stop processing

Development

Setup

# Install uv if not already installed
curl -LsSf https://astral.sh/uv/install.sh | sh

# Clone and setup
git clone <repository-url>
cd avatar-sdk-python
uv sync

License

See LICENSE for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.7

Mar 22, 2026

0.1.6

Mar 19, 2026

0.1.5

Mar 7, 2026

0.1.4

Feb 24, 2026

This version

0.1.3

Jan 31, 2026

0.1.2

Jan 28, 2026

0.1.1

Jan 19, 2026

0.1.0

Jan 19, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

avatarkit-0.1.3.tar.gz (13.0 kB view details)

Uploaded Jan 31, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

avatarkit-0.1.3-py3-none-any.whl (16.2 kB view details)

Uploaded Jan 31, 2026 Python 3

File details

Details for the file avatarkit-0.1.3.tar.gz.

File metadata

Download URL: avatarkit-0.1.3.tar.gz
Upload date: Jan 31, 2026
Size: 13.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for avatarkit-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`5cac2457e66d0d69486c7b43e0d8c4b16b3c29c758d3a18f05e75e012a73f5de`
MD5	`c0d9f9f5ee55d5b57249f88ba56385b9`
BLAKE2b-256	`3bf8bbd50cc7e145e160cd852c9e1341f00a547740e989c9668f54692107cd95`

See more details on using hashes here.

File details

Details for the file avatarkit-0.1.3-py3-none-any.whl.

File metadata

Download URL: avatarkit-0.1.3-py3-none-any.whl
Upload date: Jan 31, 2026
Size: 16.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for avatarkit-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6de6c2803f5b2e0216c33d34306723c6176d9986b420becdc316b7acc633373e`
MD5	`b17c7c53fcb343ffa78e46dc2b36707f`
BLAKE2b-256	`1c4d99ace08190e28c0369ca31bbc6e7e1e330307a2f24a5dfdf452a54871133`

See more details on using hashes here.

avatarkit 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Avatar SDK Python

Quick Start

Detailed Usage

Session Configuration

Option 1: Using new_avatar_session() (Recommended)

Option 2: Using Configuration Builder

Session Lifecycle

Audio Format

LiveKit Egress Mode

Interrupt (LiveKit Egress Only)

Callbacks

Transport Frames Callback

Error Callback

Close Callback

API Reference

AvatarSession

Methods

SessionConfig

Fields

LiveKitEgressConfig

Fields

SessionConfigBuilder

Methods

Utility Functions

Exceptions

Examples

Protocol Buffers

Generating Proto Code

Message Types

Development

Setup

License

Contributing

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Option 1: Using `new_avatar_session()` (Recommended)