Python SDK for Sayna server-side WebSocket connections

These details have not been verified by PyPI

Project links

Project description

Sayna Python SDK

Python SDK for Sayna's real-time voice interaction API. Send audio for speech recognition, receive synthesized speech, and manage voice sessions from your Python applications.

Features

🎤 Speech-to-Text: Real-time transcription with support for multiple providers (Deepgram, Google, etc.)
🔊 Text-to-Speech: High-quality voice synthesis with various TTS providers (ElevenLabs, Google, etc.)
🔌 WebSocket Connection: Async/await support with aiohttp
🌐 REST API: Standalone endpoints for health checks, voice catalog, and TTS synthesis
✅ Type Safety: Full type hints with Pydantic models
🚀 Easy to Use: Simple, intuitive API
📦 Modern Python: Built for Python 3.9+

Installation

pip install sayna-client

Quick Start

import asyncio
from sayna_client import SaynaClient, STTConfig, TTSConfig

async def main():
    # Initialize the client with configs
    client = SaynaClient(
        url="https://api.sayna.ai",
        stt_config=STTConfig(
            provider="deepgram",
            model="nova-2"
        ),
        tts_config=TTSConfig(
            provider="cartesia",
            voice_id="example-voice"
        ),
        api_key="your-api-key"
    )

    # Register callbacks
    client.register_on_stt_result(lambda result: print(f"Transcription: {result.transcript}"))
    client.register_on_tts_audio(lambda audio: print(f"Received {len(audio)} bytes of audio"))

    # Connect and interact
    await client.connect()
    await client.speak("Hello, world!")
    await client.disconnect()

if __name__ == "__main__":
    asyncio.run(main())

API

REST API Methods

These methods use HTTP endpoints and don't require an active WebSocket connection:

`await client.health()`

Performs a health check on the Sayna server.

Returns: HealthResponse - Response object with status field ("OK" when healthy).

Example:

health = await client.health()
print(health.status)  # "OK"

`await client.get_voices()`

Retrieves the catalogue of text-to-speech voices grouped by provider.

Returns: dict[str, list[VoiceDescriptor]] - Dictionary where keys are provider names and values are lists of voice descriptors.

Example:

voices = await client.get_voices()
for provider, voice_list in voices.items():
    print(f"{provider}:", [v.name for v in voice_list])

`await client.speak_rest(text, tts_config)`

Synthesizes text into audio using the REST API. This is a standalone method that doesn't require an active WebSocket connection.

Parameter	Type	Purpose
`text`	`str`	Text to synthesize (must be non-empty).
`tts_config`	`TTSConfig`	Text-to-speech provider configuration.

Returns: tuple[bytes, dict[str, str]] - Tuple of (audio_data, response_headers). Headers include: Content-Type, Content-Length, x-audio-format, x-sample-rate.

Example:

audio_data, headers = await client.speak_rest("Hello, world!", TTSConfig(
    provider="elevenlabs",
    voice_id="21m00Tcm4TlvDq8ikWAM",
    model="eleven_turbo_v2",
    speaking_rate=1.0,
    audio_format="mp3",
    sample_rate=24000,
    connection_timeout=30,
    request_timeout=60,
    pronunciations=[]
))
print(f"Received {len(audio_data)} bytes of {headers['Content-Type']}")

`await client.get_livekit_token(room_name, participant_name, participant_identity)`

Issues a LiveKit access token for a participant.

Parameter	Type	Purpose
`room_name`	`str`	LiveKit room to join or create.
`participant_name`	`str`	Display name for the participant.
`participant_identity`	`str`	Unique identifier for the participant.

Returns: LiveKitTokenResponse - Object containing token, room name, participant identity, and LiveKit URL.

Example:

token_info = await client.get_livekit_token(
    "my-room",
    "John Doe",
    "user-123"
)
print("Token:", token_info.token)
print("LiveKit URL:", token_info.livekit_url)

WebSocket API Methods

These methods require an active WebSocket connection:

`SaynaClient(url, stt_config, tts_config, livekit_config=None, without_audio=False, api_key=None)`

Creates a new SaynaClient instance.

Parameter	Type	Default	Purpose
`url`	`str`	-	Sayna server URL (http://, https://, ws://, or wss://).
`stt_config`	`STTConfig`	-	Speech-to-text provider configuration.
`tts_config`	`TTSConfig`	-	Text-to-speech provider configuration.
`livekit_config`	`LiveKitConfig` (optional)	`None`	Optional LiveKit room configuration.
`without_audio`	`bool`	`False`	Disable audio streaming.
`api_key`	`str` (optional)	`None`	API key for authentication.

`await client.connect()`

Establishes WebSocket connection and sends initial configuration. Resolves when server sends ready message.

`client.register_on_stt_result(callback)`

Registers a callback for speech-to-text transcription results.

Example:

def handle_stt(result):
    print(f"Transcription: {result.transcript}")

client.register_on_stt_result(handle_stt)

`client.register_on_tts_audio(callback)`

Registers a callback for text-to-speech audio data.

Example:

def handle_audio(audio_data: bytes):
    print(f"Received {len(audio_data)} bytes of audio")

client.register_on_tts_audio(handle_audio)

`client.register_on_error(callback)`

Registers a callback for error messages.

`client.register_on_message(callback)`

Registers a callback for participant messages.

`client.register_on_participant_disconnected(callback)`

Registers a callback for participant disconnection events.

`client.register_on_tts_playback_complete(callback)`

Registers a callback for TTS playback completion events.

`await client.speak(text, flush=True, allow_interruption=True)`

Sends text to be synthesized as speech.

Parameter	Type	Default	Purpose
`text`	`str`	-	Text to synthesize.
`flush`	`bool`	`True`	Clear TTS queue before speaking.
`allow_interruption`	`bool`	`True`	Allow speech to be interrupted.

Example:

await client.speak("Hello, world!")
await client.speak("Important message", flush=True, allow_interruption=False)

`await client.on_audio_input(audio_data)`

Sends raw audio data (bytes) to the server for speech recognition.

Example:

await client.on_audio_input(audio_bytes)

`await client.send_message(message, role, topic="messages", debug=None)`

Sends a message to the Sayna session with role and optional metadata.

Parameter	Type	Default	Purpose
`message`	`str`	-	Message content.
`role`	`str`	-	Sender role (e.g., 'user', 'assistant').
`topic`	`str`	`"messages"`	LiveKit topic/channel.
`debug`	`dict` (optional)	`None`	Optional debug metadata.

`await client.clear()`

Clears the text-to-speech queue.

`await client.tts_flush(allow_interruption=True)`

Flushes the TTS queue by sending an empty speak command.

`await client.disconnect()`

Disconnects from the WebSocket server and cleans up resources.

Properties

client.ready: Boolean indicating whether the connection is ready (received ready message).
client.connected: Boolean indicating whether the WebSocket is connected.
client.livekit_room_name: LiveKit room name (available after ready when LiveKit is enabled).
client.livekit_url: LiveKit URL (available after ready).
client.sayna_participant_identity: Sayna participant identity (available after ready when LiveKit is enabled).
client.sayna_participant_name: Sayna participant name (available after ready when LiveKit is enabled).

Development

Setup

# Create a virtual environment
python -m venv .venv

# Activate it
source .venv/bin/activate  # On Linux/macOS
# .venv\Scripts\activate  # On Windows

# Install development dependencies
pip install -e ".[dev]"

Tip: For faster dependency installation, you can use uv:
pip install uv
uv pip install -e ".[dev]"

Running Tests

# Run all tests
pytest

# Run with coverage
pytest --cov=sayna_client --cov-report=html

# Run specific test file
pytest tests/test_client.py

Type Checking

mypy src/sayna_client

Linting and Formatting

This project uses Ruff for linting and formatting:

# Check code
ruff check .

# Format code
ruff format .

# Fix auto-fixable issues
ruff check --fix .

Requirements

Python 3.9 or higher
aiohttp >= 3.9.0
pydantic >= 2.0.0

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Support

For issues and questions, please visit the GitHub Issues page.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.0.17

Mar 30, 2026

0.0.16

Mar 24, 2026

0.0.15

Mar 6, 2026

0.0.14

Feb 27, 2026

0.0.13

Jan 21, 2026

0.0.12

Jan 7, 2026

0.0.11

Dec 31, 2025

0.0.10

Dec 6, 2025

0.0.9

Nov 30, 2025

0.0.8

Nov 27, 2025

0.0.7

Nov 26, 2025

0.0.6

Nov 23, 2025

This version

0.0.5

Nov 13, 2025

0.0.4

Oct 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sayna_client-0.0.5.tar.gz (20.8 kB view details)

Uploaded Nov 13, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sayna_client-0.0.5-py3-none-any.whl (16.4 kB view details)

Uploaded Nov 13, 2025 Python 3

File details

Details for the file sayna_client-0.0.5.tar.gz.

File metadata

Download URL: sayna_client-0.0.5.tar.gz
Upload date: Nov 13, 2025
Size: 20.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for sayna_client-0.0.5.tar.gz
Algorithm	Hash digest
SHA256	`0e3998455823a8150e70bed276d379bb9920fd9f6d552c4944834ab5b54dbed4`
MD5	`c195741abe9da836a357ae2ea854110f`
BLAKE2b-256	`ec28f97db222e8e72ae9d03a6457b6fcd4efac98e5420139c5d7b585d19bd476`

See more details on using hashes here.

File details

Details for the file sayna_client-0.0.5-py3-none-any.whl.

File metadata

Download URL: sayna_client-0.0.5-py3-none-any.whl
Upload date: Nov 13, 2025
Size: 16.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for sayna_client-0.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b599ff28d354a79cc7257eec733a392ccfefbf69fba982abe6be95d040a8f852`
MD5	`58c296a8dc852c4b75b362613551bb10`
BLAKE2b-256	`64e5120726c9e29e96b91cdd232efba7945eae48969db4daf55d40f24e3d9773`

See more details on using hashes here.

sayna-client 0.0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Sayna Python SDK

Features

Installation

Quick Start

API

REST API Methods

await client.health()

await client.get_voices()

await client.speak_rest(text, tts_config)

await client.get_livekit_token(room_name, participant_name, participant_identity)

WebSocket API Methods

SaynaClient(url, stt_config, tts_config, livekit_config=None, without_audio=False, api_key=None)

await client.connect()

client.register_on_stt_result(callback)

client.register_on_tts_audio(callback)

client.register_on_error(callback)

client.register_on_message(callback)

client.register_on_participant_disconnected(callback)

client.register_on_tts_playback_complete(callback)

await client.speak(text, flush=True, allow_interruption=True)

await client.on_audio_input(audio_data)

await client.send_message(message, role, topic="messages", debug=None)

await client.clear()

await client.tts_flush(allow_interruption=True)

await client.disconnect()

Properties

Development

Setup

Running Tests

Type Checking

Linting and Formatting

Requirements

Contributing

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`await client.health()`

`await client.get_voices()`

`await client.speak_rest(text, tts_config)`

`await client.get_livekit_token(room_name, participant_name, participant_identity)`

`SaynaClient(url, stt_config, tts_config, livekit_config=None, without_audio=False, api_key=None)`

`await client.connect()`

`client.register_on_stt_result(callback)`

`client.register_on_tts_audio(callback)`

`client.register_on_error(callback)`

`client.register_on_message(callback)`

`client.register_on_participant_disconnected(callback)`

`client.register_on_tts_playback_complete(callback)`

`await client.speak(text, flush=True, allow_interruption=True)`

`await client.on_audio_input(audio_data)`

`await client.send_message(message, role, topic="messages", debug=None)`

`await client.clear()`

`await client.tts_flush(allow_interruption=True)`

`await client.disconnect()`