Python client for the LiveLLM Server

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

XvKuoMing

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Typing
- Typed

Project description

LiveLLM Python Client

Python client library for the LiveLLM Server - a unified proxy for AI agent, audio, and transcription services.

Features

🚀 Async-first - Built on httpx for high-performance operations
🔒 Type-safe - Full type hints and Pydantic validation
🎯 Multi-provider - OpenAI, Google, Anthropic, Groq, ElevenLabs
🔄 Streaming - Real-time streaming for agent and audio
🛠️ Flexible API - Use request objects or keyword arguments
🎙️ Audio services - Text-to-speech and transcription
⚡ Fallback strategies - Sequential and parallel handling
🧹 Auto cleanup - Context managers and garbage collection

Installation

pip install livellm

Or with development dependencies:

pip install livellm[testing]

Quick Start

import asyncio
from livellm import LivellmClient
from livellm.models import Settings, ProviderKind, TextMessage, MessageRole

async def main():
    # Initialize with automatic provider setup
    async with LivellmClient(
        base_url="http://localhost:8000",
        configs=[
            Settings(
                uid="openai",
                provider=ProviderKind.OPENAI,
                api_key="your-api-key"
            )
        ]
    ) as client:
        # Simple keyword arguments style (gen_config as kwargs)
        response = await client.agent_run(
            provider_uid="openai",
            model="gpt-4",
            messages=[TextMessage(role="user", content="Hello!")],
            temperature=0.7
        )
        print(response.output)

asyncio.run(main())

Configuration

Client Initialization

from livellm import LivellmClient
from livellm.models import Settings, ProviderKind

# Basic
client = LivellmClient(base_url="http://localhost:8000")

# With timeout and pre-configured providers
client = LivellmClient(
    base_url="http://localhost:8000",
    timeout=30.0,
    configs=[
        Settings(
            uid="openai",
            provider=ProviderKind.OPENAI,
            api_key="sk-...",
            base_url="https://api.openai.com/v1"  # Optional
        ),
        Settings(
            uid="anthropic",
            provider=ProviderKind.ANTHROPIC,
            api_key="sk-ant-...",
            blacklist_models=["claude-instant-1"]  # Optional
        )
    ]
)

Supported Providers

OPENAI • GOOGLE • ANTHROPIC • GROQ • ELEVENLABS

# Add provider dynamically
await client.update_config(Settings(
    uid="my-provider",
    provider=ProviderKind.OPENAI,
    api_key="your-api-key"
))

# List and delete
configs = await client.get_configs()
await client.delete_config("my-provider")

Usage Examples

Agent Services

Two Ways to Call Methods

All methods support two calling styles:

Style 1: Keyword arguments (kwargs become gen_config)

response = await client.agent_run(
    provider_uid="openai",
    model="gpt-4",
    messages=[TextMessage(role="user", content="Hello!")],
    temperature=0.7,
    max_tokens=500
)

Style 2: Request objects

from livellm.models import AgentRequest

response = await client.agent_run(
    AgentRequest(
        provider_uid="openai",
        model="gpt-4",
        messages=[TextMessage(role="user", content="Hello!")],
        gen_config={"temperature": 0.7, "max_tokens": 500}
    )
)

Basic Agent Run

from livellm.models import TextMessage

# Using kwargs (recommended for simplicity)
response = await client.agent_run(
    provider_uid="openai",
    model="gpt-4",
    messages=[
        TextMessage(role="system", content="You are helpful."),
        TextMessage(role="user", content="Explain quantum computing")
    ],
    temperature=0.7,
    max_tokens=500
)
print(f"Output: {response.output}")
print(f"Tokens: {response.usage.input_tokens} in, {response.usage.output_tokens} out")

Streaming Agent Response

# Streaming also supports both styles
stream = client.agent_run_stream(
    provider_uid="openai",
    model="gpt-4",
    messages=[TextMessage(role="user", content="Tell me a story")],
    temperature=0.8
)

async for chunk in stream:
    print(chunk.output, end="", flush=True)

Agent with Vision (Binary Messages)

import base64
from livellm.models import BinaryMessage

with open("image.jpg", "rb") as f:
    image_data = base64.b64encode(f.read()).decode("utf-8")

response = await client.agent_run(
    provider_uid="openai",
    model="gpt-4-vision",
    messages=[
        BinaryMessage(
            role="user",
            content=image_data,
            mime_type="image/jpeg",
            caption="What's in this image?"
        )
    ]
)

Agent with Tools

from livellm.models import WebSearchInput, MCPStreamableServerInput, ToolKind

# Web search tool
response = await client.agent_run(
    provider_uid="openai",
    model="gpt-4",
    messages=[TextMessage(role="user", content="Latest AI news?")],
    tools=[WebSearchInput(
        kind=ToolKind.WEB_SEARCH,
        search_context_size="high"  # low, medium, or high
    )]
)

# MCP server tool
response = await client.agent_run(
    provider_uid="openai",
    model="gpt-4",
    messages=[TextMessage(role="user", content="Run custom tool")],
    tools=[MCPStreamableServerInput(
        kind=ToolKind.MCP_STREAMABLE_SERVER,
        url="http://mcp-server:8080",
        prefix="mcp_",
        timeout=15
    )]
)

Audio Services

Text-to-Speech

from livellm.models import SpeakMimeType

# Non-streaming
audio = await client.speak(
    provider_uid="openai",
    model="tts-1",
    text="Hello, world!",
    voice="alloy",
    mime_type=SpeakMimeType.MP3,
    sample_rate=24000,
    speed=1.0  # kwargs become gen_config
)
with open("output.mp3", "wb") as f:
    f.write(audio)

# Streaming
audio = bytes()
async for chunk in client.speak_stream(
    provider_uid="openai",
    model="tts-1",
    text="Hello, world!",
    voice="alloy",
    mime_type=SpeakMimeType.PCM,
    sample_rate=24000
):
    audio += chunk

# Save PCM as WAV
import wave
with wave.open("output.wav", "wb") as wf:
    wf.setnchannels(1)
    wf.setsampwidth(2)
    wf.setframerate(24000)
    wf.writeframes(audio)

Transcription

# Method 1: Multipart upload (kwargs style)
with open("audio.wav", "rb") as f:
    audio_bytes = f.read()

transcription = await client.transcribe(
    provider_uid="openai",
    file=("audio.wav", audio_bytes, "audio/wav"),
    model="whisper-1",
    language="en",  # Optional
    temperature=0.0  # kwargs become gen_config
)
print(f"Text: {transcription.text}")
print(f"Language: {transcription.language}")

# Method 2: JSON request object (base64-encoded)
import base64
from livellm.models import TranscribeRequest

audio_b64 = base64.b64encode(audio_bytes).decode("utf-8")
transcription = await client.transcribe(
    TranscribeRequest(
        provider_uid="openai",
        file=("audio.wav", audio_b64, "audio/wav"),
        model="whisper-1"
    )
)

Fallback Strategies

Handle failures automatically with sequential or parallel fallback:

from livellm.models import AgentRequest, AgentFallbackRequest, FallbackStrategy, TextMessage

messages = [TextMessage(role="user", content="Hello!")]

# Sequential: try each in order until one succeeds
response = await client.agent_run(
    AgentFallbackRequest(
        strategy=FallbackStrategy.SEQUENTIAL,
        requests=[
            AgentRequest(provider_uid="primary", model="gpt-4", messages=messages, tools=[]),
            AgentRequest(provider_uid="backup", model="claude-3", messages=messages, tools=[])
        ],
        timeout_per_request=30
    )
)

# Parallel: try all simultaneously, use first success
response = await client.agent_run(
    AgentFallbackRequest(
        strategy=FallbackStrategy.PARALLEL,
        requests=[
            AgentRequest(provider_uid="p1", model="gpt-4", messages=messages, tools=[]),
            AgentRequest(provider_uid="p2", model="claude-3", messages=messages, tools=[]),
            AgentRequest(provider_uid="p3", model="gemini-pro", messages=messages, tools=[])
        ],
        timeout_per_request=10
    )
)

# Also works for audio
from livellm.models import AudioFallbackRequest, SpeakRequest

audio = await client.speak(
    AudioFallbackRequest(
        strategy=FallbackStrategy.SEQUENTIAL,
        requests=[
            SpeakRequest(provider_uid="elevenlabs", model="turbo", text="Hi", 
                        voice="rachel", mime_type=SpeakMimeType.MP3, sample_rate=44100),
            SpeakRequest(provider_uid="openai", model="tts-1", text="Hi",
                        voice="alloy", mime_type=SpeakMimeType.MP3, sample_rate=44100)
        ]
    )
)

Resource Management

Recommended: Use context managers for automatic cleanup.

# ✅ Best: Context manager (auto cleanup)
async with LivellmClient(base_url="http://localhost:8000") as client:
    response = await client.ping()
# Configs deleted, connection closed automatically

# ✅ Good: Manual cleanup
client = LivellmClient(base_url="http://localhost:8000")
try:
    response = await client.ping()
finally:
    await client.cleanup()

# ⚠️ OK: Garbage collection (shows warning if configs exist)
client = LivellmClient(base_url="http://localhost:8000")
response = await client.ping()
# Cleaned up when object is destroyed

API Reference

Client Methods

Configuration

ping() - Health check
update_config(config) / update_configs(configs) - Add/update providers
get_configs() - List all configurations
delete_config(uid) - Remove provider

Agent

agent_run(request | **kwargs) - Run agent (blocking)
agent_run_stream(request | **kwargs) - Run agent (streaming)

Audio

speak(request | **kwargs) - Text-to-speech (blocking)
speak_stream(request | **kwargs) - Text-to-speech (streaming)
transcribe(request | **kwargs) - Speech-to-text

Cleanup

cleanup() - Release resources
async with client: - Auto cleanup (recommended)

Key Models

Core

Settings(uid, provider, api_key, base_url?, blacklist_models?) - Provider config
ProviderKind - OPENAI | GOOGLE | ANTHROPIC | GROQ | ELEVENLABS

Messages

TextMessage(role, content) - Text message
BinaryMessage(role, content, mime_type, caption?) - Image/audio message
MessageRole - USER | MODEL | SYSTEM (or use strings: "user", "model", "system")

Requests

AgentRequest(provider_uid, model, messages, tools?, gen_config?)
SpeakRequest(provider_uid, model, text, voice, mime_type, sample_rate, gen_config?)
TranscribeRequest(provider_uid, file, model, language?, gen_config?)

Tools

WebSearchInput(kind=ToolKind.WEB_SEARCH, search_context_size)
MCPStreamableServerInput(kind=ToolKind.MCP_STREAMABLE_SERVER, url, prefix?, timeout?)

Fallback

AgentFallbackRequest(strategy, requests, timeout_per_request?)
AudioFallbackRequest(strategy, requests, timeout_per_request?)
FallbackStrategy - SEQUENTIAL | PARALLEL

Responses

AgentResponse(output, usage{input_tokens, output_tokens}, ...)
TranscribeResponse(text, language)

Error Handling

import httpx

try:
    response = await client.agent_run(
        provider_uid="openai",
        model="gpt-4",
        messages=[TextMessage(role="user", content="Hi")]
    )
except httpx.HTTPStatusError as e:
    print(f"HTTP {e.response.status_code}: {e.response.text}")
except httpx.RequestError as e:
    print(f"Request failed: {e}")

Development

# Install with dev dependencies
pip install -e ".[testing]"

# Run tests
pytest tests/

# Type checking
mypy livellm

Requirements

Python 3.10+
httpx >= 0.27.0
pydantic >= 2.0.0

License

MIT License - see LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

XvKuoMing

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Typing
- Typed

Release history Release notifications | RSS feed

1.8.0

May 5, 2026

1.7.5

Apr 2, 2026

1.7.4 yanked

Apr 2, 2026

Reason this release was yanked:

ws loop break error

1.7.3 yanked

Apr 2, 2026

Reason this release was yanked:

broken ws reconnection

1.7.2

Feb 4, 2026

1.7.1

Feb 4, 2026

1.6.1

Jan 23, 2026

1.5.5

Dec 19, 2025

1.5.4

Dec 19, 2025

1.5.3

Dec 19, 2025

1.5.2

Dec 19, 2025

1.5.1

Dec 19, 2025

1.4.5

Dec 15, 2025

1.4.0

Nov 21, 2025

1.3.6

Nov 19, 2025

1.3.5 yanked

Nov 18, 2025

Reason this release was yanked:

broken agent streaming

1.3.0

Nov 18, 2025

This version

1.2.0

Nov 5, 2025

1.1.1

Nov 5, 2025

1.1.0 yanked

Nov 5, 2025

Reason this release was yanked:

unstable

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

livellm-1.2.0.tar.gz (12.2 kB view details)

Uploaded Nov 5, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

livellm-1.2.0-py3-none-any.whl (16.5 kB view details)

Uploaded Nov 5, 2025 Python 3

File details

Details for the file livellm-1.2.0.tar.gz.

File metadata

Download URL: livellm-1.2.0.tar.gz
Upload date: Nov 5, 2025
Size: 12.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.9.7

File hashes

Hashes for livellm-1.2.0.tar.gz
Algorithm	Hash digest
SHA256	`a2c452e02395d887a38fc3f0207040e6e71c4d63a2ae89cdcaaefc14fced8097`
MD5	`dbc18c5b3a8e59a8a54d451d825dcbec`
BLAKE2b-256	`29b8ac2b29d9a6b0615f63d1add47d22a7e5473062820816632b4f967fc62ffd`

See more details on using hashes here.

File details

Details for the file livellm-1.2.0-py3-none-any.whl.

File metadata

Download URL: livellm-1.2.0-py3-none-any.whl
Upload date: Nov 5, 2025
Size: 16.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.9.7

File hashes

Hashes for livellm-1.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`11d0613f60203e5cf961195aa9b2b04a4c8f58cc3d4b17a196c893fe71181b77`
MD5	`40c0dd9353326bf24f3534471fdc4845`
BLAKE2b-256	`3ccfe549a32fec1d8c7d90aa3c6710818b240a3b1c37eb895dd00f5ad30d236c`

See more details on using hashes here.

livellm 1.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

LiveLLM Python Client

Features

Installation

Quick Start

Configuration

Client Initialization

Supported Providers

Usage Examples

Agent Services

Two Ways to Call Methods

Basic Agent Run

Streaming Agent Response

Agent with Vision (Binary Messages)

Agent with Tools

Audio Services

Text-to-Speech

Transcription

Fallback Strategies

Resource Management

API Reference

Client Methods

Key Models

Error Handling

Development

Requirements

Links

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes