Python client for the LiveLLM Server

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

XvKuoMing

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Typing
- Typed

Project description

LiveLLM Python Client

Python client library for the LiveLLM Server - a unified proxy for AI agent, audio, and transcription services.

Features

🚀 Async-first design - Built on httpx for high-performance async operations
🔒 Type-safe - Full type hints and Pydantic validation
🎯 Multi-provider support - OpenAI, Google, Anthropic, Groq, ElevenLabs
🔄 Streaming support - Real-time streaming for agent and audio responses
🛠️ Agent tools - Web search and MCP server integration
🎙️ Audio services - Text-to-speech and transcription
⚡ Fallback strategies - Sequential and parallel fallback handling
📦 Smart resource management - Automatic cleanup via GC, context managers, or manual control
🧹 Memory safe - No resource leaks with multiple cleanup strategies

Installation

pip install livellm

Or with development dependencies:

pip install livellm[testing]

Quick Start

import asyncio
from livellm import LivellmClient
from livellm.models import Settings, ProviderKind, AgentRequest, TextMessage, MessageRole
from pydantic import SecretStr

async def main():
    # Initialize the client with context manager for automatic cleanup
    async with LivellmClient(base_url="http://localhost:8000") as client:
        # Configure a provider
        config = Settings(
            uid="my-openai-config",
            provider=ProviderKind.OPENAI,
            api_key=SecretStr("your-api-key")
        )
        await client.update_config(config)
        
        # Run an agent query
        request = AgentRequest(
            provider_uid="my-openai-config",
            model="gpt-4",
            messages=[
                TextMessage(role=MessageRole.USER, content="Hello, how are you?")
            ],
            tools=[]
        )
        
        response = await client.agent_run(request)
        print(response.output)

asyncio.run(main())

Configuration

Client Initialization

from livellm import LivellmClient

# Basic initialization
client = LivellmClient(base_url="http://localhost:8000")

# With timeout
client = LivellmClient(
    base_url="http://localhost:8000",
    timeout=30.0
)

# With pre-configured providers (sync operation)
from livellm.models import Settings, ProviderKind
from pydantic import SecretStr

configs = [
    Settings(
        uid="openai-config",
        provider=ProviderKind.OPENAI,
        api_key=SecretStr("sk-..."),
        base_url="https://api.openai.com/v1"  # Optional custom base URL
    ),
    Settings(
        uid="anthropic-config",
        provider=ProviderKind.ANTHROPIC,
        api_key=SecretStr("sk-ant-..."),
        blacklist_models=["claude-instant-1"]  # Optional model blacklist
    )
]

client = LivellmClient(
    base_url="http://localhost:8000",
    configs=configs
)

Provider Configuration

Supported providers:

OPENAI - OpenAI GPT models
GOOGLE - Google Gemini models
ANTHROPIC - Anthropic Claude models
GROQ - Groq models
ELEVENLABS - ElevenLabs text-to-speech

# Add a provider configuration
config = Settings(
    uid="unique-provider-id",
    provider=ProviderKind.OPENAI,
    api_key=SecretStr("your-api-key"),
    base_url="https://custom-endpoint.com",  # Optional
    blacklist_models=["deprecated-model"]     # Optional
)
await client.update_config(config)

# Get all configurations
configs = await client.get_configs()

# Delete a configuration
await client.delete_config("unique-provider-id")

Usage Examples

Agent Services

Basic Agent Run

from livellm.models import AgentRequest, TextMessage, MessageRole

request = AgentRequest(
    provider_uid="my-openai-config",
    model="gpt-4",
    messages=[
        TextMessage(role=MessageRole.SYSTEM, content="You are a helpful assistant."),
        TextMessage(role=MessageRole.USER, content="Explain quantum computing")
    ],
    tools=[],
    gen_config={"temperature": 0.7, "max_tokens": 500}
)

response = await client.agent_run(request)
print(f"Output: {response.output}")
print(f"Tokens used - Input: {response.usage.input_tokens}, Output: {response.usage.output_tokens}")

Note: You can use either MessageRole enum or string values for the role parameter:

# Using enum (recommended for type safety)
TextMessage(role=MessageRole.USER, content="Hello")

# Using string (more convenient)
TextMessage(role="user", content="Hello")

# Both work identically and serialize correctly

Streaming Agent Response

request = AgentRequest(
    provider_uid="my-openai-config",
    model="gpt-4",
    messages=[
        TextMessage(role=MessageRole.USER, content="Tell me a story")
    ],
    tools=[]
)

stream = await client.agent_run_stream(request)
async for chunk in stream:
    print(chunk.output, end="", flush=True)

Agent with Binary Messages

import base64

# Read and encode image
with open("image.jpg", "rb") as f:
    image_data = base64.b64encode(f.read()).decode("utf-8")

from livellm.models import BinaryMessage

request = AgentRequest(
    provider_uid="my-openai-config",
    model="gpt-4-vision",
    messages=[
        BinaryMessage(
            role=MessageRole.USER,
            content=image_data,
            mime_type="image/jpeg",
            caption="What's in this image?"
        )
    ],
    tools=[]
)

response = await client.agent_run(request)

Agent with Web Search Tool

from livellm.models import WebSearchInput, ToolKind

request = AgentRequest(
    provider_uid="my-openai-config",
    model="gpt-4",
    messages=[
        TextMessage(role=MessageRole.USER, content="What's the latest news about AI?")
    ],
    tools=[
        WebSearchInput(
            kind=ToolKind.WEB_SEARCH,
            search_context_size="high"  # Options: "low", "medium", "high"
        )
    ]
)

response = await client.agent_run(request)

Agent with MCP Server Tool

from livellm.models import MCPStreamableServerInput, ToolKind

request = AgentRequest(
    provider_uid="my-openai-config",
    model="gpt-4",
    messages=[
        TextMessage(role=MessageRole.USER, content="Execute tool")
    ],
    tools=[
        MCPStreamableServerInput(
            kind=ToolKind.MCP_STREAMABLE_SERVER,
            url="http://mcp-server:8080",
            prefix="mcp_",
            timeout=15,
            kwargs={"custom_param": "value"}
        )
    ]
)

response = await client.agent_run(request)

Audio Services

Text-to-Speech

from livellm.models import SpeakRequest, SpeakMimeType

request = SpeakRequest(
    provider_uid="elevenlabs-config",
    model="eleven_turbo_v2",
    text="Hello, this is a test of text to speech.",
    voice="rachel",
    mime_type=SpeakMimeType.MP3,
    sample_rate=44100,
    gen_config={"stability": 0.5, "similarity_boost": 0.75}
)

# Get audio as bytes
audio_bytes = await client.speak(request)
with open("output.mp3", "wb") as f:
    f.write(audio_bytes)

Streaming Text-to-Speech

request = SpeakRequest(
    provider_uid="elevenlabs-config",
    model="eleven_turbo_v2",
    text="This is a longer text that will be streamed.",
    voice="rachel",
    mime_type=SpeakMimeType.MP3,
    sample_rate=44100,
    chunk_size=20  # Chunk size in milliseconds
)

# Stream audio chunks
stream = await client.speak_stream(request)
with open("output.mp3", "wb") as f:
    async for chunk in stream:
        f.write(chunk)

Audio Transcription (Multipart)

# Using multipart upload
with open("audio.mp3", "rb") as f:
    file_tuple = ("audio.mp3", f.read(), "audio/mpeg")

response = await client.transcribe(
    provider_uid="openai-config",
    file=file_tuple,
    model="whisper-1",
    language="en",
    gen_config={"temperature": 0.2}
)

print(f"Transcription: {response.text}")
print(f"Detected language: {response.language}")

Audio Transcription (JSON)

import base64
from livellm.models import TranscribeRequest

with open("audio.mp3", "rb") as f:
    audio_data = base64.b64encode(f.read()).decode("utf-8")

request = TranscribeRequest(
    provider_uid="openai-config",
    model="whisper-1",
    file=("audio.mp3", audio_data, "audio/mpeg"),
    language="en"
)

response = await client.transcribe_json(request)

Fallback Strategies

Sequential Fallback (Try each provider in order)

from livellm.models import AgentFallbackRequest, FallbackStrategy

fallback_request = AgentFallbackRequest(
    requests=[
        AgentRequest(
            provider_uid="primary-provider",
            model="gpt-4",
            messages=[TextMessage(role=MessageRole.USER, content="Hello")],
            tools=[]
        ),
        AgentRequest(
            provider_uid="backup-provider",
            model="claude-3",
            messages=[TextMessage(role=MessageRole.USER, content="Hello")],
            tools=[]
        )
    ],
    strategy=FallbackStrategy.SEQUENTIAL,
    timeout_per_request=30
)

response = await client.agent_run(fallback_request)

Parallel Fallback (Try all providers simultaneously)

fallback_request = AgentFallbackRequest(
    requests=[
        AgentRequest(provider_uid="provider-1", model="gpt-4", messages=messages, tools=[]),
        AgentRequest(provider_uid="provider-2", model="claude-3", messages=messages, tools=[]),
        AgentRequest(provider_uid="provider-3", model="gemini-pro", messages=messages, tools=[])
    ],
    strategy=FallbackStrategy.PARALLEL,
    timeout_per_request=10
)

response = await client.agent_run(fallback_request)

Audio Fallback

from livellm.models import AudioFallbackRequest

fallback_request = AudioFallbackRequest(
    requests=[
        SpeakRequest(provider_uid="elevenlabs", model="model-1", text=text, voice="voice1", 
                     mime_type=SpeakMimeType.MP3, sample_rate=44100),
        SpeakRequest(provider_uid="openai", model="tts-1", text=text, voice="alloy",
                     mime_type=SpeakMimeType.MP3, sample_rate=44100)
    ],
    strategy=FallbackStrategy.SEQUENTIAL
)

audio = await client.speak(fallback_request)

Resource Management

The client provides multiple ways to manage resources and cleanup:

1. Automatic Cleanup (Garbage Collection)

The client automatically cleans up when garbage collected:

async def main():
    client = LivellmClient(base_url="http://localhost:8000")
    
    # Use client...
    response = await client.ping()
    
    # No explicit cleanup needed - handled automatically when object is destroyed
    # Note: Provider configs are deleted synchronously from the server

asyncio.run(main())

Note: While automatic cleanup works, it shows a ResourceWarning if configs exist to encourage explicit cleanup for immediate resource release.

2. Context Manager (Recommended)

Use async context managers for guaranteed cleanup:

async with LivellmClient(base_url="http://localhost:8000") as client:
    config = Settings(uid="temp-config", provider=ProviderKind.OPENAI, 
                      api_key=SecretStr("key"))
    await client.update_config(config)
    
    # Use client...
    response = await client.ping()
    
# Automatically cleans up configs and closes HTTP client

3. Manual Cleanup

Explicitly call cleanup in a try/finally block:

client = LivellmClient(base_url="http://localhost:8000")
try:
    # Use client...
    response = await client.ping()
finally:
    await client.cleanup()

Cleanup Behavior

The cleanup() method:

Deletes all provider configs created by the client
Closes the HTTP client connection
Is idempotent (safe to call multiple times)

The __del__() destructor (automatic cleanup):

Triggers when the object is garbage collected
Synchronously deletes provider configs from the server
Closes the HTTP client connection
Shows a ResourceWarning if configs exist (to encourage explicit cleanup)

API Reference

Client Methods

Health Check

ping() -> SuccessResponse - Check server health

Configuration Management

update_config(config: Settings) -> SuccessResponse - Add/update a provider config
update_configs(configs: List[Settings]) -> SuccessResponse - Add/update multiple configs
get_configs() -> List[Settings] - Get all provider configurations
delete_config(config_uid: str) -> SuccessResponse - Delete a provider config

Agent Services

agent_run(request: AgentRequest | AgentFallbackRequest) -> AgentResponse - Run agent query
agent_run_stream(request: AgentRequest | AgentFallbackRequest) -> AsyncIterator[AgentResponse] - Stream agent response

Audio Services

speak(request: SpeakRequest | AudioFallbackRequest) -> bytes - Text-to-speech
speak_stream(request: SpeakRequest | AudioFallbackRequest) -> AsyncIterator[bytes] - Streaming TTS
transcribe(provider_uid, file, model, language?, gen_config?) -> TranscribeResponse - Multipart transcription
transcribe_json(request: TranscribeRequest | TranscribeFallbackRequest) -> TranscribeResponse - JSON transcription

Cleanup

cleanup() -> None - Clean up resources and close client (async)
__aenter__() / __aexit__() - Async context manager support
__del__() - Automatic cleanup when garbage collected (sync)

Models

Common Models

Settings - Provider configuration
ProviderKind - Enum of supported providers
SuccessResponse - Generic success response
BaseRequest - Base class for all requests

Agent Models

AgentRequest - Agent query request
AgentResponse - Agent query response
AgentResponseUsage - Token usage information
TextMessage - Text-based message
BinaryMessage - Binary message (images, audio, etc.)
MessageRole - Enum: USER, MODEL, SYSTEM

Tool Models

ToolKind - Enum: WEB_SEARCH, MCP_STREAMABLE_SERVER
WebSearchInput - Web search tool configuration
MCPStreamableServerInput - MCP server tool configuration

Audio Models

SpeakRequest - Text-to-speech request
SpeakMimeType - Enum: PCM, WAV, MP3, ULAW, ALAW
TranscribeRequest - Transcription request
TranscribeResponse - Transcription response

Fallback Models

FallbackStrategy - Enum: SEQUENTIAL, PARALLEL
AgentFallbackRequest - Agent fallback configuration
AudioFallbackRequest - Audio fallback configuration
TranscribeFallbackRequest - Transcription fallback configuration

Error Handling

The client raises exceptions for HTTP errors:

try:
    response = await client.agent_run(request)
except Exception as e:
    print(f"Error: {e}")

For more granular error handling:

import httpx

try:
    response = await client.ping()
except httpx.HTTPStatusError as e:
    print(f"HTTP error: {e.response.status_code}")
except httpx.RequestError as e:
    print(f"Request error: {e}")

Development

Running Tests

# Install development dependencies
pip install -e ".[testing]"

# Run tests
pytest tests/

Type Checking

The library is fully typed. Run type checking with:

pip install mypy
mypy livellm

Requirements

Python 3.10+
httpx >= 0.27.0
pydantic >= 2.0.0

License

MIT License - see LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Changelog

See CHANGELOG.md for version history and changes.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

XvKuoMing

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
Typing
- Typed

Release history Release notifications | RSS feed

1.8.0

May 5, 2026

1.7.5

Apr 2, 2026

1.7.4 yanked

Apr 2, 2026

Reason this release was yanked:

ws loop break error

1.7.3 yanked

Apr 2, 2026

Reason this release was yanked:

broken ws reconnection

1.7.2

Feb 4, 2026

1.7.1

Feb 4, 2026

1.6.1

Jan 23, 2026

1.5.5

Dec 19, 2025

1.5.4

Dec 19, 2025

1.5.3

Dec 19, 2025

1.5.2

Dec 19, 2025

1.5.1

Dec 19, 2025

1.4.5

Dec 15, 2025

1.4.0

Nov 21, 2025

1.3.6

Nov 19, 2025

1.3.5 yanked

Nov 18, 2025

Reason this release was yanked:

broken agent streaming

1.3.0

Nov 18, 2025

1.2.0

Nov 5, 2025

This version

1.1.1

Nov 5, 2025

1.1.0 yanked

Nov 5, 2025

Reason this release was yanked:

unstable

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

livellm-1.1.1.tar.gz (11.6 kB view details)

Uploaded Nov 5, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

livellm-1.1.1-py3-none-any.whl (15.8 kB view details)

Uploaded Nov 5, 2025 Python 3

File details

Details for the file livellm-1.1.1.tar.gz.

File metadata

Download URL: livellm-1.1.1.tar.gz
Upload date: Nov 5, 2025
Size: 11.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.9.7

File hashes

Hashes for livellm-1.1.1.tar.gz
Algorithm	Hash digest
SHA256	`c18c83b257db6f0c89868a83409536a4727576226fcecad7adef25ed61a9f3ab`
MD5	`f81bd1e992453673984912af1003bf18`
BLAKE2b-256	`aac3536b7771155a7888528c1b7706666ffbbe4bde9a17ed9b157da7085ecb6f`

See more details on using hashes here.

File details

Details for the file livellm-1.1.1-py3-none-any.whl.

File metadata

Download URL: livellm-1.1.1-py3-none-any.whl
Upload date: Nov 5, 2025
Size: 15.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.9.7

File hashes

Hashes for livellm-1.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a12eff14ab092adae32155bc4e7006ac9013e00354233aa823ddff6c0aff8fbf`
MD5	`cec2e5e7f5f197c5bacb752f4e85ef77`
BLAKE2b-256	`2a1c0188ea7e110f50762e300ba48c626d43825da290ab8ae377bddb755e0950`

See more details on using hashes here.

livellm 1.1.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

LiveLLM Python Client

Features

Installation

Quick Start

Configuration

Client Initialization

Provider Configuration

Usage Examples

Agent Services

Basic Agent Run

Streaming Agent Response

Agent with Binary Messages

Agent with Web Search Tool

Agent with MCP Server Tool

Audio Services

Text-to-Speech

Streaming Text-to-Speech

Audio Transcription (Multipart)

Audio Transcription (JSON)

Fallback Strategies

Sequential Fallback (Try each provider in order)

Parallel Fallback (Try all providers simultaneously)

Audio Fallback

Resource Management

1. Automatic Cleanup (Garbage Collection)

2. Context Manager (Recommended)

3. Manual Cleanup

Cleanup Behavior

API Reference

Client Methods

Health Check

Configuration Management

Agent Services

Audio Services

Cleanup

Models

Common Models

Agent Models

Tool Models

Audio Models

Fallback Models

Error Handling

Development

Running Tests

Type Checking

Requirements

License

Contributing

Links

Changelog

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes