
Lexilux


Lexilux is a unified LLM API client library that makes calling Chat, Embedding, Rerank, and Tokenizer APIs as simple as calling a function.

Features

  • Function-like API: Call APIs like functions (chat("hi"), embed(["text"]))
  • Streaming Support: Built-in streaming for Chat with usage tracking
  • Unified Usage: Consistent usage statistics across all APIs
  • Flexible Input: Support multiple input formats (string, list, dict)
  • OpenAI-Compatible: Works with OpenAI-compatible APIs
  • Automatic Retry: Built-in retry logic with exponential backoff
  • Connection Pooling: HTTP connection pooling for better performance
  • Rate Limiting: Built-in rate limiter for API request throttling
  • SSL Control: Configurable SSL certificate verification
  • Input Validation: Comprehensive parameter validation with clear errors
  • Exception Hierarchy: Comprehensive exception system with error codes
  • Function Calling: OpenAI-compatible function/tool calling support
  • Multimodal Support: Vision capabilities with image inputs
  • Async Support: Full async/await API for concurrent operations

One Client, Multiple Providers

Lexilux is designed to work seamlessly with all major LLM providers through their OpenAI-compatible APIs:

Provider          Base URL
OpenAI            https://api.openai.com/v1
DeepSeek          https://api.deepseek.com
GLM / ZhipuAI     https://open.bigmodel.cn/api/paas/v4
Kimi / Moonshot   https://api.moonshot.cn/v1
Minimax           https://api.minimax.chat/v1
Qwen / Alibaba    https://dashscope.aliyuncs.com/compatible-mode/v1
Groq              https://api.groq.com/openai/v1

Simply change base_url and api_key to switch providers:

# OpenAI
chat = Chat(base_url="https://api.openai.com/v1", api_key="sk-...", model="gpt-4o")

# DeepSeek
chat = Chat(base_url="https://api.deepseek.com", api_key="sk-...", model="deepseek-chat")

# GLM (ZhipuAI)
chat = Chat(base_url="https://open.bigmodel.cn/api/paas/v4", api_key="...", model="glm-4-plus")

See PROVIDERS.md for complete provider documentation.

Installation

Quick Install

pip install lexilux

With Tokenizer Support

pip install "lexilux[tokenizer]"

Development Setup with uv (Recommended)

This project uses uv for fast dependency management.

# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh

# For active development
make dev-install

# Or manually with uv
uv sync --group docs --all-extras

Legacy pip Setup

pip install -e ".[dev]"

Quick Start

Basic Chat

from lexilux import Chat

chat = Chat(base_url="https://api.example.com/v1", api_key="your-key", model="gpt-4")
result = chat("Hello, world!")
print(result.text)
print(result.usage.total_tokens)

Streaming

for chunk in chat.stream("Tell me a joke"):
    print(chunk.delta, end="", flush=True)
    if chunk.done:
        print(f"\nTokens: {chunk.usage.total_tokens}")

Error Handling

from lexilux import LexiluxError, AuthenticationError, RateLimitError

try:
    result = chat("Hello, world!")
except AuthenticationError as e:
    print(f"Authentication failed: {e.message}")
except RateLimitError as e:
    if e.retryable:
        print(f"Rate limited: {e.message}")
except LexiluxError as e:
    print(f"Error: {e.code} - {e.message}")

Function Calling

from lexilux import Chat, FunctionTool

get_weather = FunctionTool(
    name="get_weather",
    description="Get weather for a location",
    parameters={
        "type": "object",
        "properties": {
            "location": {"type": "string", "description": "City name"},
            "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]}
        },
        "required": ["location"]
    }
)

result = chat("What's the weather in Paris?", tools=[get_weather])
if result.has_tool_calls:
    for tool_call in result.tool_calls:
        print(f"Calling: {tool_call.name}")
        print(f"Arguments: {tool_call.get_arguments()}")
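Executing the returned tool calls is left to the application. A minimal dispatch sketch (the handler registry and `get_weather_impl` below are hypothetical application code, not part of Lexilux):

```python
import json

# Hypothetical local implementation, keyed by tool name
def get_weather_impl(location, unit="celsius"):
    # Placeholder: a real handler would call a weather service here
    return {"location": location, "temp": 21, "unit": unit}

HANDLERS = {"get_weather": get_weather_impl}

def dispatch(tool_name, arguments_json):
    """Parse the model's JSON-encoded arguments and invoke the matching handler."""
    args = json.loads(arguments_json)
    return HANDLERS[tool_name](**args)

result = dispatch("get_weather", '{"location": "Paris"}')
```

The handler's return value would typically be sent back to the model as a tool message so it can compose a final answer.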

Rate Limiting

from lexilux import Chat

# Limit to 10 requests per second
chat = Chat(
    base_url="https://api.example.com/v1",
    api_key="your-key",
    model="gpt-4",
    rate_limit=10  # requests per second
)
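One common way a per-second limiter like this works is a token bucket. The sketch below is illustrative only (not Lexilux's actual implementation): it refills tokens at `rate` per second and admits a request only when a whole token is available.

```python
import time

class TokenBucket:
    """Admit at most `rate` requests per second, with bursts up to `capacity`."""

    def __init__(self, rate, capacity=None, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity if capacity is not None else rate
        self.tokens = self.capacity
        self.clock = clock
        self.last = clock()

    def try_acquire(self):
        now = self.clock()
        # Refill proportionally to elapsed time, capped at capacity
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1:
            self.tokens -= 1
            return True
        return False
```

The injected `clock` makes the bucket deterministic to test; a real client would sleep (or yield, in async code) until `try_acquire()` succeeds.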

SSL Verification

from lexilux import Chat

# Disable SSL verification for testing (not recommended for production)
chat = Chat(
    base_url="https://api.example.com/v1",
    api_key="your-key",
    model="gpt-4",
    verify_ssl=False
)

Async

import asyncio
from lexilux import Chat

async def main():
    chat = Chat(base_url="...", api_key="...", model="gpt-4")
    result = await chat.a("Hello, async world!")
    print(result.text)

asyncio.run(main())

Connection Pooling

By default, Lexilux uses a connection pool of size 2 to reuse HTTP connections and improve performance. You can tune this based on your API provider's limits:

chat = Chat(
    base_url="https://api.openai.com/v1",
    api_key="your-key",
    model="gpt-4",
    pool_size=10,  # Increase for higher concurrency
)

Provider Limits:

  • OpenAI: Recommended <= 10
  • Anthropic: Recommended <= 5
  • Other providers: Check their documentation

Automatic Retries

Lexilux automatically retries failed requests with exponential backoff on:

  • Rate limit errors (HTTP 429)
  • Server errors (HTTP 500, 502, 503, 504)
  • Network timeouts or connection errors

Configure retry behavior:

chat = Chat(
    base_url="https://api.openai.com/v1",
    api_key="your-key",
    model="gpt-4",
    max_retries=3,  # Retry up to 3 times on transient errors
)

Note: Only retryable=True errors trigger automatic retries. Authentication and validation errors are never retried.
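The resulting delay schedule can be illustrated with a small helper (the base delay and cap are assumed values for illustration, not Lexilux's documented defaults):

```python
def backoff_delays(max_retries, base=1.0, cap=30.0):
    """Delay in seconds before each retry attempt: base * 2**attempt, capped."""
    return [min(cap, base * (2 ** attempt)) for attempt in range(max_retries)]

backoff_delays(3)  # → [1.0, 2.0, 4.0]
```

Real-world clients usually also add random jitter to these delays so that many clients retrying at once don't synchronize.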

Chat API Selection Guide

Method             Streaming   Ensures Complete   History Behavior
chat()             No          No                 Read-only
stream()           Yes         No                 Read-only
complete()         No          Yes                Internal working copy
complete_stream()  Yes         Yes                Internal working copy

History Behavior:

  • chat() and stream() never modify your history object
  • complete() methods create an internal working copy for state management
  • Your original ChatHistory is always preserved
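The working-copy guarantee boils down to a simple pattern: operate on a copy and leave the caller's object untouched. A minimal illustration, with plain lists standing in for ChatHistory (this is a sketch of the pattern, not Lexilux's internals):

```python
def complete_like(history, new_message):
    """Append to an internal copy; the caller's history is never mutated."""
    working = list(history)  # internal working copy
    working.append(new_message)
    return working

original = [{"role": "user", "content": "hi"}]
updated = complete_like(original, {"role": "assistant", "content": "hello"})
```

After the call, `original` still holds one message while `updated` holds two, which is exactly the behavior the table above describes for the complete() methods.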

Documentation

Full documentation available at: lexilux.readthedocs.io

Examples

Check out the examples/ directory for practical examples:

  • examples/01_hello_world.py - Basic chat completion
  • examples/02_system_message.py - Using system messages
  • examples/10_streaming.py - Streaming chat
  • examples/11_conversation.py - Multi-turn conversations
  • examples/12_chat_params.py - Custom chat parameters
  • examples/20_embedding.py - Text embedding
  • examples/21_rerank.py - Document reranking
  • examples/22_tokenizer.py - Tokenization
  • examples/30_function_calling.py - Function calling
  • examples/31_multimodal.py - Vision capabilities
  • examples/32_async.py - Async operations
  • examples/40_chat_history.py - History management
  • examples/41_auto_continue.py - Continue cut-off responses
  • examples/42_error_handling.py - Error handling patterns
  • examples/43_custom_formatting.py - Custom response formatting

Run examples:

python examples/01_hello_world.py

Testing

# Run unit tests
make test

# Run integration tests
make test-integration

# Run with coverage
make test-cov

# Run linting
make lint

# Format code
make format

Build documentation locally:

cd docs && make html

About Agentsmith

Lexilux is part of the Agentsmith open-source ecosystem. Agentsmith is a B2B AI agent and algorithm development platform currently deployed at highway management companies, securities firms, and regulatory agencies in China. The Agentsmith team is gradually open-sourcing the platform: removing proprietary code, algorithm modules, and enterprise-specific customizations, and decoupling the system into modules for use by the open-source community.

Agentsmith Open-Source Projects

  • Varlord - Configuration management library
  • Routilux - Event-driven workflow orchestration
  • Serilux - Flexible serialization framework
  • Lexilux - Unified LLM API client library

License

Lexilux is licensed under the Apache License 2.0. See LICENSE for details.
