
any-llm-client

A unified and lightweight asynchronous Python API for communicating with LLMs.

Supports multiple providers, including OpenAI Chat Completions API (and any OpenAI-compatible API, such as Ollama and vLLM) and YandexGPT API.

How To Use

Before you start using any-llm-client, make sure it is installed:

uv add any-llm-client
poetry add any-llm-client
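
The package is published on PyPI, so plain pip works as well:

pip install any-llm-client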

Response API

Here's a full example that uses Ollama and Qwen2.5-Coder:

import asyncio

import any_llm_client


config = any_llm_client.OpenAIConfig(url="http://127.0.0.1:11434/v1/chat/completions", model_name="qwen2.5-coder:1.5b")


async def main() -> None:
    async with any_llm_client.get_client(config) as client:
        print(await client.request_llm_message("Hey, how's it going?"))


asyncio.run(main())

To use YandexGPT, replace the config:

import os

config = any_llm_client.YandexGPTConfig(
    auth_header=os.environ["YANDEX_AUTH_HEADER"], folder_id=os.environ["YANDEX_FOLDER_ID"], model_name="yandexgpt"
)

Streaming API

LLMs often take a long time to respond fully. Here's an example of the streaming API:

import asyncio

import any_llm_client


config = any_llm_client.OpenAIConfig(url="http://127.0.0.1:11434/v1/chat/completions", model_name="qwen2.5-coder:1.5b")


async def main() -> None:
    async with (
        any_llm_client.get_client(config) as client,
        client.stream_llm_partial_messages("Hey, how's it going?") as partial_messages,
    ):
        async for message in partial_messages:
            print("\033[2J")  # clear screen
            print(message)


asyncio.run(main())

Note that this yields the partial, growing message rather than individual chunks, for example: "Hi", "Hi there!", "Hi there! How can I help you?".
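
If you need chunk-style deltas instead (for example, to append output to a terminal or forward it over a websocket), you can derive them from the growing messages. A minimal sketch, assuming each partial message extends the previous one as described above (stream_deltas is a hypothetical helper, not part of the library):

from collections.abc import AsyncIterable, AsyncIterator


async def stream_deltas(partial_messages: AsyncIterable[str]) -> AsyncIterator[str]:
    # Each partial message extends the previous one,
    # so the delta is the newly appended suffix.
    previous = ""
    async for message in partial_messages:
        yield message[len(previous):]
        previous = message

You could then replace the loop in the example above with: async for delta in stream_deltas(partial_messages): print(delta, end="", flush=True).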

Passing chat history and temperature

You can pass a list of messages instead of a str as the first argument, and set the temperature:

async with (
    any_llm_client.get_client(config) as client,
    client.stream_llm_partial_messages(
        messages=[
            any_llm_client.SystemMessage("You are an experienced assistant"),
            any_llm_client.UserMessage("Hey, how's it going?"),
        ],
        temperature=1.0,
    ) as partial_messages,
):
    ...

Other

Mock client

You can use a mock client for testing:

config = any_llm_client.MockLLMConfig(
    response_message=...,
    stream_messages=["Hi!"],
)

async with any_llm_client.get_client(config, ...) as client:
    ...

Configuration with environment variables

Credentials

Instead of passing credentials directly, you can set the corresponding environment variables:

  • OpenAI: ANY_LLM_CLIENT_OPENAI_AUTH_TOKEN,
  • YandexGPT: ANY_LLM_CLIENT_YANDEXGPT_AUTH_HEADER, ANY_LLM_CLIENT_YANDEXGPT_FOLDER_ID.
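
For example, with ANY_LLM_CLIENT_OPENAI_AUTH_TOKEN exported in your environment, the auth token no longer needs to appear in code (a minimal sketch reusing the placeholder URL and model name from the examples above):

import any_llm_client

# No auth_token here: it is read from the
# ANY_LLM_CLIENT_OPENAI_AUTH_TOKEN environment variable, as described above.
config = any_llm_client.OpenAIConfig(
    url="https://api.openai.com/v1/chat/completions", model_name="gpt-4o-mini"
)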

LLM model config (with pydantic-settings)

import os

import pydantic_settings

import any_llm_client


class Settings(pydantic_settings.BaseSettings):
    llm_model: any_llm_client.AnyLLMConfig


os.environ["LLM_MODEL"] = """{
    "api_type": "openai",
    "url": "http://127.0.0.1:11434/v1/chat/completions",
    "model_name": "qwen2.5-coder:1.5b"
}"""
settings = Settings()

async with any_llm_client.get_client(settings.llm_model, ...) as client:
    ...

Combined with the environment variables from the previous section, this lets you keep the LLM model configuration and secrets separate.

Using clients directly

The recommended way to get an LLM client is to call any_llm_client.get_client(). This way, you can easily swap LLM models. If you prefer, you can use any_llm_client.OpenAIClient or any_llm_client.YandexGPTClient directly:

import os

import pydantic
import any_llm_client

config = any_llm_client.OpenAIConfig(
    url=pydantic.HttpUrl("https://api.openai.com/v1/chat/completions"),
    auth_token=os.environ["OPENAI_API_KEY"],
    model_name="gpt-4o-mini",
)

async with any_llm_client.OpenAIClient(config, ...) as client:
    ...

Errors

any_llm_client.LLMClient.request_llm_message() and any_llm_client.LLMClient.stream_llm_partial_messages() will raise any_llm_client.LLMError or any_llm_client.OutOfTokensOrSymbolsError when the LLM API responds with a failed HTTP status.
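
If you want to handle these failures gracefully, catch them yourself. A minimal sketch (safe_request is a hypothetical helper, not part of the library):

import any_llm_client


async def safe_request(client: any_llm_client.LLMClient, prompt: str) -> str | None:
    # Catch the more specific error first; both are raised
    # on failed HTTP statuses, as described above.
    try:
        return await client.request_llm_message(prompt)
    except any_llm_client.OutOfTokensOrSymbolsError:
        return None  # the prompt or completion exceeded the model's limits
    except any_llm_client.LLMError as error:
        raise RuntimeError("LLM request failed") from error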

Timeouts, proxy & other HTTP settings

Pass custom niquests kwargs to any_llm_client.get_client():

import urllib3

import any_llm_client


async with any_llm_client.get_client(
    ...,
    proxies={"https://api.openai.com": "http://localhost:8030"},
    timeout=urllib3.Timeout(total=10.0, connect=5.0),
) as client:
    ...

The default timeout is urllib3.Timeout(total=None, connect=5.0).

Retries

By default, requests are retried 3 times on HTTP status errors. You can change the retry behaviour by supplying the request_retry parameter:

async with any_llm_client.get_client(..., request_retry=any_llm_client.RequestRetryConfig(attempts=5, ...)) as client:
    ...

Passing extra data to LLM

You can pass provider-specific request parameters via the extra argument:

await client.request_llm_message("Hey, how's it going?", extra={"best_of": 3})
