
Python utility for using LLM API models.

Project description

lm_deluge

lm_deluge is a lightweight helper library for talking to large language model APIs. It wraps several providers under a single interface, handles rate limiting, and exposes a few useful utilities for common NLP tasks.

Features

  • Unified client – send prompts to OpenAI‑compatible models, Anthropic, Cohere, and Vertex‑hosted Claude models through the same interface.
  • Async or sync – process prompts concurrently with process_prompts_async or run them synchronously with process_prompts_sync.
  • Spray across providers – configure multiple model names with weighting so requests are distributed across different providers.
  • Caching – optional LevelDB, SQLite or custom caches to avoid duplicate calls.
  • Embeddings and reranking – helper functions for embedding text and reranking documents via Cohere/OpenAI endpoints.
  • Built‑in tools – simple extract, translate and score_llm helpers for common patterns.

Installation

pip install lm_deluge

The package relies on environment variables for API keys. Typical variables include OPENAI_API_KEY, ANTHROPIC_API_KEY, COHERE_API_KEY, META_API_KEY (for Llama) and GOOGLE_APPLICATION_CREDENTIALS for Vertex.
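Because keys are read from the environment, a quick preflight check can save a confusing authentication error later. A minimal sketch using the variable names listed above (the helper itself is illustrative, not part of lm_deluge):

```python
import os

# API keys read from the environment, per the list above.
REQUIRED_KEYS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "cohere": "COHERE_API_KEY",
}

def missing_keys(providers):
    """Return the env vars that are unset for the providers you plan to use."""
    return [
        REQUIRED_KEYS[p]
        for p in providers
        if not os.environ.get(REQUIRED_KEYS[p])
    ]
```

Run it once at startup and fail fast if the list is non-empty.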

Quickstart

from lm_deluge import LLMClient

client = LLMClient.basic(
    model=["gpt-4o-mini"],    # any model id from lm_deluge.models.registry
    temperature=0.2,
    max_new_tokens=256,
)

resp = client.process_prompts_sync(["Hello, world!"])  # returns list[APIResponse]
print(resp[0].completion)

Asynchronous usage

import asyncio

# Reuses the `client` created in the Quickstart above.
async def main():
    responses = await client.process_prompts_async(
        ["an async call"],
        return_completions_only=True,
    )
    print(responses[0])

asyncio.run(main())

Distributing requests across models

You can provide multiple model_names and optional model_weights when creating an LLMClient. Each prompt will be sent to one of the models based on those weights.

client = LLMClient(
    model_names=["gpt-4o-mini", "claude-haiku-anthropic"],
    model_weights="rate_limit",        # or a list like [0.7, 0.3]
    max_requests_per_minute=5000,
    max_tokens_per_minute=1_000_000,
    max_concurrent_requests=100,
)
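How explicit weights map prompts to models can be pictured as weighted sampling. The sketch below is purely illustrative of that idea, using the weight list from the example above; it is not lm_deluge's actual scheduling code:

```python
import random

def pick_model(model_names, model_weights):
    """Choose one model per prompt, proportional to the given weights."""
    return random.choices(model_names, weights=model_weights, k=1)[0]

models = ["gpt-4o-mini", "claude-haiku-anthropic"]
weights = [0.7, 0.3]

counts = {m: 0 for m in models}
for _ in range(10_000):
    counts[pick_model(models, weights)] += 1
# counts ends up with roughly a 70% / 30% split across the two models.
```

With `model_weights="rate_limit"`, the library derives the weights for you instead of taking a fixed list.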

Provider specific notes

  • OpenAI and compatible providers – set OPENAI_API_KEY. Model ids in the registry include OpenAI models as well as Meta Llama, Grok and many others that expose OpenAI style APIs.
  • Anthropic – set ANTHROPIC_API_KEY. Use model ids such as claude-haiku-anthropic or claude-sonnet-anthropic.
  • Cohere – set COHERE_API_KEY. Models like command-r are available.
  • Vertex Claude – set GOOGLE_APPLICATION_CREDENTIALS and PROJECT_ID. Use a model id such as claude-sonnet-vertex.

The models.py file lists every supported model and the required environment variable.

Built‑in tools

The lm_deluge.llm_tools package exposes a few helper functions:

  • extract – structure text or images into a Pydantic model based on a schema.
  • translate – translate a list of strings to English if needed.
  • score_llm – simple yes/no style scoring with optional log probability output.

Embeddings (embed.embed_parallel_async) and document reranking (rerank.rerank_parallel_async) are also provided.
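Both helpers follow the usual asyncio fan-out pattern: one task per input, gathered in order. A generic sketch of that pattern with a placeholder coroutine (not the library's real API or a real embeddings call):

```python
import asyncio

async def embed_one(text: str) -> list[float]:
    # Placeholder: a real implementation would call a provider's
    # embeddings endpoint here.
    await asyncio.sleep(0)
    return [float(len(text))]

async def embed_parallel(texts: list[str]) -> list[list[float]]:
    # Fan out one task per text; gather preserves input order.
    return await asyncio.gather(*(embed_one(t) for t in texts))

vectors = asyncio.run(embed_parallel(["a", "bb", "ccc"]))
```

The library's versions additionally handle batching, retries, and rate limits for you.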

Caching results

lm_deluge.cache includes LevelDB, SQLite and custom dictionary based caches. Pass an instance via LLMClient(..., cache=my_cache) and previously seen prompts will not be re‑sent.
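The custom dictionary-based option boils down to a prompt-to-response mapping. A minimal sketch of that idea, keyed on a hash of the prompt text (the `get`/`put` method names here are assumptions for illustration, not lm_deluge's exact cache interface):

```python
import hashlib

class DictCache:
    """Toy in-memory cache keyed on a SHA-256 hash of the prompt text."""

    def __init__(self):
        self._store = {}

    @staticmethod
    def _key(prompt: str) -> str:
        return hashlib.sha256(prompt.encode("utf-8")).hexdigest()

    def get(self, prompt: str):
        # Returns None on a cache miss.
        return self._store.get(self._key(prompt))

    def put(self, prompt: str, response: str) -> None:
        self._store[self._key(prompt)] = response

cache = DictCache()
cache.put("Hello, world!", "Hi!")
```

The LevelDB and SQLite backends follow the same get/put shape but persist entries across runs.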

Development notes

Models and costs are defined in src/lm_deluge/models.py. Conversations are built using the Conversation and Message helpers in src/lm_deluge/prompt.py, which also support images.


Download files

Download the file for your platform.

Source Distribution

lm_deluge-0.0.4.tar.gz (50.6 kB)

Built Distribution

lm_deluge-0.0.4-py3-none-any.whl (63.0 kB)

File details

Details for the file lm_deluge-0.0.4.tar.gz.

File metadata

  • Download URL: lm_deluge-0.0.4.tar.gz
  • Upload date:
  • Size: 50.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for lm_deluge-0.0.4.tar.gz:

  • SHA256: 544a96376bc4307927895e9ad71cee99d0a20f260365e23e6a65311871dd2ac9
  • MD5: 5faa1eaaa2288a6d6e052a355b9a222f
  • BLAKE2b-256: ce61ad6fc777e989ee78dce8f77ea035c1ecb42e1a1864ac834cf2ad68e0ca18

File details

Details for the file lm_deluge-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: lm_deluge-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 63.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for lm_deluge-0.0.4-py3-none-any.whl:

  • SHA256: 161ebb65a7dcff48219e11c3baf2cb706cb99349f087eb3f90cce052f95bee71
  • MD5: 0034d5d2c2dccd0a435c2c5c0bcefcd8
  • BLAKE2b-256: a1f49d36df66e7f57319a408971936487c2bde36688f525df53830c1b0a50694
