A high-level Python SDK for Large Language Models with automatic tool execution, structured output support, multi-agent workflows, and evaluation data recording

GlueLLM

TL;DR: A high-level Python SDK for LLMs that handles the annoying stuff (tools, retries, structured output, batching) so you can ship features instead of glue code.

GlueLLM is opinionated in the “I’ve been burned by this in production” way. If you like sensible defaults, clear APIs, and fewer bespoke wrappers, you’ll feel at home.

Works great with Spiderweb

If you’re building RAG, you probably don’t just need LLM calls — you need crawling, extraction, chunking, validation, and storage too. That’s Spiderweb.

GlueLLM: LLM calls + tool execution + structured output + embeddings + batching
Spiderweb: documents/web → clean chunks → vector store → query

Tiny “together” example:

import asyncio
from gluellm import GlueLLM
from spiderweb import Spiderweb

async def main():
    async with Spiderweb(llm_client=GlueLLM()) as web:
        await web.crawl("https://example.com", ingest=True, save_to="./crawled")
        results = await web.query("What is this site about?", top_k=5)
        print(results.chunks[0]["content"][:200])

asyncio.run(main())

What is this?

GlueLLM is a high-level SDK that makes working with LLMs actually pleasant:

You call complete() or structured_complete() and get results.
Tools are plain Python functions.
Retries and error classification are built-in.
Batching and rate limiting are first-class.
Providers are unified via any-llm-sdk.
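
To make "retries and error classification" concrete, here is a minimal stdlib-only sketch of the general pattern: retry failures that look temporary, and let everything else fail fast. It is not GlueLLM's implementation; TransientError, PermanentError, call_with_retries, and make_flaky are hypothetical names invented for this illustration.

```python
import asyncio
import random


class TransientError(Exception):
    """Hypothetical: a failure worth retrying (timeout, rate limit)."""


class PermanentError(Exception):
    """Hypothetical: a failure that retrying cannot fix (bad request)."""


async def call_with_retries(fn, max_attempts=3, base_delay=0.01):
    """Await fn(), retrying with exponential backoff on TransientError only."""
    for attempt in range(1, max_attempts + 1):
        try:
            return await fn()
        # Only TransientError is caught; PermanentError propagates at once.
        except TransientError:
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error
            # Back off 1x, 2x, 4x... the base delay, plus a little jitter
            delay = base_delay * 2 ** (attempt - 1)
            await asyncio.sleep(delay + random.uniform(0, base_delay))


def make_flaky(failures):
    """Return an async function that fails `failures` times, then succeeds."""
    state = {"calls": 0}

    async def flaky():
        state["calls"] += 1
        if state["calls"] <= failures:
            raise TransientError("temporary failure")
        return f"ok after {state['calls']} calls"

    return flaky


print(asyncio.run(call_with_retries(make_flaky(2))))  # ok after 3 calls
```

Classifying errors up front keeps the retry loop small: the loop only needs to know which exception type is safe to retry.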

Why you might like it

Zero ceremony: minimal code to get real results
Tool execution loop: automatic tool calling orchestration
Structured output: Pydantic models, validated (including streaming: parse on final chunk)
Streaming: stream_complete() with optional structured output on the last chunk
Process status events: optional on_status callback for LLM/tool/stream progress
Provider-agnostic: one API for OpenAI, Anthropic, xAI, and others
Embeddings: same ergonomics + error handling
Batch processing: concurrency control, retry strategies, key pools
Observability hooks: logging + optional tracing
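
For a feel of what "concurrency control" in batch processing means, the sketch below caps the number of in-flight calls with asyncio.Semaphore. It is a generic pattern under stated assumptions, not GlueLLM's batch API; run_batch and fake_llm_call are hypothetical names, and fake_llm_call only simulates a request.

```python
import asyncio


async def run_batch(items, worker, max_concurrency=3):
    """Run worker(item) for every item, at most max_concurrency at a time."""
    semaphore = asyncio.Semaphore(max_concurrency)

    async def guarded(item):
        # Each task waits here until one of the slots is free
        async with semaphore:
            return await worker(item)

    # gather returns results in the same order as the inputs
    return await asyncio.gather(*(guarded(item) for item in items))


async def fake_llm_call(prompt):
    """Stand-in for a real LLM request (hypothetical)."""
    await asyncio.sleep(0.01)
    return prompt.upper()


results = asyncio.run(run_batch(["a", "b", "c", "d"], fake_llm_call, max_concurrency=2))
print(results)  # ['A', 'B', 'C', 'D']
```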

Why you might not

If you want a thin client that exposes every raw provider knob, GlueLLM isn’t trying to be that.
If you hate opinions, you’ll hate opinions (mine included).

Installation

# Using uv (recommended)
uv pip install gluellm

# From source (dev)
uv pip install -e ".[dev]"

Quick start

Simple completion

import asyncio
from gluellm.api import complete

async def main():
    result = await complete(
        user_message="What is the capital of France?",
        system_prompt="You are a helpful geography assistant.",
    )
    print(result.final_response)

asyncio.run(main())

Tool calling (tools are just functions)

import asyncio
from gluellm.api import complete

def get_weather(location: str, unit: str = "celsius") -> str:
    """Get the current weather for a location."""
    return f"Weather in {location}: 22°{unit[0].upper()}, sunny"

async def main():
    result = await complete(
        user_message="What's the weather in Tokyo and Paris?",
        system_prompt="Use get_weather for weather queries.",
        tools=[get_weather],
    )
    print(result.final_response)

asyncio.run(main())
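
If you are curious how a plain function can carry enough information to act as a tool, the sketch below derives a JSON-Schema-style description from the signature and docstring using only the standard library. This illustrates the general idea and is not how GlueLLM does it internally; describe_tool and the exact output shape are assumptions made for this example.

```python
import inspect

# Minimal mapping from Python annotations to JSON Schema type names
_JSON_TYPES = {str: "string", int: "integer", float: "number", bool: "boolean"}


def describe_tool(fn):
    """Build a JSON-Schema-style description of fn from its signature."""
    properties = {}
    required = []
    for name, param in inspect.signature(fn).parameters.items():
        # Unknown or missing annotations fall back to "string"
        properties[name] = {"type": _JSON_TYPES.get(param.annotation, "string")}
        # Parameters without a default must be supplied by the model
        if param.default is inspect.Parameter.empty:
            required.append(name)
    return {
        "name": fn.__name__,
        "description": inspect.getdoc(fn) or "",
        "parameters": {
            "type": "object",
            "properties": properties,
            "required": required,
        },
    }


def get_weather(location: str, unit: str = "celsius") -> str:
    """Get the current weather for a location."""
    return f"Weather in {location}: 22°{unit[0].upper()}, sunny"


schema = describe_tool(get_weather)
print(schema["name"], schema["parameters"]["required"])  # get_weather ['location']
```

This is why type hints and a docstring are worth writing on tool functions: they are the only description the model gets.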

Structured output

import asyncio
from pydantic import BaseModel, Field
from typing import Annotated

from gluellm.api import structured_complete

class PersonInfo(BaseModel):
    name: Annotated[str, Field(description="Full name")]
    age: Annotated[int, Field(description="Age in years")]
    city: Annotated[str, Field(description="City of residence")]

async def main():
    person = await structured_complete(
        user_message="Extract info: John Smith, 35, lives in Seattle",
        response_format=PersonInfo,
    )
    print(person.model_dump())

asyncio.run(main())

Streaming

Stream token-by-token with stream_complete(). When tools are enabled, the final response after tool runs is returned as one chunk (streaming resumes between tool rounds).

import asyncio
from gluellm import stream_complete

async def main():
    async for chunk in stream_complete("Tell me a short joke."):
        print(chunk.content, end="", flush=True)
        if chunk.done:
            print(f"\nTool calls: {chunk.tool_calls_made}")

asyncio.run(main())

Streaming + structured output: Pass response_format to get a parsed Pydantic instance on the final chunk (the stream is plain text; we parse when the stream ends).

import asyncio
from pydantic import BaseModel
from gluellm import stream_complete

class Answer(BaseModel):
    word: str

async def main():
    # async for must run inside a coroutine
    async for chunk in stream_complete(
        "Reply with JSON: {\"word\": \"hello\"}",
        response_format=Answer,
        tools=[],
    ):
        # structured_output is set only on the final chunk
        if chunk.done and chunk.structured_output:
            print(chunk.structured_output.word)  # hello

asyncio.run(main())
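
The "parse when the stream ends" idea can be shown without any LLM at all: partial JSON is not valid JSON, so the pieces are buffered and parsed once. This is a stdlib-only sketch, with a dataclass standing in for the Pydantic model; parse_final is a hypothetical helper, not part of GlueLLM.

```python
import json
from dataclasses import dataclass


@dataclass
class Answer:
    word: str


def parse_final(chunks):
    """Join streamed text pieces and parse the result as JSON once complete."""
    buffer = []
    for piece in chunks:
        # Each piece on its own may be an incomplete JSON fragment
        buffer.append(piece)
    data = json.loads("".join(buffer))
    return Answer(**data)


answer = parse_final(['{"wo', 'rd": "he', 'llo"}'])
print(answer.word)  # hello
```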

Process status events

Use the optional on_status callback to observe what’s happening (LLM call start/end, tool execution, stream start/chunk/end, completion). Handy for progress UIs or logging.

import asyncio
from gluellm import complete, ProcessEvent

def on_status(e: ProcessEvent) -> None:
    print(f"{e.kind}: {e.tool_name or e.iteration or ''}")

async def main():
    result = await complete(
        "What is 2+2?",
        on_status=on_status,
    )
    # Events: llm_call_start, llm_call_end, complete (and tool_call_* if tools run)
    print(result.final_response)

asyncio.run(main())

on_status is supported on complete(), stream_complete(), and structured_complete() (and the GlueLLM client methods).
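
A callback does not have to be a bare function. The sketch below uses a small callable object that tallies events by kind, which is handy for progress bars or metrics. It assumes on_status accepts any callable that takes one event, which the README does not state explicitly; FakeEvent is a hypothetical stand-in exposing only the fields used in the example above (kind, tool_name, iteration), not the real ProcessEvent.

```python
from collections import Counter
from dataclasses import dataclass
from typing import Optional


@dataclass
class FakeEvent:
    """Hypothetical stand-in for ProcessEvent, for illustration only."""
    kind: str
    tool_name: Optional[str] = None
    iteration: Optional[int] = None


class StatusCounter:
    """Callable that tallies status events by their kind."""

    def __init__(self):
        self.counts = Counter()

    def __call__(self, event):
        self.counts[event.kind] += 1


counter = StatusCounter()
# Simulate the event sequence listed for a simple completion
for event in [FakeEvent("llm_call_start"), FakeEvent("llm_call_end"), FakeEvent("complete")]:
    counter(event)

print(dict(counter.counts))  # {'llm_call_start': 1, 'llm_call_end': 1, 'complete': 1}
```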

Embeddings

import asyncio
from gluellm import embed

async def main():
    result = await embed("Hello, world!")
    print(result.dimension, result.tokens_used)

asyncio.run(main())
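
Once you have embedding vectors, the usual next step is comparing them. Here is a small stdlib-only cosine similarity sketch that works on plain lists of floats. How to get the raw vector out of the embed() result is not shown in this README, so the inputs below are made-up vectors rather than real embeddings.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / norm


# Made-up 3-dimensional vectors, for illustration only
print(cosine_similarity([1.0, 0.0, 0.0], [1.0, 0.0, 0.0]))  # 1.0
print(cosine_similarity([1.0, 0.0, 0.0], [0.0, 1.0, 0.0]))  # 0.0
```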

Configuration (the boring part)

Providers are configured via environment variables:

export OPENAI_API_KEY=sk-...
export ANTHROPIC_API_KEY=sk-ant-...
export XAI_API_KEY=xai-...
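
A quick way to see which of these keys your process can actually read is a small startup check. This is a minimal sketch using only os.environ; configured_providers and the PROVIDER_KEYS mapping are hypothetical helpers built from the three variable names above, not part of GlueLLM.

```python
import os

# Environment variable names taken from the export lines above
PROVIDER_KEYS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "xai": "XAI_API_KEY",
}


def configured_providers(env=None):
    """Return the providers whose API key variable is set and non-empty."""
    env = os.environ if env is None else env
    return [name for name, var in PROVIDER_KEYS.items() if env.get(var)]


print(configured_providers())  # depends on your environment
```

Passing a dict as env makes the check easy to test without touching the real environment.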

Models use provider:model strings:

openai:gpt-4o-mini
anthropic:claude-3-5-sonnet-20241022
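
If you need to pull the two halves of a provider:model string apart in your own code, str.partition does the job. This is a small sketch; split_model is a hypothetical helper, and GlueLLM may parse these strings differently.

```python
def split_model(spec):
    """Split 'provider:model' into (provider, model) at the first colon."""
    provider, sep, model = spec.partition(":")
    if not sep or not provider or not model:
        raise ValueError(f"expected 'provider:model', got {spec!r}")
    return provider, model


print(split_model("openai:gpt-4o-mini"))  # ('openai', 'gpt-4o-mini')
print(split_model("anthropic:claude-3-5-sonnet-20241022"))
```

Splitting at the first colon only means a model name that itself contains a colon stays intact.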

Docs (when you want the details)

GlueLLM keeps deeper docs in docs/ so the README stays readable.

More runnable examples live in examples/.

Contributing

PRs welcome. Please read CONTRIBUTING.md.

License

MIT — see LICENSE.

This version: 1.1.13 (released Mar 5, 2026)