# majordomo-llm
A unified Python interface for multiple LLM providers with automatic cost tracking, retry logic, and structured output support.
## Features
- Unified API - Same interface for OpenAI, Anthropic (Claude), Google Gemini, DeepSeek, and Cohere
- Cost Tracking - Automatic calculation of input/output token costs per request
- Structured Outputs - Native support for Pydantic models as response schemas
- Automatic Retries - Built-in exponential backoff retry logic using tenacity
- Automatic Fallback - Cascade across providers with `LLMCascade` for resilience
- Request Logging - Optional async logging to PostgreSQL/MySQL with S3 storage for request/response bodies
- Async First - Fully async/await compatible for high-performance applications
- Type Safe - Complete type annotations and `py.typed` marker for IDE support
## Installation
```bash
pip install majordomo-llm
```
Or with uv:
```bash
uv add majordomo-llm
```
### Optional: Request Logging
To enable request logging to PostgreSQL, MySQL, or S3:
```bash
pip install majordomo-llm[logging]
```
## Quick Start
### Basic Text Response
```python
import asyncio

from majordomo_llm import get_llm_instance

async def main():
    # Create an LLM instance
    llm = get_llm_instance("anthropic", "claude-sonnet-4-20250514")

    # Get a response
    response = await llm.get_response(
        user_prompt="What is the capital of France?",
        system_prompt="You are a helpful geography assistant.",
    )

    print(response.content)
    print(f"Tokens: {response.input_tokens} in, {response.output_tokens} out")
    print(f"Cost: ${response.total_cost:.6f}")

asyncio.run(main())
```
### JSON Response
```python
response = await llm.get_json_response(
    user_prompt="List the top 3 largest countries by area as JSON",
    system_prompt="Respond with valid JSON only.",
)

# response.content is a parsed Python dict
for country in response.content["countries"]:
    print(country["name"])
```
### Structured Output with Pydantic
```python
from pydantic import BaseModel

class CountryInfo(BaseModel):
    name: str
    capital: str
    population: int
    area_km2: float

response = await llm.get_structured_json_response(
    response_model=CountryInfo,
    user_prompt="Give me information about Japan",
)

# response.content is a validated CountryInfo instance
country = response.content
print(f"{country.name}: {country.capital}, pop. {country.population:,}")
```
## Configuration
### Environment Variables
Set API keys for the providers you want to use:
```bash
# OpenAI
export OPENAI_API_KEY="sk-..."

# Anthropic (Claude)
export ANTHROPIC_API_KEY="sk-ant-..."

# Google Gemini
export GEMINI_API_KEY="..."

# DeepSeek
export DEEPSEEK_API_KEY="sk-..."

# Cohere
export CO_API_KEY="..."
```
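Only the providers whose keys are set are usable, so a quick pre-flight check can be handy. A minimal sketch (the `configured_providers` helper is ours, not part of the library):

```python
import os

# Provider name -> environment variable, matching the list above.
PROVIDER_ENV_VARS = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "gemini": "GEMINI_API_KEY",
    "deepseek": "DEEPSEEK_API_KEY",
    "cohere": "CO_API_KEY",
}

def configured_providers(env=os.environ):
    """Return the provider names whose API key is set and non-empty."""
    return [name for name, var in PROVIDER_ENV_VARS.items() if env.get(var)]
```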
### Available Models
#### OpenAI

`gpt-5`, `gpt-5-mini`, `gpt-5-nano`, `gpt-4o`, `gpt-4.1`, `gpt-4.1-mini`, `gpt-4.1-nano`

#### Anthropic

`claude-sonnet-4-5-20250929`, `claude-opus-4-1-20250805`, `claude-opus-4-20250514`, `claude-sonnet-4-20250514`, `claude-3-7-sonnet-latest`, `claude-3-5-haiku-latest`

#### Gemini

`gemini-2.5-flash`, `gemini-2.5-flash-lite`, `gemini-2.0-flash`, `gemini-2.0-flash-lite`

#### DeepSeek

`deepseek-chat`, `deepseek-reasoner`

#### Cohere

`command-a-03-2025`, `command-r-plus-08-2024`, `command-r-08-2024`, `command-r7b-12-2024`
## API Reference
### Factory Functions
#### `get_llm_instance(provider: str, model: str) -> LLM`

Create an LLM instance for the specified provider and model.

```python
from majordomo_llm import get_llm_instance

llm = get_llm_instance("openai", "gpt-4o")
```
### LLM Methods
All LLM instances support these async methods:
#### `get_response(user_prompt, system_prompt=None, temperature=0.3, top_p=1.0) -> LLMResponse`

Get a plain text response.

#### `get_json_response(user_prompt, system_prompt=None, temperature=0.3, top_p=1.0) -> LLMJSONResponse`

Get a JSON response (automatically parsed).

#### `get_structured_json_response(response_model, user_prompt, system_prompt=None, temperature=0.3, top_p=1.0) -> LLMStructuredResponse`

Get a response validated against a Pydantic model.
### Response Objects
All response objects include usage metrics:
| Field | Type | Description |
|---|---|---|
| `content` | `str` / `dict` / `BaseModel` | The response content |
| `input_tokens` | `int` | Number of input tokens |
| `output_tokens` | `int` | Number of output tokens |
| `cached_tokens` | `int` | Number of cached tokens (if applicable) |
| `input_cost` | `float` | Cost for input tokens (USD) |
| `output_cost` | `float` | Cost for output tokens (USD) |
| `total_cost` | `float` | Total cost (USD) |
| `response_time` | `float` | Response time in seconds |
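Because every response carries its own usage fields, tallying spend across many calls is a matter of summing them. A minimal sketch (the `CostMeter` helper is ours, not part of the library):

```python
from dataclasses import dataclass

@dataclass
class CostMeter:
    """Accumulate usage metrics across many responses."""
    input_tokens: int = 0
    output_tokens: int = 0
    total_cost: float = 0.0

    def add(self, response):
        # Works with any object exposing the fields in the table above.
        self.input_tokens += response.input_tokens
        self.output_tokens += response.output_tokens
        self.total_cost += response.total_cost
```

Call `meter.add(response)` after each request and read `meter.total_cost` at the end of a batch.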
## Advanced Usage
### Automatic Fallback with `LLMCascade`
Use `LLMCascade` for automatic failover between providers:
```python
from majordomo_llm import LLMCascade

# Providers are tried in order - first is primary, rest are fallbacks
cascade = LLMCascade([
    ("anthropic", "claude-sonnet-4-20250514"),  # Primary
    ("openai", "gpt-4o"),                       # First fallback
    ("gemini", "gemini-2.5-flash"),             # Last resort
])

# If Anthropic fails, automatically tries OpenAI, then Gemini
response = await cascade.get_response("Hello!")
```
All three response methods (`get_response`, `get_json_response`, `get_structured_json_response`) support automatic fallback.
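Conceptually, the fallback behavior is "try each provider in order and return the first success". A simplified sketch of that idea (not the library's actual implementation):

```python
async def first_success(attempts):
    """Await each zero-argument async callable in order.

    Returns the first successful result; re-raises the last error if
    every attempt fails (a simplified model of what a cascade does
    across providers).
    """
    last_error = None
    for attempt in attempts:
        try:
            return await attempt()
        except Exception as exc:  # a real cascade would narrow this to provider errors
            last_error = exc
    raise last_error
```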
### Direct Provider Access
You can also instantiate providers directly for more control:
```python
from majordomo_llm import Anthropic

llm = Anthropic(
    model="claude-sonnet-4-20250514",
    input_cost=3.0,    # per million tokens
    output_cost=15.0,  # per million tokens
)
```
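The `input_cost` and `output_cost` arguments are expressed in USD per million tokens, so a request's dollar cost is simple arithmetic. A small illustration (the `request_cost` helper is ours; the rates match the example above):

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_rate: float = 3.0, output_rate: float = 15.0) -> float:
    """Dollar cost of one request, with rates in USD per million tokens."""
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000

# 1,000 input tokens and 500 output tokens at the rates above cost $0.0105
```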
### Web Search (Anthropic)
Enable web search for supported Claude models:
```python
from majordomo_llm.providers.anthropic import Anthropic

llm = Anthropic(
    model="claude-sonnet-4-5-20250929",
    input_cost=3.0,
    output_cost=15.0,
    use_web_search=True,
)
```
### Request Logging
Log all LLM requests asynchronously to a database with optional S3 storage for request/response bodies. Logging is fire-and-forget and does not block your main request flow.
```python
import asyncio

from majordomo_llm import get_llm_instance
from majordomo_llm.logging import LoggingLLM, PostgresAdapter, S3Adapter

async def main():
    # Create your LLM instance
    llm = get_llm_instance("anthropic", "claude-sonnet-4-20250514")

    # Set up database adapter (PostgreSQL or MySQL)
    db = await PostgresAdapter.create(
        host="localhost",
        port=5432,
        database="llm_logs",
        user="postgres",
        password="password",
    )

    # Optional: Set up S3 for storing request/response bodies
    storage = await S3Adapter.create(
        bucket="my-llm-logs",
        prefix="requests",  # optional, defaults to "llm-logs"
    )

    # Wrap your LLM with logging
    logged_llm = LoggingLLM(llm, db, storage)

    # Use as normal - all requests are logged automatically
    response = await logged_llm.get_response("Hello!")

    # Don't forget to close connections when done
    await logged_llm.close()

asyncio.run(main())
```
### Database Schema
Create the logging table using the included schema:
```sql
CREATE TABLE IF NOT EXISTS llm_requests (
    request_id VARCHAR(36) PRIMARY KEY,
    provider VARCHAR(50) NOT NULL,
    model VARCHAR(100) NOT NULL,
    timestamp TIMESTAMP NOT NULL,
    response_time FLOAT,
    input_tokens INTEGER,
    output_tokens INTEGER,
    cached_tokens INTEGER,
    input_cost DECIMAL(10, 8),
    output_cost DECIMAL(10, 8),
    total_cost DECIMAL(10, 8),
    s3_request_key VARCHAR(255),
    s3_response_key VARCHAR(255),
    status VARCHAR(20) NOT NULL,
    error_message TEXT
);
```
### Available Adapters

- `PostgresAdapter` - PostgreSQL via asyncpg
- `MySQLAdapter` - MySQL via aiomysql
- `S3Adapter` - AWS S3 via aioboto3
## Development

### Setup

```bash
git clone https://github.com/superset-studio/majordomo-llm.git
cd majordomo-llm
uv sync --all-extras
```

### Running Tests

```bash
uv run pytest
```

### Type Checking

```bash
uv run mypy src/majordomo_llm
```

### Linting

```bash
uv run ruff check src/majordomo_llm
```
## Contributing
Contributions are welcome! Please see CONTRIBUTING.md for guidelines.
## License
This project is licensed under the MIT License - see the LICENSE file for details.