Unified Python interface for OpenAI, Anthropic, Google, and Ollama LLMs

These details have not been verified by PyPI

Project links

Project description

LLMRing

Alias-first LLM service for Python. Map tasks to models, not code to model IDs. Supports OpenAI, Anthropic, Google, and Ollama with a unified interface.

Complies with source-of-truth v3.5

Highlights

Alias-first identity: Map semantic tasks to models via lockfile
Lockfile-based configuration: Version-controlled, reproducible model bindings
Multi-provider support: OpenAI, Anthropic, Google, Ollama
Profile support: Different configurations for prod/staging/dev
Registry integration: Automatic model capabilities and pricing from GitHub Pages
Local-first: Fully functional without backend services
Smart defaults: Auto-detects API keys and suggests appropriate models
Cost tracking: Automatic cost calculation based on registry pricing

Installation

# Basic installation
pip install llmring

# Or with uv
uv add llmring

# Development installation (from source)
uv pip install -e ".[dev]"

Requirements

Python 3.10+
API keys for the LLM providers you want to use

Quick Start

1) Initialize a lockfile

# Create a lockfile with auto-detected defaults based on available API keys
llmring lock init

# This creates llmring.lock with smart defaults based on your API keys:
# If OPENAI_API_KEY is set:
#   - long_context → openai:gpt-4-turbo-preview
#   - low_cost → openai:gpt-3.5-turbo
# If ANTHROPIC_API_KEY is set:
#   - deep → anthropic:claude-3-opus-20240229
#   - balanced → anthropic:claude-3-sonnet-20240229
# Always available:
#   - default → ollama:llama3

2) Bind aliases to models

# Bind an alias to a specific model
llmring bind summarizer ollama:llama3.3

# List all aliases
llmring aliases

3) Use aliases in code

import asyncio
from llmring import LLMRing, LLMRequest, Message

async def main():
    # Initialize service
    service = LLMRing()
    
    # Use an alias instead of hardcoding model names
    request = LLMRequest(
        messages=[Message(role="user", content="Summarize this text...")],
        model="summarizer"  # Uses the alias from lockfile
    )
    
    response = await service.chat(request)
    print(response.content)

asyncio.run(main())

4) Direct model usage (without aliases)

# You can still use provider:model format directly
request = LLMRequest(
    messages=[Message(role="user", content="Hello!")],
    model="openai:gpt-4o-mini"  # Direct model reference
)

Lockfile Configuration

The llmring.lock file is the authoritative configuration source:

version = "1.0"
default_profile = "default"

[profiles.default]
name = "default"

[[profiles.default.bindings]]
alias = "summarizer"
provider = "ollama"
model = "llama3.3"

[[profiles.default.bindings]]
alias = "deep"
provider = "anthropic"
model = "claude-3-opus"

[profiles.prod]
name = "prod"
# Production-specific bindings...

[profiles.dev]
name = "dev"
# Development-specific bindings...

Profiles

Switch between different configurations using profiles:

# Use a specific profile
llmring chat "Hello" --model summarizer --profile prod

# Or via environment variable
export LLMRING_PROFILE=prod
llmring chat "Hello" --model summarizer

Registry Integration

Track model changes and detect drift:

# Validate lockfile against current registry
llmring lock validate

# Update registry versions to latest
llmring lock bump-registry

CLI Reference

Lockfile Management

# Initialize lockfile with defaults
llmring lock init [--force]

# Validate lockfile bindings against registry
llmring lock validate

# Update pinned registry versions
llmring lock bump-registry

Alias Management

# Bind an alias to a model
llmring bind <alias> <provider:model> [--profile <profile>]

# Remove an alias
llmring unbind <alias> [--profile <profile>]

# List all aliases
llmring aliases [--profile <profile>]

Chat & Model Usage

# Send a chat message (supports aliases)
llmring chat "Your message" --model <alias_or_model> [options]
  --system <prompt>      # System prompt
  --temperature <float>  # Temperature (0.0-2.0)
  --max-tokens <int>     # Max tokens to generate
  --profile <profile>    # Profile for alias resolution
  --json                 # Output as JSON
  --verbose              # Show usage stats

# Show model information
llmring info <provider:model> [--json]

# List available models
llmring list [--provider <provider>]

# List configured providers
llmring providers [--json]

Provider Configuration

Set API keys via environment variables:

export OPENAI_API_KEY=sk-...
export ANTHROPIC_API_KEY=sk-ant-...
export GOOGLE_API_KEY=...  # or GEMINI_API_KEY
# Ollama doesn't require an API key (local)

Cost Tracking

Track costs automatically using registry pricing data:

# After any API call
response = await service.chat("summarizer", messages)

# Calculate cost from response
cost = await service.calculate_cost(response)

if cost:
    print(f"Cost: ${cost['total_cost']:.4f}")
    print(f"Tokens: {response.total_tokens}")

Advanced Usage

Constraints in Lockfile

Apply model constraints through the lockfile:

[[profiles.default.bindings]]
alias = "creative_writer"
provider = "openai"
model = "gpt-4"
constraints = { temperature = 0.9, max_tokens = 2000 }

[[profiles.default.bindings]]
alias = "code_reviewer"  
provider = "anthropic"
model = "claude-3-5-sonnet-20241022"
constraints = { temperature = 0.2 }

Convenience Method for Aliases

# Use the convenience method for simpler alias-based chat
async def main():
    service = LLMRing()
    
    # Direct alias usage without creating a request object
    response = await service.chat_with_alias(
        "summarizer",  # Alias or model string
        messages=[{"role": "user", "content": "Summarize quantum computing"}],
        temperature=0.5,
        max_tokens=200,
        profile="prod"  # Optional profile
    )
    print(response.content)

Programmatic Alias Management

# Manage aliases from code
service = LLMRing()

# Bind an alias
service.bind_alias("translator", "openai:gpt-4o", profile="default")

# List aliases
aliases = service.list_aliases(profile="default")
print(aliases)  # {'translator': 'openai:gpt-4o', ...}

# Resolve an alias
model = service.resolve_alias("translator")
print(model)  # 'openai:gpt-4o'

Working with Files and Images

from llmring.file_utils import create_image_content, analyze_image

# Analyze an image
image_content = create_image_content("path/to/image.png")
messages = [
    Message(role="user", content=[
        {"type": "text", "text": "What's in this image?"},
        image_content
    ])
]

response = await analyze_image(
    service, 
    "path/to/image.png",
    "Describe this image",
    model="openai:gpt-4o"  # Or use an alias
)

Custom System Prompts

messages = [
    Message(role="system", content="You are a helpful assistant."),
    Message(role="user", content="Hello!")
]

request = LLMRequest(
    messages=messages,
    model="summarizer",
    temperature=0.7,
    max_tokens=1000
)

Security

API Key Management

LLMRing never stores API keys in files. Keys are only read from environment variables:

export OPENAI_API_KEY=sk-...
export ANTHROPIC_API_KEY=sk-ant-...

Best practices:

Use .env files locally (never commit them)
Use secrets management in production (AWS Secrets Manager, Vault, etc.)
The lockfile is safe to commit - it contains no secrets

For technical details and security specifications, see docs/technical.md.

Architecture

LLMRing follows an alias-first, lockfile-based architecture:

Lockfile (llmring.lock): The authoritative configuration source containing alias→model bindings, profiles, and registry versions
Registry: Public model information hosted on GitHub Pages for drift detection
Service: Lightweight routing layer that resolves aliases and forwards to providers
Receipts: Optional Ed25519-signed receipts when connected to server/SaaS

The system is designed to be:

Local-first: Fully functional without backend services
Version-controlled: Lockfile can be committed for reproducible deployments
Drift-aware: Detects when models change between registry versions

License

MIT

Contributing

Contributions are welcome! Please read our contributing guidelines and submit pull requests to our repository.

Support

For issues and questions, please use the GitHub issue tracker.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.4.0

Jan 3, 2026

1.3.0

Nov 2, 2025

1.2.0

Oct 26, 2025

1.1.1

Oct 14, 2025

1.1.0

Sep 29, 2025

1.0.0

Sep 29, 2025

0.4.0

Sep 29, 2025

This version

0.3.0

Aug 20, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llmring-0.3.0.tar.gz (43.4 kB view details)

Uploaded Aug 20, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llmring-0.3.0-py3-none-any.whl (54.7 kB view details)

Uploaded Aug 20, 2025 Python 3

File details

Details for the file llmring-0.3.0.tar.gz.

File metadata

Download URL: llmring-0.3.0.tar.gz
Upload date: Aug 20, 2025
Size: 43.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.4

File hashes

Hashes for llmring-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`cacd2e525d59a81cc2c14eb3bf620586f83d78f0941c2193a46d6807adc88a15`
MD5	`9cfbdf58d23877710623c5de37c5308f`
BLAKE2b-256	`ccdb9150a02240ed5cec432cc69c20d67bd27bbb619f59de36e3f19028f0099e`

See more details on using hashes here.

File details

Details for the file llmring-0.3.0-py3-none-any.whl.

File metadata

Download URL: llmring-0.3.0-py3-none-any.whl
Upload date: Aug 20, 2025
Size: 54.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.4

File hashes

Hashes for llmring-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f87843166ac51ca94de8ad089c91125270e91d4aad542870dfb53e8ab905abf5`
MD5	`328272ba9e3db47da34c6f19ef7d04cf`
BLAKE2b-256	`3128c0bfd36d7801512253bc14b336a564100b1f2bf6d0c04cce6bf7322c5f25`

See more details on using hashes here.

llmring 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LLMRing

Highlights

Installation

Requirements

Quick Start

1) Initialize a lockfile

2) Bind aliases to models

3) Use aliases in code

4) Direct model usage (without aliases)

Lockfile Configuration

Profiles

Registry Integration

CLI Reference

Lockfile Management

Alias Management

Chat & Model Usage

Provider Configuration

Cost Tracking

Advanced Usage

Constraints in Lockfile

Convenience Method for Aliases

Programmatic Alias Management

Working with Files and Images

Custom System Prompts

Security

API Key Management

Architecture

License

Contributing

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes