Production-ready Python library for multi-provider LLM orchestration with intelligent routing

These details have not been verified by PyPI

Project links

Project description

JustLLMs

A production-ready Python library that simplifies working with multiple Large Language Model providers through intelligent routing, comprehensive analytics, and enterprise-grade features.

Why JustLLMs?

Managing multiple LLM providers is complex. You need to handle different APIs, optimize costs, monitor usage, and ensure reliability. JustLLMs solves these challenges by providing a unified interface that automatically routes requests to the best provider based on your criteria—whether that's cost, speed, or quality.

Installation

# Basic installation
pip install justllms

# With PDF export capabilities
pip install "justllms[pdf]"

# All optional dependencies (PDF export, Redis caching, advanced analytics)
pip install "justllms[all]"

Package size: 1.1MB | Lines of code: ~11K | Dependencies: Minimal production requirements

Quick Start

from justllms import JustLLM

# Initialize with your API keys
client = JustLLM({
    "providers": {
        "openai": {"api_key": "your-openai-key"},
        "google": {"api_key": "your-google-key"},
        "anthropic": {"api_key": "your-anthropic-key"}
    }
})

# Simple completion - automatically routes to best provider
response = client.completion.create(
    messages=[{"role": "user", "content": "Explain quantum computing briefly"}]
)
print(response.content)

Core Features

Multi-Provider Support

Connect to all major LLM providers with a single, consistent interface:

OpenAI (GPT-5, GPT-4, etc.) <yes, you can use GPT 5 :)>
Google (Gemini 2.5, Gemini 1.5 models)
Anthropic (Claude 3.5, Claude 3 models)
Azure OpenAI (with deployment mapping)
xAI Grok, DeepSeek, and more

# Switch between providers seamlessly
client = JustLLM({
    "providers": {
        "openai": {"api_key": "your-key"},
        "google": {"api_key": "your-key"},
        "anthropic": {"api_key": "your-key"}
    }
})

# Same interface, different providers automatically chosen
response1 = client.completion.create(
    messages=[{"role": "user", "content": "Explain AI"}],
    provider="openai"  # Force specific provider
)

response2 = client.completion.create(
    messages=[{"role": "user", "content": "Explain AI"}]
    # Auto-routes to best provider based on your strategy
)

Intelligent Routing

The game-changing feature that sets JustLLMs apart. Instead of manually choosing models, let our intelligent routing engine automatically select the optimal provider and model for each request based on your priorities.

Available Strategies

🆕 Cluster-Based Routing - AI-Powered Query Analysis Our most advanced routing strategy uses machine learning to analyze query semantics and route to the optimal model based on similarity to training data. Achieves +7% accuracy improvement and -27% cost reduction compared to single-model approaches.

# Cluster-based routing (recommended for production)
client = JustLLM({
    "providers": {...},
    "routing": {"strategy": "cluster"}
})

Based on research from "Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing" - AvengersPro framework

Traditional Routing Strategies

# Cost-optimized: Always picks the cheapest option
client = JustLLM({
    "providers": {...},
    "routing": {"strategy": "cost"}
})

# Speed-optimized: Prioritizes fastest response times
client = JustLLM({
    "providers": {...},
    "routing": {"strategy": "latency"}
})

# Quality-optimized: Uses the best models for complex tasks
client = JustLLM({
    "providers": {...},
    "routing": {"strategy": "quality"}
})

# Task-based: Automatically detects query type and routes accordingly
client = JustLLM({
    "providers": {...},
    "routing": {"strategy": "task"}
})

How Cluster Routing Works

Query Analysis: Your request is embedded using Qwen3-Embedding-0.6B
Cluster Matching: Finds the most similar cluster from pre-trained data
Model Selection: Routes to the best-performing model for that cluster
Fallback: Falls back to quality-based routing if needed

Result: Up to 60% cost reduction while improving accuracy, with automatic failover to backup providers.

Real-time Streaming

Full streaming support with proper token handling across all providers:

stream = client.completion.create(
    messages=[{"role": "user", "content": "Write a short story"}],
    stream=True
)

for chunk in stream:
    print(chunk.content, end="", flush=True)

Conversation Management

Built-in conversation state management with context preservation:

# Create client
conversation = Conversation(client=client)

# Set system message
conversation.add_system_message("You are a helpful math tutor. Keep answers concise.")

# Turn 1
response = conversation.send("What is 15 + 25?")

# Turn 2 - Context is automatically preserved
response = conversation.send("Now divide that by 8")

# Get conversation stats
history = conversation.get_history()

Conversation Features:

Auto-save: Persist conversations automatically
Context management: Smart context window handling
Export/Import: JSON, Markdown, and TXT formats
Analytics: Track usage, costs, and performance per conversation
Search: Find conversations by content or metadata

Smart Caching

Intelligent response caching that dramatically reduces costs and improves response times:

client = JustLLM({
    "providers": {...},
    "caching": {
        "enabled": True,
        "ttl": 3600,  # 1 hour
        "max_size": 1000
    }
})

# First call - cache miss
response1 = client.completion.create(
    messages=[{"role": "user", "content": "What is AI?"}]
)  # ~2 seconds, full cost

# Second call - cache hit
response2 = client.completion.create(
    messages=[{"role": "user", "content": "What is AI?"}]
)  # ~50ms, no cost

Enterprise Analytics

Comprehensive usage tracking and cost analysis that gives you complete visibility into your LLM operations. Unlike other solutions that require external tools, JustLLMs provides built-in analytics that finance and engineering teams actually need.

What You Get

Cross-provider metrics: Compare performance across providers
Cost tracking: Detailed cost analysis per model/provider
Performance insights: Latency, throughput, success rates
Export capabilities: CSV, PDF with charts
Time series analysis: Usage patterns over time
Top models/providers: Usage and cost rankings

# Generate detailed reports
report = client.analytics.generate_report()
print(f"Total requests: {report.cross_provider_metrics.total_requests}")
print(f"Total cost: ${report.cross_provider_metrics.total_cost:.2f}")
print(f"Fastest provider: {report.cross_provider_metrics.fastest_provider}")
print(f"Cost per request: ${report.cross_provider_metrics.avg_cost_per_request:.4f}")

# Get granular insights
print(f"Cache hit rate: {report.performance_metrics.cache_hit_rate:.1f}%")
print(f"Token efficiency: {report.optimization_suggestions.token_savings:.1f}%")

# Export reports for finance teams
from justllms.analytics.reports import CSVExporter, PDFExporter
csv_exporter = CSVExporter()
csv_exporter.export(report, "monthly_llm_costs.csv")

pdf_exporter = PDFExporter(include_charts=True)
pdf_exporter.export(report, "executive_summary.pdf")

Business Impact: Teams typically save 40-70% on LLM costs within the first month by identifying usage patterns and optimizing model selection.

Unified LLM Interface

Streamlined access to multiple LLM providers with intelligent routing, comprehensive analytics, and enterprise-grade features for production deployments.

Business Rule Validation

Enterprise-grade content filtering and compliance built for regulated industries. Ensure your LLM applications meet security, privacy, and business requirements without custom development.

Compliance Features

PII Detection - Automatically detect and handle social security numbers, credit cards, phone numbers
Content Filtering - Block inappropriate content, profanity, or sensitive topics
Custom Business Rules - Define your own validation logic with regex patterns or custom functions
Audit Trail - Complete logging of all validation actions for compliance reporting

from justllms.validation import ValidationConfig, BusinessRule, RuleType, ValidationAction

client = JustLLM({
    "providers": {...},
    "validation": ValidationConfig(
        enabled=True,
        business_rules=[
            # Block sensitive data patterns
            BusinessRule(
                name="no_ssn",
                type=RuleType.PATTERNS,
                pattern=r"\\b\\d{3}-\\d{2}-\\d{4}\\b",
                action=ValidationAction.BLOCK,
                message="SSN detected - request blocked for privacy"
            ),
            # Content filtering
            BusinessRule(
                name="professional_content",
                type=RuleType.CONTENT_FILTER,
                categories=["hate", "violence", "adult"],
                action=ValidationAction.SANITIZE
            ),
            # Custom business logic
            BusinessRule(
                name="company_policy",
                type=RuleType.CUSTOM,
                validator=lambda content: "competitor" not in content.lower(),
                action=ValidationAction.WARN
            )
        ],
        # Compliance presets
        compliance_mode="GDPR",  # or "HIPAA", "PCI_DSS"
        audit_logging=True
    )
})

# All requests are automatically validated
response = client.completion.create(
    messages=[{"role": "user", "content": "My SSN is 123-45-6789"}]
)
# This request would be blocked and logged for compliance

Regulatory Compliance: Built-in support for major compliance frameworks saves months of custom security development.

Advanced Usage

Async Operations

Full async/await support for high-performance applications:

import asyncio

async def process_batch():
    tasks = []
    for prompt in prompts:
        task = client.completion.acreate(
            messages=[{"role": "user", "content": prompt}]
        )
        tasks.append(task)
    
    responses = await asyncio.gather(*tasks)
    return responses

Error Handling & Reliability

Automatic retries and fallback providers ensure high availability:

client = JustLLM({
    "providers": {...},
    "retry": {
        "max_attempts": 3,
        "backoff_factor": 2,
        "retry_on": ["timeout", "rate_limit", "server_error"]
    }
})

# Automatically retries on failures
try:
    response = client.completion.create(
        messages=[{"role": "user", "content": "Hello"}],
        provider="invalid-provider"  # Will fail and retry
    )
except Exception as e:
    print(f"All retries failed: {e}")

Configuration Management

Flexible configuration with environment variable support:

# Environment-based config
import os
client = JustLLM({
    "providers": {
        "openai": {"api_key": os.getenv("OPENAI_API_KEY")},
        "azure_openai": {
            "api_key": os.getenv("AZURE_OPENAI_KEY"),
            "resource_name": os.getenv("AZURE_RESOURCE_NAME"),
            "api_version": "2024-12-01-preview"
        }
    }
})

# File-based config
import yaml
with open("config.yaml") as f:
    config = yaml.safe_load(f)
client = JustLLM(config)

🏆 Comparison with Alternatives

Feature	JustLLMs	LangChain	LiteLLM	OpenAI SDK	Haystack
Package Size	1.1MB	~50MB	~5MB	~1MB	~20MB
Setup Complexity	Simple config	Complex chains	Medium	Simple	Complex
Multi-Provider	✅ 6+ providers	✅ Many integrations	✅ 100+ providers	❌ OpenAI only	✅ Limited LLMs
Intelligent Routing	✅ Cost/speed/quality	❌ Manual only	⚠️ Basic routing	❌ None	❌ Pipeline-based
Built-in Analytics	✅ Enterprise-grade	❌ External tools needed	⚠️ Basic metrics	❌ None	⚠️ Pipeline metrics
Conversation Management	✅ Full lifecycle	⚠️ Memory components	❌ None	❌ Manual handling	✅ Dialog systems
Business Rules	✅ Content validation	❌ Custom implementation	❌ None	❌ None	⚠️ Custom filters
Cost Optimization	✅ Automatic routing	❌ Manual optimization	⚠️ Basic cost tracking	❌ None	❌ None
Streaming Support	✅ All providers	✅ Provider-dependent	✅ Most providers	✅ OpenAI only	⚠️ Limited
Production Ready	✅ Out of the box	⚠️ Requires setup	✅ Minimal setup	⚠️ Basic features	✅ Complex setup
Caching	✅ Multi-backend	⚠️ Custom implementation	✅ Basic caching	❌ None	✅ Document stores

Enterprise Configuration

For production deployments with advanced features:

enterprise_config = {
    "providers": {
        "azure_openai": {
            "api_key": os.getenv("AZURE_OPENAI_KEY"),
            "resource_name": "my-enterprise-resource",
            "deployment_mapping": {
                "gpt-4": "my-gpt4-deployment",
                "gpt-3.5-turbo": "my-gpt35-deployment"
            }
        },
        "anthropic": {"api_key": os.getenv("ANTHROPIC_KEY")},
        "google": {"api_key": os.getenv("GOOGLE_KEY")}
    },
    "routing": {
        "strategy": "cost",
        "fallback_provider": "azure_openai",
        "fallback_model": "gpt-3.5-turbo"
    },
    "validation": {
        "enabled": True,
        "business_rules": [
            # PII detection, content filtering, compliance rules
        ]
    },
    "analytics": {
        "enabled": True,
        "track_usage": True,
        "track_performance": True
    },
    "caching": {
        "enabled": True,
        "backend": "redis",
        "ttl": 3600
    },
    "conversations": {
        "backend": "disk",
        "auto_save": True,
        "auto_title": True,
        "max_context_tokens": 8000
    }
}

client = JustLLM(enterprise_config)

License

MIT License - see LICENSE file for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.1.8

Oct 8, 2025

2.1.7

Oct 3, 2025

2.1.6

Oct 3, 2025

2.1.5

Oct 3, 2025

2.0.5

Oct 2, 2025

2.0.4

Oct 1, 2025

2.0.3

Oct 1, 2025

2.0.2

Oct 1, 2025

2.0.1

Sep 11, 2025

2.0.0

Sep 6, 2025

This version

1.4.0

Aug 26, 2025

1.3.0

Aug 11, 2025

1.2.0

Aug 10, 2025

1.1.0

Aug 10, 2025

1.0.1

Aug 8, 2025

1.0.0

Aug 8, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

justllms-1.4.0.tar.gz (162.3 kB view details)

Uploaded Aug 26, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

justllms-1.4.0-py3-none-any.whl (176.1 kB view details)

Uploaded Aug 26, 2025 Python 3

File details

Details for the file justllms-1.4.0.tar.gz.

File metadata

Download URL: justllms-1.4.0.tar.gz
Upload date: Aug 26, 2025
Size: 162.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for justllms-1.4.0.tar.gz
Algorithm	Hash digest
SHA256	`b38ef942640fa7fe4a63fb9f21ffded116971d05f386643f4930a3ac41482a55`
MD5	`bd8e2df965ba4d6ed62f1f0817646933`
BLAKE2b-256	`e8f86829bc554dbb94ca8406f4568309fa8cbf8f4933bdb88c1e143fd2657bf4`

See more details on using hashes here.

File details

Details for the file justllms-1.4.0-py3-none-any.whl.

File metadata

Download URL: justllms-1.4.0-py3-none-any.whl
Upload date: Aug 26, 2025
Size: 176.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for justllms-1.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`90c6a70c876d2b7b2d4a22c2f7fb82c60b473cec1a37b2c45fb290919b4b83db`
MD5	`1f5c089d5ab8508f193532a0b86e603f`
BLAKE2b-256	`7bde8b5cc7ded312fff887873d17e8b9735734384c89872e21c942317cb85843`

See more details on using hashes here.

justllms 1.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

JustLLMs

Why JustLLMs?

Installation

Quick Start

Core Features

Multi-Provider Support

Intelligent Routing

Available Strategies

How Cluster Routing Works

Real-time Streaming

Conversation Management

Smart Caching

Enterprise Analytics

What You Get

Unified LLM Interface

Business Rule Validation

Compliance Features

Advanced Usage

Async Operations

Error Handling & Reliability

Configuration Management

🏆 Comparison with Alternatives

Enterprise Configuration

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes