A unified interface for multiple LLM providers with image generation, speech-to-text, and function calling (OpenAI, Anthropic, Gemini, VertexAI, Ollama + DALL-E, Replicate, Whisper, Google Speech, Tools)

LLM Provider Factory

A unified, extensible Python library for interacting with multiple Large Language Model (LLM) providers through a single, consistent interface. It supports OpenAI, Anthropic Claude, Google Gemini, VertexAI, and Ollama (local LLMs).

🌟 Features

  • Unified Interface: Single API for multiple LLM providers
  • Cloud & Local LLMs: Support for both cloud-based and local LLM providers
  • Image Generation: DALL-E and Replicate image generation support
  • Speech-to-Text: OpenAI Whisper and Google Cloud Speech-to-Text support
  • Tools / Function Calling: Unified tools API (OpenAI, Anthropic, Gemini-ready)
  • Async Support: Full async/await support for better performance
  • Streaming: Real-time streaming responses from all providers
  • Type Safety: Complete type hints and Pydantic models
  • Error Handling: Comprehensive error handling with specific exceptions
  • Configuration Management: Flexible configuration system
  • Extensible: Easy to add new providers
  • Testing: Full test coverage with mocking support

🔌 Supported Providers

Text Generation

| Provider | Models | Features | Type |
| --- | --- | --- | --- |
| OpenAI | GPT-3.5, GPT-4, GPT-4o | Generate, Stream, Conversation | Cloud |
| Anthropic | Claude-3 (Haiku, Sonnet, Opus) | Generate, Stream, Conversation | Cloud |
| Google Gemini | Gemini Pro, Gemini Flash | Generate, Stream, Conversation | Cloud |
| VertexAI | Mistral, Gemini | Generate, Conversation | Cloud |
| Ollama | Llama, CodeLlama, Mistral, etc. | Generate, Stream, Conversation | Local |

Image Generation

| Provider | Models | Features | Type |
| --- | --- | --- | --- |
| OpenAI | DALL-E 2, DALL-E 3 | Text-to-Image, HD Quality, Style Control | Cloud |
| Replicate | Stable Diffusion, InstantID | Text-to-Image, Reference Images, Custom Models | Cloud |

Speech-to-Text

| Provider | Models | Features | Type |
| --- | --- | --- | --- |
| OpenAI | Whisper | Multi-language, Timestamps, Word Confidence | Cloud |
| Google Cloud | Speech-to-Text v2 | Real-time, Speaker Diarization, Punctuation | Cloud |

🚀 Quick Start

Installation

pip install llm-provider-factory

Basic Usage

import asyncio
from llm_provider import LLMProviderFactory, OpenAIConfig, OllamaConfig

async def main():
    # Cloud LLM - OpenAI
    openai_config = OpenAIConfig(api_key="your-api-key", model="gpt-4")
    openai_provider = LLMProviderFactory().create_provider("openai", openai_config)
    response = await openai_provider.generate("Hello, world!")
    print(f"OpenAI: {response.content}")
    
    # Local LLM - Ollama
    ollama_config = OllamaConfig(
        base_url="http://localhost:11434",
        model="llama3.1:latest"
    )
    ollama_provider = LLMProviderFactory().create_provider("ollama", ollama_config)
    response = await ollama_provider.generate("Hello, world!")
    print(f"Ollama: {response.content}")

asyncio.run(main())

🎤 Speech-to-Text

Basic Speech Transcription

import asyncio
from llm_provider import SpeechFactory, SpeechRequest

async def speech_example():
    # OpenAI Whisper
    factory = SpeechFactory()
    openai_speech = factory.create_openai_speech(api_key="your-openai-key")
    
    # Basic transcription
    request = SpeechRequest(
        audio_data="/path/to/audio.mp3",
        language="en",
        provider_options={"response_format": "text"}
    )
    
    response = await openai_speech.transcribe(request)
    print(f"Transcription: {response.text}")
    
    # Advanced transcription with timestamps
    detailed_request = SpeechRequest(
        audio_data="/path/to/audio.wav", 
        language="auto",
        timestamps=True,
        word_confidence=True,
        provider_options={
            "response_format": "verbose_json",
            "temperature": 0.2
        }
    )
    
    response = await openai_speech.transcribe(detailed_request)
    print(f"Text: {response.text}")
    
    # Print word-level timestamps
    for word in response.words:
        print(f"{word.word}: {word.start}s - {word.end}s (confidence: {word.confidence})")

asyncio.run(speech_example())

🧰 Tools (Function Calling)

Define functions (tools) and let the model call them.

import asyncio
from llm_provider import LLMProviderFactory, OpenAIConfig, ToolFunction
from llm_provider.utils import run_with_tools_async

calculator = ToolFunction(
    name="calculator",
    description="Evaluate arithmetic expressions",
    parameters={
        "type": "object",
        "properties": {"expression": {"type": "string"}},
        "required": ["expression"],
    },
)

def execute_tool(name: str, args: dict):
    if name == "calculator":
        try:
            return str(eval(args["expression"], {"__builtins__": None}, {}))
        except Exception as e:
            return f"Error: {e}"
    return "Unknown tool"

async def main():
    provider = LLMProviderFactory().create_openai(OpenAIConfig(api_key="your-key", model="gpt-4o-mini"))
    response = await run_with_tools_async(
        provider,
        prompt="Compute (12+30)*2 using the calculator tool",
        tools=[calculator],
        execute_tool=execute_tool,
        max_iterations=2,
    )
    print(response.content)

asyncio.run(main())

Tools with VertexAI Gemini

import asyncio
from llm_provider import LLMProviderFactory, VertexAIConfig, ToolFunction
from llm_provider.utils import run_with_tools_async

# Weather tool
weather_tool = ToolFunction(
    name="get_weather",
    description="Returns weather information for the given city",
    parameters={
        "type": "object",
        "properties": {
            "city": {
                "type": "string",
                "description": "City name (e.g. Istanbul, Ankara)"
            }
        },
        "required": ["city"]
    },
)

def execute_tool(name: str, args: dict):
    if name == "get_weather":
        city = args.get("city", "")
        # In a real application, call a weather API here
        return f"Weather for {city}: 22°C, sunny"
    return "Unknown tool"

async def main():
    config = VertexAIConfig(
        project_id="your-project-id",
        location="us-central1",
        model="gemini-2.0-flash-exp"
    )
    
    provider = LLMProviderFactory().create_vertexai(config)
    
    response = await run_with_tools_async(
        provider,
        prompt="İstanbul'da hava nasıl?",
        tools=[weather_tool],
        execute_tool=execute_tool,
        max_iterations=3,
    )
    
    print(response.content)

asyncio.run(main())

Google Cloud Speech-to-Text

async def google_speech_example():
    factory = SpeechFactory()
    google_speech = factory.create_google_speech(credentials_path="/path/to/credentials.json")
    
    # Advanced transcription with speaker diarization
    request = SpeechRequest(
        audio_data="/path/to/meeting.wav",
        language="en-US",
        speaker_labels=True,
        punctuation=True,
        word_confidence=True,
        provider_options={
            "min_speaker_count": 2,
            "max_speaker_count": 5
        }
    )
    
    response = await google_speech.transcribe(request)
    
    # Print transcript with speaker labels
    print(f"Full transcript: {response.text}")
    
    for segment in response.segments:
        speaker = f"Speaker {segment.speaker_tag}" if segment.speaker_tag else "Unknown"
        print(f"{speaker}: {segment.text}")
        print(f"  Time: {segment.start_time}s - {segment.end_time}s")
        print(f"  Confidence: {segment.confidence}")

asyncio.run(google_speech_example())

🎨 Image Generation

Basic Image Generation

import asyncio
from llm_provider import ImageProviderFactory

async def image_example():
    # OpenAI DALL-E
    factory = ImageProviderFactory()
    openai_image = factory.create_openai_image(api_key="your-openai-key")
    
    response = await openai_image.generate_image(
        prompt="A futuristic city with flying cars at sunset",
        size="1024x1024",
        quality="hd",
        model="dall-e-3"
    )
    print(f"Image URL: {response.urls[0]}")
    
    # Replicate
    replicate_image = factory.create_replicate_image(api_token="your-replicate-token")
    
    response = await replicate_image.generate_image(
        prompt="A cyberpunk street scene with neon lights",
        model="stability-ai/sdxl",
        width=1024,
        height=1024
    )
    print(f"Image URL: {response.urls[0]}")

asyncio.run(image_example())

Combined Text + Image Generation

async def combined_example():
    # Generate prompt with LLM
    llm_factory = LLMProviderFactory()
    llm = llm_factory.create_openai(api_key="your-key")
    
    prompt_response = await llm.generate(
        "Create a detailed artistic prompt for a fantasy landscape"
    )
    
    # Use generated prompt for image
    image_factory = ImageProviderFactory()
    image_provider = image_factory.create_openai_image(api_key="your-key")
    
    image_response = await image_provider.generate_image(
        prompt=prompt_response.content,
        size="1024x1024"
    )
    
    print(f"Generated image: {image_response.urls[0]}")

📖 Detailed Usage

Configuration

Environment Variables

# Cloud LLM Providers
export OPENAI_API_KEY="your-openai-key"
export ANTHROPIC_API_KEY="your-anthropic-key"  
export GOOGLE_API_KEY="your-google-key"
export GOOGLE_CLOUD_PROJECT="your-gcp-project-id"
export GOOGLE_APPLICATION_CREDENTIALS="path/to/service-account.json"

# Local LLM Providers
export OLLAMA_BASE_URL="http://localhost:11434"  # Default Ollama server
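
If you prefer to wire these variables up explicitly rather than rely on any implicit lookup, a minimal sketch using only the standard library and the config classes shown below:

import os
from llm_provider import LLMProviderFactory, OpenAIConfig, OllamaConfig

# Read the variables exported above and pass them in explicitly
openai_config = OpenAIConfig(
    api_key=os.environ["OPENAI_API_KEY"],
    model="gpt-4"
)
ollama_config = OllamaConfig(
    # Fall back to the default local server if the variable is unset
    base_url=os.getenv("OLLAMA_BASE_URL", "http://localhost:11434"),
    model="llama3.1:latest"
)

factory = LLMProviderFactory()
openai_provider = factory.create_provider("openai", openai_config)
ollama_provider = factory.create_provider("ollama", ollama_config)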

Programmatic Configuration

from llm_provider import OpenAIConfig, AnthropicConfig, GeminiConfig, VertexAIConfig

# OpenAI Configuration
openai_config = OpenAIConfig(
    api_key="your-key",
    model="gpt-4",
    max_tokens=1000,
    temperature=0.7
)

# Anthropic Configuration
anthropic_config = AnthropicConfig(
    api_key="your-key",
    model="claude-3-sonnet-20240229",
    max_tokens=1000,
    temperature=0.7
)

# Gemini Configuration
gemini_config = GeminiConfig(
    api_key="your-key",
    model="gemini-pro",
    max_tokens=1000,
    temperature=0.7
)

# VertexAI Configuration  
vertexai_config = VertexAIConfig(
    project_id="your-gcp-project",
    location="us-central1",
    model="gemini-1.5-pro",
    credentials_path="path/to/service-account.json"  # Optional if using GOOGLE_APPLICATION_CREDENTIALS
)

# Ollama Configuration (Local LLM)
ollama_config = OllamaConfig(
    base_url="http://localhost:11434",  # Default Ollama server
    model="llama3.1:latest",  # Any Ollama model
    max_tokens=1000,
    temperature=0.7
)
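
Any of these configs can then be handed to the factory with create_provider, just as in the Quick Start; the "anthropic" and "gemini" registry keys are assumed to match the provider names used elsewhere in this README:

factory = LLMProviderFactory()
anthropic_provider = factory.create_provider("anthropic", anthropic_config)
gemini_provider = factory.create_provider("gemini", gemini_config)
vertexai_provider = factory.create_provider("vertexai", vertexai_config)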

Multiple Provider Usage

from llm_provider import LLMProviderFactory

async def compare_providers():
    factory = LLMProviderFactory()
    prompt = "Explain quantum computing in simple terms"
    
    # Generate with different providers
    openai_response = await factory.generate(prompt, provider="openai")
    anthropic_response = await factory.generate(prompt, provider="anthropic")
    gemini_response = await factory.generate(prompt, provider="gemini")
    vertexai_response = await factory.generate(prompt, provider="vertexai")
    
    print(f"OpenAI: {openai_response.content}")
    print(f"Anthropic: {anthropic_response.content}")
    print(f"Gemini: {gemini_response.content}")
    print(f"VertexAI: {vertexai_response.content}")

Conversation History

from llm_provider import Message, MessageRole

async def conversation_example():
    provider = LLMProviderFactory.create_openai()
    
    history = [
        Message(role=MessageRole.USER, content="Hello, I'm learning Python"),
        Message(role=MessageRole.ASSISTANT, content="Hello! I'd be happy to help you learn Python."),
        Message(role=MessageRole.USER, content="Can you explain variables?")
    ]
    
    response = await provider.generate(
        "Now explain functions",
        history=history
    )
    print(response.content)
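
To keep a multi-turn conversation going, append each exchange to the history before the next call; a small continuation of the example above:

    # Carry the last exchange forward so the model keeps the context
    history.append(Message(role=MessageRole.USER, content="Now explain functions"))
    history.append(Message(role=MessageRole.ASSISTANT, content=response.content))

    response = await provider.generate("And what about classes?", history=history)
    print(response.content)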

Streaming Responses

async def streaming_example():
    provider = LLMProviderFactory.create_openai()
    
    async for chunk in provider.stream_generate("Write a short story about AI"):
        if chunk.content:
            print(chunk.content, end="", flush=True)
        
        if chunk.is_final:
            print(f"\nFinish reason: {chunk.finish_reason}")
            break

Error Handling

from llm_provider import (
    AuthenticationError,
    RateLimitError,
    ModelNotAvailableError,
    GenerationError
)

async def robust_generation():
    provider = LLMProviderFactory.create_openai()
    
    try:
        response = await factory.generate("Hello world")
        return response.content
    except AuthenticationError:
        print("Check your API key")
    except RateLimitError:
        print("Rate limit exceeded, try again later")
    except ModelNotAvailableError:
        print("Model not available")
    except GenerationError as e:
        print(f"Generation failed: {e}")

🔧 Advanced Usage

Custom Provider

from llm_provider import BaseLLMProvider, ProviderConfig

class CustomProvider(BaseLLMProvider):
    async def initialize(self):
        # Initialize your custom provider
        pass
    
    async def generate(self, request):
        # Implement generation logic
        pass
    
    async def stream_generate(self, request):
        # Implement streaming logic
        pass
    
    def get_supported_models(self):
        return ["custom-model-1", "custom-model-2"]
    
    def validate_config(self):
        return True
    
    def get_provider_info(self):
        # Return provider information
        pass

# Register custom provider
factory = LLMProviderFactory()
factory.register_provider("custom", CustomProvider)
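
Once registered, the custom provider is created like any built-in one. A sketch, assuming a ProviderConfig (or subclass) instance carries its settings:

# Inside an async function
custom_config = ProviderConfig(model="custom-model-1")  # fields assumed for illustration
custom_provider = factory.create_provider("custom", custom_config)
response = await custom_provider.generate("Hello from a custom provider")
print(response.content)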

Provider Information

async def provider_info_example():
    factory = LLMProviderFactory()
    
    # Get all provider information
    all_providers = factory.get_provider_info()
    for info in all_providers:
        print(f"{info.display_name}: {info.supported_models}")
    
    # Get specific provider info
    openai_info = factory.get_provider_info("openai")
    print(f"OpenAI models: {openai_info.supported_models}")

📁 Project Structure

llm-provider-factory/
├── src/
│   └── llm_provider/
│       ├── __init__.py
│       ├── factory.py
│       ├── base_provider.py
│       ├── settings.py
│       ├── providers/
│       │   ├── __init__.py
│       │   ├── openai_provider.py
│       │   ├── anthropic_provider.py
│       │   └── gemini_provider.py
│       └── utils/
│           ├── __init__.py
│           ├── config.py
│           ├── exceptions.py
│           └── logger.py
├── tests/
├── pyproject.toml
└── README.md

🧪 Testing

# Install development dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run tests with coverage
pytest --cov=src/llm_provider --cov-report=html

# Run specific test file
pytest tests/test_factory.py
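
The mocking support mentioned in the features can lean on standard unittest.mock tooling. A minimal sketch (assumes pytest-asyncio is installed, and stands in a fake provider rather than using any library-specific test helper):

from unittest.mock import AsyncMock

import pytest

@pytest.mark.asyncio
async def test_generate_returns_content():
    # AsyncMock stands in for any provider, since generate() is async
    fake_provider = AsyncMock()
    fake_provider.generate.return_value.content = "mocked reply"

    response = await fake_provider.generate("Hello")

    assert response.content == "mocked reply"
    fake_provider.generate.assert_awaited_once_with("Hello")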

🤝 Contributing

  1. Fork the repository
  2. Create a feature branch (git checkout -b feature/new-provider)
  3. Make your changes
  4. Add tests for new functionality
  5. Ensure all tests pass (pytest)
  6. Submit a pull request

Adding a New Provider

  1. Create a new provider class in src/llm_provider/providers/
  2. Inherit from BaseLLMProvider
  3. Implement all abstract methods
  4. Add a configuration class in utils/config.py (see the sketch after this list)
  5. Register the provider in factory.py
  6. Add tests in tests/
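
For step 4, a sketch of what the configuration class might look like (field names are illustrative; the features list above says configs are Pydantic models, so class-attribute field syntax is assumed):

from llm_provider import ProviderConfig

class CustomConfig(ProviderConfig):
    """Settings for the custom provider (field names hypothetical)."""
    api_key: str
    model: str = "custom-model-1"
    max_tokens: int = 1000
    temperature: float = 0.7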

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

  • OpenAI for their excellent API and documentation
  • Anthropic for Claude's capabilities
  • Google for Gemini's multimodal features
  • The Python community for inspiration and tools

LLM Provider Factory

A unified factory for multiple LLM providers (OpenAI, Anthropic, Google Gemini, Google Vertex AI).

VertexAI + Mistral Support

This package now supports Mistral models via Google Cloud Vertex AI!

Supported Models

VertexAI Provider:

  • Gemini models: gemini-1.5-pro, gemini-1.5-flash, gemini-1.0-pro
  • Text/Chat models: text-bison, chat-bison
  • Mistral models: mistral-large-2411, mistral-7b-instruct

Quick Start - VertexAI

  1. Install the dependencies:
python setup_vertexai.py
  2. Google Cloud setup:
    • Create a service account
    • Download the JSON credentials file
    • Set the environment variable:
export GOOGLE_APPLICATION_CREDENTIALS='/path/to/your/credentials.json'
  3. Usage:
from llm_provider import LLMProviderFactory
from llm_provider.utils.config import VertexAIConfig

# Configuration
config = VertexAIConfig(
    project_id="your-project-id",
    location="us-central1", 
    model="mistral-large-2411",  # Mistral model!
    credentials_path="/path/to/credentials.json",
    temperature=0.1,
    max_tokens=1000
)

# Create provider
factory = LLMProviderFactory()
provider = factory.create_provider("vertexai", config)

# Generate a response (inside an async function)
response = await provider.generate("Hello from Mistral on Vertex AI!")
  4. Test:
python test_vertexai_mistral.py

Built with clean architecture principles and SOLID design patterns.

## 🚀 Quick Start

```bash
pip install llm-provider-factory
```

```python
from llm_provider import LLMProviderFactory, OpenAI

provider = LLMProviderFactory(OpenAI(api_key="your-key"))
response = provider.generate(prompt="Hello", history=[])
print(response.content)
```

✨ Features

  • 🏭 Factory Pattern: Clean, consistent interface
  • 🔌 Extensible: Easy to add new providers
  • 🛡️ Type Safe: Full typing support
  • 🚀 Production Ready: Comprehensive error handling
  • 📦 Minimal Dependencies: only requires requests

📦 Supported Providers

  • OpenAI (GPT-3.5, GPT-4)
  • Anthropic (Claude models)
  • Google Gemini (Gemini Pro, Flash)

📚 Documentation

See the package source code and examples in the repository.
