Skip to main content

Unified Python interface for multiple AI providers with support for text, images, and documents.

Project description

Multi AI Handler

A unified Python library for interacting with multiple AI providers through a consistent interface. Supports text and file inputs across OpenAI, Anthropic Claude, Google Gemini, and OpenRouter APIs.

Features

  • Unified interface for multiple AI providers
  • Support for text-only, file-only, or combined text and file inputs
  • Automatic payload formatting for each provider's API requirements
  • Support for images and documents (PDF)
  • Built-in token usage tracking for Gemini
  • Streaming support for Anthropic Claude
  • Environment-based API key management

Supported Providers

  • Anthropic Claude
  • Google Gemini
  • OpenAI
  • OpenRouter

Installation

Prerequisites

  • Python 3.14 or higher
  • uv package manager

Setup

git clone https://github.com/vsharha/multi-ai-handler.git
cd multi-ai-handler
uv sync

Setup

1. Create a .env file

Create a .env file in your project root with your API keys:

ANTHROPIC_API_KEY=your_anthropic_api_key_here
GEMINI_API_KEY=your_gemini_api_key_here
OPENROUTER_API_KEY=your_openrouter_api_key_here

2. Import the library

from multi_ai_handler import (
    request_ai,
    request_anthropic,
    request_google,
    request_openai,
    request_openrouter,
    Providers
)

Usage

Text-only requests

from multi_ai_handler import request_anthropic

response = request_anthropic(
    system_prompt="You are a helpful assistant.",
    user_text="What is the capital of France?",
    model="claude-3-5-sonnet-20241022",
    temperature=0.7
)
print(response)

Image analysis

from multi_ai_handler import request_google

# Simply pass the file path
response = request_google(
    system_prompt="You are an image analysis expert.",
    user_text="Describe what you see in this image.",
    file="image.jpg",
    model="gemini-1.5-flash",
    temperature=0.0
)
print(response)

Document processing

from multi_ai_handler import request_anthropic

# Pass the PDF file path
response = request_anthropic(
    system_prompt="You are a document analysis assistant.",
    user_text="Summarize the key points from this document.",
    file="document.pdf",
    model="claude-3-5-sonnet-20241022",
    temperature=0.0
)
print(response)

File-only requests (no text)

from multi_ai_handler import request_google

# User text can be None when only analyzing a file
response = request_google(
    system_prompt="Extract all text and data from images.",
    file="chart.png",
    model="gemini-1.5-pro"
)
print(response)

Using pathlib.Path

from multi_ai_handler import request_anthropic
from pathlib import Path

response = request_anthropic(
    system_prompt="Analyze this document.",
    file=Path("documents/report.pdf"),
    model="claude-3-5-sonnet-20241022"
)
print(response)

Using pre-encoded data

from multi_ai_handler import request_google
import base64

# If you already have encoded data
with open("image.jpg", "rb") as f:
    encoded_data = base64.b64encode(f.read()).decode()

response = request_google(
    system_prompt="Describe this image.",
    file={"filename": "image.jpg", "encoded_data": encoded_data},
    model="gemini-1.5-flash"
)
print(response)

Unified interface with JSON output

The library provides a unified request_ai() function that automatically routes to the appropriate provider and supports JSON output parsing:

from multi_ai_handler import request_ai, Providers

# Basic usage with default provider (Google)
response = request_ai(
    system_prompt="You are a helpful assistant.",
    user_text="What is the capital of France?",
    temperature=0.7
)
print(response)

# Specify a provider and model
response = request_ai(
    system_prompt="You are a data extraction expert.",
    user_text="Extract key information from: John Doe, age 30, lives in NYC",
    provider=Providers.ANTHROPIC,
    model="claude-sonnet-4-5-20250929",
    temperature=0.0
)
print(response)

# Request JSON output - automatically parses response
data = request_ai(
    system_prompt="You are a JSON formatter. Return valid JSON only.",
    user_text="Convert to JSON: Name: Alice, Age: 25, City: London",
    provider="google",
    json_output=True
)
print(data)  # Returns parsed dict

JSON Output Parsing:

  • When json_output=True, the function automatically extracts and parses JSON from the response
  • Handles responses wrapped in markdown code blocks (json ... )
  • Returns a Python dictionary
  • Raises an exception if JSON parsing fails

API Reference

Common Parameters

All request functions share the following parameters:

  • system_prompt (str, required): The system instruction for the AI model
  • user_text (str, optional): The user's text input
  • file (str | Path | dict, optional): File to process. Can be:
    • File path: "image.jpg" or Path("image.jpg") - automatically reads and encodes
    • Dict: {"filename": "image.jpg", "encoded_data": "base64..."} - for pre-encoded data
  • model (str, required): The specific model to use
  • temperature (float, optional): Controls randomness (0.0 = deterministic, 1.0 = creative). Default: 0.0

Note: Either user_text or file must be provided.

request_anthropic()

Makes a request to Anthropic's Claude API with streaming support.

def request_anthropic(
    system_prompt: str,
    user_text: str = None,
    file: str | Path | dict = None,
    model: str = None,
    temperature: float = 0.0
) -> str

Supported file types: Images (PNG, JPEG, GIF, WebP), Documents (PDF, DOCX, TXT, etc.)

request_google()

Makes a request to Google's Gemini API with token usage reporting.

def request_google(
    system_prompt: str,
    user_text: str = None,
    file: str | Path | dict = None,
    model: str = None,
    temperature: float = 0.0
) -> str

Supported file types: Images, videos, audio, documents

Note: Prints token usage (prompt, output, and total tokens) to console.

request_openai()

Makes a request to OpenAI's API.

def request_openai(
    system_prompt: str,
    user_text: str = None,
    file: str | Path | dict = None,
    model: str = None,
    temperature: float = 0.0,
    link: str = None
) -> str

Supported file types: Images (PNG, JPEG, GIF, WebP), PDFs (via file API)

Additional parameter:

  • link (str, optional): Custom base URL for API endpoint

request_openrouter()

Makes a request through OpenRouter's unified API.

def request_openrouter(
    system_prompt: str,
    user_text: str = None,
    file: str | Path | dict = None,
    model: str = None,
    temperature: float = 0.0
) -> str

Uses OpenAI-compatible format. Requires OPENROUTER_API_KEY in environment.

request_ai() (Unified Interface)

Unified function that automatically routes to the appropriate provider with built-in JSON parsing support.

def request_ai(
    system_prompt: str,
    user_text: str = None,
    file: str | Path | dict = None,
    provider: str | Providers | None = None,
    model: str | None = None,
    temperature: float = 0.2,
    json_output: bool = False
) -> str | dict

Parameters:

  • All common parameters (system_prompt, user_text, file, temperature)
  • provider (str | Providers, optional): Provider to use. Defaults to Google Gemini
  • model (str, optional): Model to use. Defaults to first supported model for the provider
  • json_output (bool, optional): If True, parses and returns JSON as dict. Default: False

Returns:

  • str if json_output=False
  • dict if json_output=True

Supported Providers Enum:

  • Providers.GOOGLE
  • Providers.ANTHROPIC
  • Providers.OPENAI
  • Providers.OPENROUTER

Payload Generation

The library automatically formats payloads for each provider using specialized functions:

  • generate_openai_payload(): Creates OpenAI-compatible content blocks
  • generate_gemini_payload(): Creates Gemini-compatible parts
  • generate_claude_payload(): Creates Claude-compatible content blocks

These functions handle:

  • MIME type detection from filenames
  • Base64 data URL formatting
  • Provider-specific content structure
  • Validation of required inputs

Error Handling

The library raises errors in the following cases:

  • ValueError: Neither user_text nor file is provided
  • ValueError: MIME type cannot be detected from filename
  • FileNotFoundError: File path provided but file doesn't exist
  • ValueError: Path provided is not a file (e.g., is a directory)
  • Invalid API credentials

Example:

from multi_ai_handler import request_anthropic

try:
    response = request_anthropic(
        system_prompt="You are helpful.",
        file="document.pdf",
        model="claude-3-5-sonnet-20241022"
    )
except FileNotFoundError as e:
    print(f"File not found: {e}")
except ValueError as e:
    print(f"Error: {e}")

Best Practices

  1. Use specific model names: Always specify the exact model version (e.g., claude-3-5-sonnet-20241022)
  2. Handle errors: Wrap API calls in try-except blocks
  3. Manage API keys securely: Never commit .env files to version control
  4. Optimize temperature: Use lower values (0.0-0.3) for factual tasks, higher (0.7-1.0) for creative tasks
  5. Monitor token usage: Check console output for Gemini token counts

License

MIT

Contributing

Contributions are welcome! Please open an issue or submit a pull request.

Support

For issues and questions, please open an issue on the GitHub repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multi_ai_handler-1.0.0.tar.gz (26.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

multi_ai_handler-1.0.0-py3-none-any.whl (8.7 kB view details)

Uploaded Python 3

File details

Details for the file multi_ai_handler-1.0.0.tar.gz.

File metadata

  • Download URL: multi_ai_handler-1.0.0.tar.gz
  • Upload date:
  • Size: 26.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for multi_ai_handler-1.0.0.tar.gz
Algorithm Hash digest
SHA256 737dc56611ffb0688bfa224b17d18ccdbc3a9751a69b0bb50625412b4088402a
MD5 3980a4c03ba6265f4d473e41acc01088
BLAKE2b-256 20f343f616bf62fdb1915c22259b6d4405d0c10f8c67c20bcaa4bcce26d7975f

See more details on using hashes here.

Provenance

The following attestation bundles were made for multi_ai_handler-1.0.0.tar.gz:

Publisher: publish.yml on vsharha/multi-ai-handler

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file multi_ai_handler-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for multi_ai_handler-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ec768377307a0320722697b0dfb2264815049c72d24dc3474a8363a0026c65c6
MD5 c91f4ac9d601139980000d876086c933
BLAKE2b-256 f8f8a29ddb451cfd5149706becef170be4c7fcfe55a5db244e556a673a06da13

See more details on using hashes here.

Provenance

The following attestation bundles were made for multi_ai_handler-1.0.0-py3-none-any.whl:

Publisher: publish.yml on vsharha/multi-ai-handler

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page