SmartLLM

A unified interface for interacting with multiple Large Language Model providers

SmartLLM is a unified Python interface for interacting with multiple Large Language Model providers. It provides a consistent API across providers, persistent caching of responses, and support for both synchronous and streaming interactions.

Installation

pip install smartllm

Features

  • Unified API: Consistent interface for OpenAI, Anthropic, and Perplexity LLMs
  • Response Caching: Persistent JSON-based caching of responses to improve performance
  • Streaming Support: Real-time streaming of LLM responses (Anthropic only)
  • JSON Mode: Structured JSON responses (OpenAI and Anthropic)
  • Citations: Access to source information (Perplexity only)
  • Asynchronous Execution: Non-blocking request execution
  • Configurable Parameters: Granular control over temperature, tokens, and other model parameters

Supported Providers

SmartLLM currently supports the following LLM providers:

  • OpenAI

    • Models: GPT-4, GPT-3.5 series, and other OpenAI models
    • Features: JSON-structured outputs, token usage information
    • Example: base="openai", model="gpt-4"
  • Anthropic

    • Models: Claude models (e.g., claude-3-7-sonnet-20250219)
    • Features: Streaming support, JSON-structured outputs, system prompts
    • Example: base="anthropic", model="claude-3-7-sonnet-20250219"
  • Perplexity

    • Models: sonar-small-online, sonar-medium-online, sonar-pro, etc.
    • Features: Web search capabilities, citation information
    • Example: base="perplexity", model="sonar-pro"

Basic Usage

from smartllm import SmartLLM
import os

# Create SmartLLM instance
llm = SmartLLM(
    base="openai",
    model="gpt-4",
    api_key=os.environ.get("OPENAI_API_KEY"),
    prompt="Explain quantum computing in simple terms",
    temperature=0.7
)

# Execute the request
llm.execute()

# Wait for completion
llm.wait_for_completion()

# Check status and get response
if llm.is_completed():
    print(llm.response)
else:
    print(f"Error: {llm.get_error()}")

SmartLLM Class Reference

Constructor

SmartLLM(
    base: str = "",                  # LLM provider ("openai", "anthropic", "perplexity")
    model: str = "",                 # Model identifier
    api_key: str = "",               # API key for the provider
    prompt: Union[str, List[str]] = "", # Single prompt or conversation history
    stream: bool = False,            # Enable streaming (Anthropic only)
    max_input_tokens: Optional[int] = None,  # Max input tokens
    max_output_tokens: Optional[int] = None, # Max output tokens
    output_type: str = "text",       # Output type
    temperature: float = 0.2,        # Temperature for generation
    top_p: float = 0.9,              # Top-p sampling parameter
    frequency_penalty: float = 1.0,  # Frequency penalty
    presence_penalty: float = 0.0,   # Presence penalty
    system_prompt: Optional[str] = None, # System prompt
    search_recency_filter: Optional[str] = None, # Filter for search (Perplexity)
    return_citations: bool = False,  # Include citations (Perplexity)
    json_mode: bool = False,         # Enable JSON mode (OpenAI, Anthropic)
    json_schema: Optional[Dict[str, Any]] = None, # JSON schema
    ttl: int = 7,                    # Cache time-to-live in days
    clear_cache: bool = False        # Clear existing cache
)

Methods

execute(callback: Optional[Callable[[str, str], None]] = None) -> SmartLLM

Initiates the LLM request. For streaming requests, an optional callback function can be provided to process each chunk of the response.

# Basic execution
llm.execute()

# With streaming callback (Anthropic only)
def handle_chunk(chunk: str, accumulated: str) -> None:
    print(f"New chunk: {chunk}")

llm.execute(callback=handle_chunk)

wait_for_completion(timeout: Optional[float] = None) -> bool

Blocks until the request completes or the optional timeout (in seconds) elapses. Returns True if the request completed successfully, False otherwise.

# Wait indefinitely
llm.wait_for_completion()

# Wait with timeout (in seconds)
success = llm.wait_for_completion(timeout=10.0)

is_failed() -> bool

Returns True if the request failed.

is_completed() -> bool

Returns True if the request completed successfully.

is_pending() -> bool

Returns True if the request is still in progress.

get_error() -> Optional[str]

Returns the error message if the request failed, or None if no error occurred.

Properties

response: Union[str, Dict[str, Any]]

Returns the generated content. If JSON mode is enabled and JSON content is available, returns a dictionary; otherwise, returns the text content.

_content: str

Returns the raw text content of the response.

_json_content: Optional[Dict[str, Any]]

Returns the parsed JSON content if available (requires json_mode=True).

sources: List[str]

Returns citation sources (available with Perplexity when return_citations=True).

usage: Dict[str, int]

Returns token usage statistics for the request, including prompt tokens, completion tokens, and total tokens.
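As an illustration, the usage dictionaries can be accumulated across requests to track token consumption over a session. The key names below are assumptions for the sketch and may differ from the exact keys SmartLLM returns:

```python
# Illustrative sketch: track cumulative token usage across several requests.
# The key names are assumptions, not verified against SmartLLM's output.
totals = {"prompt_tokens": 0, "completion_tokens": 0, "total_tokens": 0}

# In real code, each entry would come from llm.usage after wait_for_completion().
for usage in [
    {"prompt_tokens": 52, "completion_tokens": 413, "total_tokens": 465},
    {"prompt_tokens": 31, "completion_tokens": 120, "total_tokens": 151},
]:
    for key in totals:
        totals[key] += usage[key]

print(totals)  # → {'prompt_tokens': 83, 'completion_tokens': 533, 'total_tokens': 616}
```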

Advanced Features

Streaming Responses (Anthropic Only)

from smartllm import SmartLLM
import os

def print_chunk(chunk: str, accumulated: str) -> None:
    print(f"CHUNK: {chunk}")

llm = SmartLLM(
    base="anthropic",
    model="claude-3-7-sonnet-20250219",
    api_key=os.environ.get("ANTHROPIC_API_KEY"),
    prompt="Write a short story about a robot learning to paint",
    stream=True  # Enable streaming
)

# Execute with callback
llm.execute(callback=print_chunk)
llm.wait_for_completion()

JSON Mode (OpenAI and Anthropic)

from smartllm import SmartLLM
import os

json_schema = {
    "type": "object",
    "properties": {
        "title": {"type": "string"},
        "topics": {"type": "array", "items": {"type": "string"}},
        "difficulty": {"type": "integer", "minimum": 1, "maximum": 10}
    },
    "required": ["title", "topics", "difficulty"]
}

llm = SmartLLM(
    base="openai",
    model="gpt-4",
    api_key=os.environ.get("OPENAI_API_KEY"),
    prompt="Generate information about a quantum computing course",
    json_mode=True,
    json_schema=json_schema
)

llm.execute()
llm.wait_for_completion()

# Access structured data
course_info = llm.response  # Returns a Python dictionary
print(f"Course title: {course_info['title']}")
print(f"Topics: {', '.join(course_info['topics'])}")
print(f"Difficulty: {course_info['difficulty']}/10")

Getting Citations (Perplexity Only)

from smartllm import SmartLLM
import os

llm = SmartLLM(
    base="perplexity",
    model="sonar-pro",
    api_key=os.environ.get("PERPLEXITY_API_KEY"),
    prompt="What are the latest advancements in quantum computing?",
    search_recency_filter="week",  # Filter for recent information
    return_citations=True  # Enable citations
)

llm.execute()
llm.wait_for_completion()

# Print the response
print(llm.response)

# Print the sources
print("\nSources:")
for source in llm.sources:
    print(f"- {source}")

Caching Mechanism

SmartLLM uses a persistent JSON-based caching system powered by the Cacherator library. This significantly improves performance by avoiding redundant API calls for identical requests.

Cache Configuration

By default, responses are cached for 7 days. You can customize the cache behavior:

# Set custom time-to-live (TTL) in days
llm = SmartLLM(
    base="openai",
    model="gpt-4",
    api_key=os.environ.get("OPENAI_API_KEY"),
    prompt="Explain quantum computing",
    ttl=30  # Cache results for 30 days
)

# Force clear existing cache
llm = SmartLLM(
    base="openai",
    model="gpt-4",
    api_key=os.environ.get("OPENAI_API_KEY"),
    prompt="Explain quantum computing",
    clear_cache=True  # Ignore any existing cached response
)

How Caching Works

  1. Each request is assigned a unique identifier based on:

    • Provider (base)
    • Model
    • Prompt
    • All relevant parameters (temperature, tokens, etc.)
  2. Responses are stored in JSON format in the data/llm directory

  3. When making an identical request, the cached response is returned instead of making a new API call

  4. Cache entries automatically expire after the specified TTL

  5. Cache can be manually cleared by setting clear_cache=True
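The key derivation is internal to SmartLLM, but the idea behind step 1 can be sketched as follows. The `cache_key` helper, its fields, and the SHA-256 hashing scheme are illustrative assumptions, not SmartLLM's actual implementation:

```python
import hashlib
import json

def cache_key(base: str, model: str, prompt: str, **params) -> str:
    # Serialize the request deterministically (sorted keys) so that
    # identical requests always produce the same identifier.
    payload = json.dumps(
        {"base": base, "model": model, "prompt": prompt, "params": params},
        sort_keys=True,
    )
    # Hash the serialized request into a fixed-length, filename-safe key.
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()

key_a = cache_key("openai", "gpt-4", "Explain quantum computing", temperature=0.7)
key_b = cache_key("openai", "gpt-4", "Explain quantum computing", temperature=0.7)
key_c = cache_key("openai", "gpt-4", "Explain quantum computing", temperature=0.2)

assert key_a == key_b  # identical requests share one cache entry
assert key_a != key_c  # changing any parameter yields a new entry
```

Because every relevant parameter feeds into the key, changing the temperature (or any other setting) bypasses the cached response, while repeating an identical request hits the cache.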

Error Handling

SmartLLM provides robust error handling through state tracking:

llm = SmartLLM(...)
llm.execute()
llm.wait_for_completion()

if llm.is_failed():
    print(f"Request failed: {llm.get_error()}")
elif llm.is_completed():
    print("Request completed successfully")
    print(llm.response)
elif llm.is_pending():
    print("Request is still in progress")

Dependencies

  • cacherator: Persistent JSON-based caching
  • logorator: Decorator-based logging
  • openai>=1.0.0: OpenAI API client
  • anthropic>=0.5.0: Anthropic API client
  • python-slugify: Utility for creating safe identifiers

License

MIT License

