Pydantic-AI models for LLMling-agent
Project description
LLMling-models
llmling-models
Collection of model wrappers and adapters for use with LLMling-Agent, but should work with the underlying pydantic-ai API without issues.
WARNING:
This is just a prototype for now and will likely change in the future. Also, pydantic-ais APIs dont seem stable yet, so things might not work across all pydantic-ai versions. I will try to keep this up to date as fast as possible.
Available Models
LLM Library Adapter
Adapter to use models from the LLM library with Pydantic-AI:
from pydantic_ai import Agent
from llmling_models.llm_adapter import LLMAdapter
# Basic usage
adapter = LLMAdapter(model_name="gpt-4o-mini")
agent = Agent(model=adapter)
result = await agent.run("Write a short poem")
# Streaming support
async with agent.run_stream("Test prompt") as response:
async for chunk in response.stream():
print(chunk)
# Usage statistics
result = await agent.run("Test prompt")
usage = result.usage()
print(f"Request tokens: {usage.request_tokens}")
print(f"Response tokens: {usage.response_tokens}")
(Examples need to be wrapped in async function and run with asyncio.run)
AISuite Adapter
Adapter to use models from AISuite with Pydantic-AI:
from pydantic_ai import Agent
from llmling_models.aisuite_adapter import AISuiteAdapter
# Basic usage
adapter = AISuiteAdapter(model="model_name")
agent = Agent(adapter)
result = await agent.run("Write a story")
Multi-Models
Fallback Model
Tries models in sequence until one succeeds. Perfect for handling rate limits or service outages:
from llmling_models import FallbackMultiModel
fallback_model = FallbackMultiModel(
models=[
"openai:gpt-4", # Try this first
"openai:gpt-3.5-turbo", # Fallback option
"anthropic:claude-2" # Last resort
]
)
agent = Agent(fallback_model)
result = await agent.run("Complex question")
Augmented Model
Enhances prompts through pre- and post-processing steps using auxiliary language models:
from llmling_models import AugmentedModel
model = AugmentedModel(
main_model="openai:gpt-4",
pre_prompt={
"text": "Expand this question: {input}",
"model": "openai:gpt-3.5-turbo"
},
post_prompt={
"text": "Summarize this response concisely: {output}",
"model": "openai:gpt-3.5-turbo"
}
)
agent = Agent(model)
# The question will be expanded before processing
# and the response will be summarized afterward
result = await agent.run("What is AI?")
Model Delegation
Dynamically selects models based on given prompt. Uses a selector model to choose the most appropriate model for each task:
from pydantic_ai import Agent
from llmling_models import DelegationMultiModel
# Basic setup with model list
delegation_model = DelegationMultiModel(
selector_model="openai:gpt-4-turbo",
models=["openai:gpt-4", "openai:gpt-3.5-turbo"],
selection_prompt="Pick gpt-4 for complex tasks, gpt-3.5-turbo for simple queries."
)
# Advanced setup with model descriptions
delegation_model = DelegationMultiModel(
selector_model="openai:gpt-4-turbo",
models=["openai:gpt-4", "anthropic:claude-2", "openai:gpt-3.5-turbo"],
model_descriptions={
"openai:gpt-4": "Complex reasoning, math problems, and coding tasks",
"anthropic:claude-2": "Long-form analysis and research synthesis",
"openai:gpt-3.5-turbo": "Simple queries, chat, and basic information"
},
selection_prompt="Select the most appropriate model for the task."
)
agent = Agent(delegation_model)
# The selector model will analyze the prompt and choose the most suitable model
result = await agent.run("Solve this complex mathematical proof...")
Cost-Optimized Model
Selects models based on input cost limits, automatically choosing the most appropriate model within your budget constraints:
from pydantic_ai import Agent
from llmling_models import CostOptimizedMultiModel
# Use cheapest model that can handle the task
cost_model = CostOptimizedMultiModel(
models=[
"openai:gpt-4", # More expensive
"openai:gpt-3.5-turbo", # Less expensive
],
max_input_cost=0.1, # Maximum cost in USD per request
strategy="cheapest_possible" # Use cheapest model that fits
)
# Or use the best model within budget
cost_model = CostOptimizedMultiModel(
models=[
"openai:gpt-4-32k", # Most expensive
"openai:gpt-4", # Medium cost
"openai:gpt-3.5-turbo", # Cheapest
],
max_input_cost=0.5, # Higher budget
strategy="best_within_budget" # Use best model within budget
)
agent = Agent(cost_model)
result = await agent.run("Your prompt here")
Token-Optimized Model
Automatically selects models based on input token count and context window requirements:
from pydantic_ai import Agent
from llmling_models import TokenOptimizedMultiModel
# Create model that automatically handles different context lengths
token_model = TokenOptimizedMultiModel(
models=[
"openai:gpt-4-32k", # 32k context
"openai:gpt-4", # 8k context
"openai:gpt-3.5-turbo", # 4k context
],
strategy="efficient" # Use smallest sufficient context window
)
# Or maximize context window availability
token_model = TokenOptimizedMultiModel(
models=[
"openai:gpt-4-32k", # 32k context
"openai:gpt-4", # 8k context
"openai:gpt-3.5-turbo", # 4k context
],
strategy="maximum_context" # Use largest available context window
)
agent = Agent(token_model)
# Will automatically select appropriate model based on input length
result = await agent.run("Your long prompt here...")
# Long inputs automatically use models with larger context windows
result = await agent.run("Very long document..." * 1000)
The cost-optimized model ensures you stay within budget while getting the best possible model for your needs, while the token-optimized model automatically handles varying input lengths by selecting models with appropriate context windows.
All multi models are generically typed to follow pydantic best practices. Usefulness for that is debatable though. :P
Installation
pip install llmling-models
Requirements
- Python 3.12+
- pydantic-ai
- llm (optional, for LLM adapter)
- aisuite (optional, for aisuite adapter)
- Either tokenizers or transformers for improved token calculation
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llmling_models-0.3.2.tar.gz.
File metadata
- Download URL: llmling_models-0.3.2.tar.gz
- Upload date:
- Size: 24.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3618fb7f7dac44692fb9e39cfc86ac2da705e6762969b067b42ac7bca3f8c9dc
|
|
| MD5 |
6d56a857df06a311e1452ef816fe0aef
|
|
| BLAKE2b-256 |
ebfff33cb3ad09d73d482df20a504d11ae82d66240a5998c2048599369d3693e
|
Provenance
The following attestation bundles were made for llmling_models-0.3.2.tar.gz:
Publisher:
build.yml on phil65/LLMling-models
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
llmling_models-0.3.2.tar.gz -
Subject digest:
3618fb7f7dac44692fb9e39cfc86ac2da705e6762969b067b42ac7bca3f8c9dc - Sigstore transparency entry: 159294117
- Sigstore integration time:
-
Permalink:
phil65/LLMling-models@76321e6f2c111620f54f423146da78111e95e764 -
Branch / Tag:
refs/tags/v0.3.2 - Owner: https://github.com/phil65
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build.yml@76321e6f2c111620f54f423146da78111e95e764 -
Trigger Event:
push
-
Statement type:
File details
Details for the file llmling_models-0.3.2-py3-none-any.whl.
File metadata
- Download URL: llmling_models-0.3.2-py3-none-any.whl
- Upload date:
- Size: 24.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3cf4bf0b1947bf0cb5cab37207b4c1ac6b7d18e878767ac7975d201a0676ba56
|
|
| MD5 |
e33e8c9516c7459de289a8dfaccd9415
|
|
| BLAKE2b-256 |
488d83de5aae01f7aa627a568636e9bc3706ab43e9d1da37f1a7593be40eb52f
|
Provenance
The following attestation bundles were made for llmling_models-0.3.2-py3-none-any.whl:
Publisher:
build.yml on phil65/LLMling-models
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
llmling_models-0.3.2-py3-none-any.whl -
Subject digest:
3cf4bf0b1947bf0cb5cab37207b4c1ac6b7d18e878767ac7975d201a0676ba56 - Sigstore transparency entry: 159294119
- Sigstore integration time:
-
Permalink:
phil65/LLMling-models@76321e6f2c111620f54f423146da78111e95e764 -
Branch / Tag:
refs/tags/v0.3.2 - Owner: https://github.com/phil65
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
build.yml@76321e6f2c111620f54f423146da78111e95e764 -
Trigger Event:
push
-
Statement type: