Easy '1-line' calling of every LLM from OpenAI, MS Azure, AWS Bedrock, GCP Vertex, and Ollama
Project description
Easy-LLMS
- Easy "1-line" calling of every LLM from OpenAI, MS Azure, AWS Bedrock, GCP Vertex, and Ollama
Table of Contents
- What is Easy-LLMS and why would you use it
- Quick Getting Started
- Authentication
- Detailed Usage and Examples
- Supported Providers and Models
- Assumptions
- Help/Questions/Comments
What is Easy-LLMS and why would you use it?
Easy-LLMS is a Python module which gives you easy "1-line" access to every LLM (Large Language Model) within OpenAI, MS Azure, AWS Bedrock, GCP Vertex, and Ollama
The idea is that you do not need to focus on the individual provider's APIs or LangChain abstractions, the different authentication methods, or the multitude of LLM requirements/options/parameters/documentation, etc. Rather, you can quickly interact and compare LLMs, get results, and build useful things with those LLMs.
If you find this helpful, I would very much appreciate starring it on GitHub.
Quick Getting Started
Install/Upgrade
-
Install/upgrade latest via:
pip install -U easy-llms
-
Alternatively, install latest via:
pip install git+https://github.com/ventz/easy-llms
-
You can upgrade to the latest via:
pip install --upgrade --force-reinstall git+https://github.com/openai/whisper.git
Example: OpenAI
Let's start with OpenAI since most are familiar with it:
from llms.openai import *
answer = gpt_4o().run("what llm are you?")
The authentication, in this case for OpenAI, is done in one of two ways:
1.) Either by having the OPENAI_API_KEY
ENV variable available in your environment
or
2.) Via having a .llms
folder in the working directory, with a openai
file inside of it, which contains OPENAI_API_KEY="sk-..."
.
Example: AWS Bedrock - Claude v3 Haiku
Next, let's switch to Claude v3 Haiku via AWS Bedrock.:
from llms.aws import *
answer = claude_3_haiku().run("what llm are you?")
The authentication, in this case for AWS Bedrock, is done in one of two ways:
1.) Either by having the AWS_ACCESS_KEY_ID
, AWS_SECRET_ACCESS_KEY, and
AWS_REGION` ENV variables available in your environment
or
2.) Via having a .llms
folder in the working directory, with an aws
file inside of it, which contains AWS_ACCESS_KEY_ID
, AWS_SECRET_ACCESS_KEY, and
AWS_REGION`.
Example: Ollama - Llama3
Next, let's switch to Llama3 via Ollama:
Ollama does not export any LLMs. This is because it is technically a pass-through to any models that you have setup.
This means you will NOT see it list any LLMs explicitly:
from llms import *
ollama.list()
What you would do in this case is pass the model directly to the module, for example:
from llms import *
answer = ollama("llama3").run("what llm are you?")
By default, Ollama will use the "Chat" interface. If you have a model
that does not support Chat, you can disable chat:
answer = ollama("llama3", chat=False).run("what llm are you?")
While Ollama has no authentication, we use the "BASE_URL" instead and it is still done in one of two ways:
1.) Either by having the OLLAMA_BASE_URL
ENV variable available in your environment
or
2.) Via having a .llms
folder in the working directory, with a ollama
file inside of it, which contains OLLAMA_BASE_URL="http://fqdn:11434"
See Authentication for more authentication information and examples.
See Detailed Usage and Examples for more usage and options (including advanced parameters) options.
Authentication
There are two ways of authenticating the various providers within Easy-LLMS:
1.) Via ENV variables
or
2.) Via .llms
folder in the working directory, with a openai
file, azure
file, aws
file, and google
file, inside the llms directory, as needed.
Here are the AUTH requirements for each of the providers in ENV variables and files (choose one, not both)
OpenAI
1.) ENV variables:
export OPENAI_API_KEY="sk-..."
or
2.) openai
file with:
OPENAI_API_KEY="sk-..."
Azure
1.) ENV variables:
export AZURE_OPENAI_API_KEY=...
export AZURE_OPENAI_API_VERSION="YYYY-MM-DD"
export AZURE_OPENAI_ENDPOINT="https://your-endpoint.openai.azure.com"
or
2.) azure
file with:
AZURE_OPENAI_API_KEY=...
AZURE_OPENAI_API_VERSION="YYYY-MM-DD"
AZURE_OPENAI_ENDPOINT="https://your-endpoint.openai.azure.com"
AWS
1.) ENV variables:
export AWS_ACCESS_KEY_ID="AKIA..."
export AWS_SECRET_ACCESS_KEY="..."
export AWS_REGION="us-east-1"
or
2.) aws
file with:
AWS_ACCESS_KEY_ID="AKIA..."
AWS_SECRET_ACCESS_KEY="..."
AWS_REGION="us-east-1"
1.) ENV variables:
export GCP_PROJECT_ID="project-name"
export GCP_REGION="us-central1"
export GCP_CREDENTIALS_JSON='{ "type": "service_account", "project_id": "project-name", "private_key_id": "...", "private_key": "-----BEGIN PRIVATE KEY-----\n...=\n-----END PRIVATE KEY-----\n", "client_email": "project@project-name.iam.gserviceaccount.com", "client_id": "...", "auth_uri": "https://accounts.google.com/o/oauth2/auth", "token_uri": "https://oauth2.googleapis.com/token", "auth_provider_x509_cert_url": "https://www.googleapis.com/oauth2/v1/certs", "client_x509_cert_url": "https://www.googleapis.com/robot/v1/metadata/x509/project%40project-name.iam.gserviceaccount.com", "universe_domain": "googleapis.com" }'
or
2.) google
file with:
GCP_PROJECT_ID="project-name"
GCP_REGION="us-central1"
GCP_CREDENTIALS_JSON='{
"type": "service_account",
"project_id": "project-name",
"private_key_id": "...",
"private_key": "-----BEGIN PRIVATE KEY-----\n...=\n-----END PRIVATE KEY-----\n",
"client_email": "project@project-name.iam.gserviceaccount.com",
"client_id": "...",
"auth_uri": "https://accounts.google.com/o/oauth2/auth",
"token_uri": "https://oauth2.googleapis.com/token",
"auth_provider_x509_cert_url": "https://www.googleapis.com/oauth2/v1/certs",
"client_x509_cert_url": "https://www.googleapis.com/robot/v1/metadata/x509/project%40project-name.iam.gserviceaccount.com",
"universe_domain": "googleapis.com"
}'
Ollama
1.) ENV variables:
export OLLAMA_BASE_URL="http://fqdn:11434"
or
2.) ollama
file with:
OLLAMA_BASE_URL="http://fqdn:11434"
Detailed Usage and Examples
1.) Look at Supported Providers and Models to get a list of Providers and Models - a one time task!
2.) Then make sure you have setup the proper Authentication - a one time task!
3.) The easiest and quickest way to use everything:
Easy "Import All" and call any Model
from llms import *
From here, you can call any provider/model available to you:
answer = gpt_4o().run("what llm are you?")
or
answer = claude_3_haiku().run("what llm are you?")
For example, you can run one question against multiple providers like this:
from llms import *
question = "Who was the first person to step on the moon?"
# OpenAI
answer = gpt_4o().run(question)
# AWS
answer = claude_3_haiku().run(question)
# Google
answer = gemini_pro_1_5().run(question)
It is worth noting again for that Ollama, you will specify the model directly, such as:
from llms import *
question = "Who was the first person to step on the moon?"
answer = ollama("llama3").run(question)
answer = ollama("mistral").run(question)
And again, if your Ollama model does not support "Chat", you can specify it without Chat:
from llms import *
question = "Who was the first person to step on the moon?"
answer = ollama("command-r-plus", chat=False).run(question)
Passing Advanced Options
You can pass every option/parameter that each provider and model supports.
Two widely used options/parameters are "wrapped" for you as:
temperature
andmax_tokens
For example:
answer = gpt_4o(temperature=1, max_tokens=100).run("what llm are you?")
You can pass any other option available, for example:
answer = gpt_4o(top_p=1).run("what llm are you?")
And:
answer = gpt_4o(top_p=1, presence_penalty=1, frequency_penalty=1).run("what llm are you?")
For Ollama, it's as simple as:
answer = ollama("llama3", top_p=1, presence_penalty=1, frequency_penalty=1).run("what llm are you?")
Loading Providers and Models more efficiently
If you are only working with one provider, you can only load that provider. For example, you only need OpenAI, you can:
from llms.openai import *
answer = gpt_4o().run(...)
You can do the same with Azure:
from llms.azure import *
or AWS:
from llms.aws import *
or Google:
from llms.google import *
You can further narrow it down to one model. For example, let's say you only need "gpt-4o" from OpenAI, you can load it as:
from llms.openai import gpt_4o
answer = gpt_4o().run(...)
Again, Ollama is the only one that you CANNOT load with:
from llms.ollama import *
answer = ollama("llama3").run(...)
This is due to Ollama not exporting any LLMs.
You would have load it with:
from llms import ollama
answer = ollama("llama3").run(...)
Supported Providers and Models
You can list out the models with:
from llms import llms
print(llms.list())
Which will produce:
['openai', 'azure', 'aws', 'google', 'ollama']
And from there, you can list each as:
from llms import openai
print(openai.list())
from llms import azure
print(azure.list())
from llms import aws
print(aws.list())
from llms import google
print(google.list())
# NOTE: Ollama does not export any LLMs.
# This is because it is technically a pass-through to any models that you have setup.
# That means you will see an empty list "[]" of exported LLMs
from llms import ollama
print(ollama.list())
Which will produce the models you can use:
['gpt_35_turbo', 'gpt_35_turbo_16k', 'gpt_4_turbo', 'gpt_4o', 'gpt_4', 'gpt_4_32k']
['azure_gpt_35_turbo', 'azure_gpt_35_turbo_16k', 'azure_gpt_4_turbo', 'azure_gpt_4', 'azure_gpt_4_32k']
['claude_3_haiku', 'claude_3_sonnet', 'claude_3_opus', 'claude_1_instant', 'claude_1', 'llama2_70b', 'llama3_8b_instruct', 'llama3_70b_instruct', 'mistral_7b_instruct', 'mistral_large', 'mistral_small', 'mixtral_8x7b_instruct', 'cohere_command_14', 'cohere_command_light_14', 'j2_mid', 'j2_ultra', 'titan_lite_v1', 'titan_express_v1', 'titan_premier_v1']
['gemini_pro_1', 'gemini_pro_1_5', 'gemini_flash_1_5', 'bison', 'bison_32k']
# NOTE: Ollama is empty due to not exporting any LLMs
[]
I also made a more user-friendly list of the models across OpenAI, Azure, AWS, Google, and Ollama:
# AWS Anthropic:
* claude-3-haiku
* claude-3-sonnet
* claude-3-opus
* claude-v2
* claude-instant
# AWS Meta:
* llama2-13b
* llama2-70b
* llama3-8b
* llama3-70b
# AWS Mistral and Mixtral:
* mistral-7b-instruct
* mistral-large
* mistral-small
* mixtral-8x7b-instruct
* mixtral-large
# AWS Cohere:
* cohere-command-v14
* cohere-command-light-v14
* [support coming soon] cohere.command-r-v1:0
* [support coming soon] cohere.command-r-plus-v1:0
# AWS AI21 Labs:
* ai21-j2-mid
* ai21-j2-ultra
# AWS Amazon (Titan):
* amazon-titan-lite
* amazon-titan-express
* amazon-titan-premier
# MS Azure:
* azure-gpt-4-turbo-preview
* azure-gpt-3.5-turbo
* azure-gpt-4
* azure-gpt-3.5-turbo-16k
* azure-gpt-4-32k
# OpenAI:
* gpt-4-turbo
* gpt-4-turbo-preview
* gpt-3.5-turbo
* gpt-4o
* gpt-4
* gpt-3.5-turbo-16k
* gpt-4-32k
# Google Vertex:
* google-chat-bison
* google-chat-bison-32k
* google-gemini-pro-1.0
* google-gemini-pro-1.5-preview
* google-gemini-flash-1.5-preview
# Ollama
# https://ollama.com/library
* Any "Chat" model
* Any "Invoke" model
Assumptions?
-
You have python3 (ideally 3.11.x+, although it should work 3.6.x+)
-
You have a python venv setup
-
You have
pip install wheel
in your venv -
You have installed Easy-LLMS:
pip install easy-llms
orpip install git+https://github.com/ventz/easy-llms
-
You have API access to the providers/models you want to use: OpenAI; MS Azure; AWS Bedrock; Google Vertex; Ollama See Supported Providers and Models for more information.
Help/Questions/Comments
Please feel free to open GitHub Issues for all Questions and Comments. PRs are always welcome and encouraged! The goal is to make this project better!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file easy_llms-0.1.4.tar.gz
.
File metadata
- Download URL: easy_llms-0.1.4.tar.gz
- Upload date:
- Size: 15.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d96010990046a998c2cb96abf5d687ba07d3ee76daf8c261c5157111ed501c0e |
|
MD5 | f7be55a0a1cc22f9c7bf4efa9f44a60b |
|
BLAKE2b-256 | 149cd62615e9e2d657c7cb366ccf660abb6ab3b9e6fa7960ed8a11836ce92fbb |
File details
Details for the file easy_llms-0.1.4-py3-none-any.whl
.
File metadata
- Download URL: easy_llms-0.1.4-py3-none-any.whl
- Upload date:
- Size: 33.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.11.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5c3b15549927f37758e76291743023869c52756ddcd4f2507730b598ae22c7c6 |
|
MD5 | bbbc0f28574d623fedc0a8df42feb501 |
|
BLAKE2b-256 | 19490b3b4f7daafd41a4338c10b934011e38f3aa61c3f471384e79e16438bc63 |