
Gai/Lib: Universal LLM Wrapper Client

Universal Multi-Modal Wrapper Client for LLM inferencing. The library provides a simplified, unified interface for switching seamlessly between multi-modal open source language models running on a local machine and the OpenAI APIs. It is intended for developers who target multi-modal LLMs across both the OpenAI API and local models. Gai/Lib is the client-side counterpart of the Gai/Gen library.

Table of Contents

1. Introduction
2. Disclaimer
3. Credits
4. License
5. Configuration
6. Quick Start
7. API Endpoints

1. Introduction

The Universal Multi-Modal Wrapper Client Library (Gai/Lib) is a client-side library designed to simplify the development of LLM applications that require connectivity to both open source and proprietary LLM services. It is also the client-side library for the Gai/Gen Universal LLM Wrapper Services, which provide a common wrapper over open source and proprietary LLMs, using OpenAI's API specification as a reference.

The design principle can be described as a reverse hub-and-spoke model, in which a single client may need connections to multiple LLM services at any one time, as in a multi-agent chatbot.
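As a preview of the Quick Start section, the sketch below shows a single GGG client fanning out to more than one service simply by switching the category and generator arguments:

```python
from gai.lib.GGG import GGG

ggg = GGG()  # one client, many services

# Route to a local open source model (the default ttt generator) ...
for chunk in ggg(category="ttt", messages="user: Hello\nassistant:"):
    print(chunk.decode(), end="", flush=True)

# ... and to a proprietary service, using the same client.
for chunk in ggg(category="ttt", generator="gpt-4", messages="user: Hello\nassistant:"):
    print(chunk.decode(), end="", flush=True)
```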

2. Disclaimer

Maintainers of this repo are not responsible for the actions of third parties who use the models. Please consult an attorney before using models for commercial purposes.

3. Credits

4. License

This project is licensed under the MIT License - see the LICENSE file for details.

5. Configuration

Step 1. Create a .gairc file in your home directory. This file contains the default configurations for Gai.

{
    "app_dir": "~/gai"
}
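The file is plain JSON, so an application can resolve the app directory with nothing but the standard library. A minimal sketch (not part of the library's API):

```python
import json
import os

# Read ~/.gairc and expand the app_dir it points to.
with open(os.path.expanduser("~/.gairc")) as f:
    rc = json.load(f)

app_dir = os.path.expanduser(rc["app_dir"])  # e.g. /home/<user>/gai
print(app_dir)
```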

Step 2. Create a ~/gai directory and create gai.yml inside it.

mkdir ~/gai

Copy gai.yml from this repository into ~/gai. This file contains the network configuration for connecting to each model. For example:

ttt:
    ...
    mistral7b-exllama:
        url: "https://gaiaio.ai/api/mistral7b-exllama/v1/chat"
        hyperparameters:
            temperature: 1.2
            top_p: 0.15
            min_p: 0.0
            top_k: 50
            max_new_tokens: 1000
            typical: 0.0
            token_repetition_penalty_max: 1.25
            token_repetition_penalty_sustain: 256
            token_repetition_penalty_decay: 128
            beams: 1
            beam_length: 1
    ...
stt:
    ...
    whisper-transformers:
        url: "https://gaiaio.ai/api/whisper-transformers/v1/audio"
tts:
    ...
    xtts-2:
        url: "https://gaiaio.ai/api/xtts-2/v1/audio"
itt:
    ...
    llava-transformers:
        url: "https://gaiaio.ai/api/llava-transformers/v1/vision"
rag:
    default: rag
    rag:
        url: "https://gaiaio.ai/api/rag/v1/chat"

Step 3. Store all API keys in a .env file in the ~/gai directory.

For example,

```.env
OPENAI_API_KEY=<--replace-with-your-api-key-->
ANTHROPIC_API_KEY=<--replace-with-your-api-key-->
```

The final user directory structure should look like this:

home
├── gai
│   ├── .env
│   └── gai.yml
└── .gairc

6. Quick Start

Setup

It is recommended to use a virtual environment such as venv or miniconda before installing the library, to avoid package conflicts. The following example shows how to create the gai-lib environment with miniconda.

conda create -n gai-lib python=3.10.10 -y

Switch to the virtual environment

conda activate gai-lib

With the virtual environment activated, install Gai/Lib from the root of this repository.

pip install -e .

Save your OpenAI API key in ~/gai/.env:

OPENAI_API_KEY=<--replace-with-your-api-key-->

Examples

You only need the GGG class to perform all of the tasks below.

from gai.lib.GGG import GGG
ggg = GGG()

Text-to-Text (TTT)

Chat

Chat using Mistral-7B

print("> Mistral-7B")
for chunk in ggg(category="ttt",messages="user: Tell me a one paragraph story\nassistant:"):
    print(chunk.decode(),end="",flush=True)
print("\n")


Chat using OpenAI GPT-4

print("> OpenAI")
for chunk in ggg(category="ttt",generator="gpt-4",messages="user: Tell me a one paragraph story\nassistant:"):
    print(chunk.decode(),end="",flush=True)
print("\n")


Chat using OpenAI's API

print("> OpenAI API")

import os
import openai
from openai import OpenAI
from dotenv import load_dotenv
load_dotenv()
if not os.environ.get("OPENAI_API_KEY"):
    raise Exception(
        "OPENAI_API_KEY not found in environment variables")
openai.api_key = os.environ["OPENAI_API_KEY"]
client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role":"user","content":"Tell me a one paragraph story"}],
    stream=True,
    max_tokens=100,
)
for chunk in response:
    if chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content,end="",flush=True)


Function Call

Function calling is one of the most powerful features of GPT-4: the LLM returns a function call that the user can then execute against an external function. In this example, we demonstrate function calling via the original OpenAI API as well as with an open source model.

First, create a list of tools for the LLM.

# Load Tools
import json
with (open("./tests/lib/ttt/tools/tools.txt", "r")) as f:
    tools = json.load(f)
    print(json.dumps(tools, indent=4))

# Print Tool: accumulate the tool name and arguments from the
# streamed response chunks, then pretty-print the assembled call.
def print_tool(response):
    tool = {}
    for chunk in response:
        decoded = chunk.decode()
        if "tool_name" in decoded:
            tool["tool_name"] = decoded[0]["tool_name"]
        if "tool_arg" in decoded:
            tool["tool_arg"] = decoded[0]["tool_arg"]
    print(json.dumps(tool, indent=4) + "\n")

The tools will look like this.

[
    {
        "type": "function",
        "function": {
            "name": "gg",
            "description": "Search google based on the provided search query",
            "parameters": {
                "type": "object",
                "properties": {
                    "search_query": {
                        "type": "string",
                        "description": "The search query to search google with"
                    }
                },
                "required": ["search_query"]
            }
        }
    },
    {
        "type": "function",
        "function": {
            "name": "scrape",
            "description": "Scrape the content of the provided url",
            "parameters": {
                "type": "object",
                "properties": {
                    "url": {
                        "type": "string",
                        "description": "The url to scrape the content from"
                    }
                },
                "required": ["url"]
            }
        }
    }
]

Run on Mistral-7B

# Mistral7B
print("> Mistral-7B")
response = ggg(category="ttt",
               messages="user: Tell me the latest news on Singapore\nassistant:",
               tools=tools)
print_tool(response)

Run on OpenAI

# OpenAI
print("> OpenAI")
response = ggg(category="ttt",
               generator="gpt-4",
               messages="user: Tell me the latest news on Singapore\nassistant:",
               tools=tools)
print_tool(response)

Run on OpenAI using OpenAI's API

print("> OpenAI Original")
response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role":"user","content":"Tell me the latest news on Singapore"}],
    stream=True,
    max_tokens=100,
    tools=tools
)
tool = {"tool_arg": ""}
for chunk in response:
    if chunk.choices[0].delta.tool_calls and chunk.choices[0].delta.tool_calls[0].function.name:
        tool["tool_name"] = chunk.choices[0].delta.tool_calls[0].function.name
    elif chunk.choices[0].delta.tool_calls and chunk.choices[0].delta.tool_calls[0].function.arguments:
        tool["tool_arg"] += chunk.choices[0].delta.tool_calls[0].function.arguments
print(json.dumps(tool, indent=4)+"\n")

All of the above examples will return the following:

{
    "tool_name": "gg",
    "tool_arg": "{ \"search_query\": \"latest news singapore\" }"
}
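To actually run the call, map tool_name to one of your own functions and pass the parsed tool_arg as keyword arguments. The gg and scrape bodies below are hypothetical stand-ins for your own implementations:

```python
import json

def gg(search_query):
    ...  # your search implementation

def scrape(url):
    ...  # your scraping implementation

handlers = {"gg": gg, "scrape": scrape}

# "tool" is the dict produced by the examples above.
result = handlers[tool["tool_name"]](**json.loads(tool["tool_arg"]))
```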

Text-to-Speech (TTS)

Set up the following text as input

from gai.common.sound_utils import play_audio

data = {
    "input": "The definition of insanity is doing the same thing over and over and expecting different results.",
    "voice": None,
    "language": None
}

Run Coqui TTS (xtts-2)

# xtts
response = ggg("tts", **data)
play_audio(response)

Run OpenAI's TTS

# openai tts
response = ggg("tts", generator="openai-tts-1", **data)
play_audio(response)

Run using OpenAI's API

# openai original
response = client.audio.speech.create(
    model="tts-1",
    voice="alloy",
    input="The definition of insanity is doing the same thing over and over and expecting different results.",
)
play_audio(response.content)

Speech-to-Text (STT)

Load Speech Sample

# sample
with open("./tests/lib/stt/today-is-a-wonderful-day.wav", "rb") as f:
    play_audio(f.read())

Transcribe using OpenAI's open source Whisper model

# OpenSource Whisper
with open("./tests/lib/stt/today-is-a-wonderful-day.wav", "rb") as f:
    output = ggg("stt", file=f)
    print(output.decode())

Transcribe using OpenAI's hosted Whisper model

# OpenAI Whisper
with open("./tests/lib/stt/today-is-a-wonderful-day.wav", "rb") as f:
    output = ggg("stt", generator="openai-whisper", file=f)
    print(output.text)

Image-to-Text (ITT)

Load Image Sample

from gai.common.image_utils import read_to_base64
import os
from IPython.display import Image,display
image_file = os.path.join("./tests/lib/itt", "buses.jpeg")
encoded_string = read_to_base64(image_file)
messages = [
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "What’s in this image?"},
            {
                "type": "image_url",
                "image_url": {
                    "url": f"data:image/jpeg;base64,{encoded_string}",
                },
            },
        ],
    }
]


Describe using Llava

# Llava
print("> Llava")
for chunk in ggg("itt",messages=messages,stream=True):
    print(chunk.decode(),end="",flush=True)
print("\n")


Describe using OpenAI's Vision

# OpenAI
print("> OpenAI")
for chunk in ggg(category="itt", generator="openai-vision", messages=messages, stream=True, max_tokens=100):
    print(chunk.decode(), end="", flush=True)
print("\n")


RAG

Index a file (i.e. chat with a PDF)

data = {
    "collection_name": "demo",
    "file_path": "./tests/lib/rag/pm_long_speech_2023.pdf",
    "metadata": {"title": "2023 National Day Rally Speech", "source": "https://www.pmo.gov.sg/Newsroom/national-day-rally-2023"},
}
response = ggg("index", **data)
print(response.json())


Question and Answer

data = {
    "collection_name": "demo",
    "query_texts": "Who are the young seniors?",
}
response = ggg("retrieve", **data)
context = response.text
question = "Who are the young seniors?"
answer = ggg("ttt", messages=f"user: Based on the context below: <context>{context}</context>, answer the question: {question}\nassistant:")
for chunk in answer:
    print(chunk.decode(), end="", flush=True)


7. API Endpoints

NOTE: You can host your own models by following the instructions in Gai/Gen Universal LLM Wrapper Services. The downside is that you must ensure the availability of your own computing resources, and you are unlikely to benefit from multiple model instances unless you run a cluster of machines.

- Text-to-Text (TTT)
Endpoint: https://gaiaio.ai/mistral7b-exllama/v1/chat/completions
Method: POST
Type: Body

| Name     | Type | Description                    | Default           |
| -------- | ---- | ------------------------------ | ----------------- |
| model    | str  | generator name                 | mistral7b-exllama |
| messages | list | see below                      |                   |
| stream   | bool | True, False                    | True              |
| ...      |      | hyperparameters based on model |                   |

Note:

  • messages
[
    { "role": "system", "content": system message },
    { "role": "user", "content": user message },
    { "role": "assistant", "content": AI message },
    ...
]
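For illustration, a raw call with the requests library might look like the sketch below. The payload fields follow the table above; the exact format of the streamed chunks is not documented here, so the printing loop is only an assumption:

```python
import requests

payload = {
    "model": "mistral7b-exllama",
    "messages": [{"role": "user", "content": "Tell me a one paragraph story"}],
    "stream": True,
}
with requests.post(
    "https://gaiaio.ai/mistral7b-exllama/v1/chat/completions",
    json=payload,
    stream=True,
) as resp:
    for line in resp.iter_lines():
        if line:
            print(line.decode())
```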

- Text-to-Speech (TTS)
Endpoint: https://gaiaio.ai/xtts-2/v1/audio/speech
Method: POST
Type: Body

| Name     | Type | Description                    | Default |
| -------- | ---- | ------------------------------ | ------- |
| model    | str  | generator name                 | xtts-2  |
| input    | str  | text to be spoken              |         |
| voice    | str  | voice id (speaker)             |         |
| language | str  | language code                  | en      |
| stream   | bool | True, False                    | True    |
| ...      |      | hyperparameters based on model |         |
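A raw call sketch (requests library), assuming a non-streaming request returns the audio bytes in the response body and that the output is wav:

```python
import requests

resp = requests.post(
    "https://gaiaio.ai/xtts-2/v1/audio/speech",
    json={"model": "xtts-2", "input": "Hello world",
          "voice": None, "language": "en", "stream": False},
)
with open("speech.wav", "wb") as f:  # output format is an assumption
    f.write(resp.content)
```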

- Speech-to-Text (STT)
Endpoint: https://gaiaio.ai/whisper-transformer/v1/audio/transcription
Method: POST
Type: Multipart Form-Data

| Name  | Type | Description       | Default |
| ----- | ---- | ----------------- | ------- |
| model | str  | generator name    |         |
| file  | file | audio file object |         |
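A raw multipart upload sketch with the requests library; the field names follow the table above, and the model value is taken from the gai.yml example in the Configuration section:

```python
import requests

with open("today-is-a-wonderful-day.wav", "rb") as f:
    resp = requests.post(
        "https://gaiaio.ai/whisper-transformer/v1/audio/transcription",
        data={"model": "whisper-transformers"},
        files={"file": f},
    )
print(resp.text)
```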

- Image-to-Text (ITT)
Endpoint: https://gaiaio.ai/llava/v1/vision/completions
Method: POST
Type: Body

| Name     | Type | Description                    | Default |
| -------- | ---- | ------------------------------ | ------- |
| model    | str  | generator name                 |         |
| messages | list | see below                      |         |
| stream   | bool | True, False                    |         |
| ...      |      | hyperparameters based on model |         |

Note:

  • messages format
[
    {
        "role": "user",
        "content": [
            {"type": "text", "text": text},
            {
                "type": "image_url",
                "image_url": {
                    "url": 'data:image/jpeg;base64,.....',
                },
            },
        ],
        ...
    }
]
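A raw call mirrors the TTT sketch above, with a base64-encoded image embedded in messages. The model name is taken from the gai.yml example, and the chunk handling is an assumption:

```python
import base64
import requests

with open("buses.jpeg", "rb") as f:
    encoded_string = base64.b64encode(f.read()).decode()

payload = {
    "model": "llava-transformers",
    "stream": True,
    "messages": [{
        "role": "user",
        "content": [
            {"type": "text", "text": "What's in this image?"},
            {"type": "image_url",
             "image_url": {"url": f"data:image/jpeg;base64,{encoded_string}"}},
        ],
    }],
}
with requests.post("https://gaiaio.ai/llava/v1/vision/completions",
                   json=payload, stream=True) as resp:
    for line in resp.iter_lines():
        if line:
            print(line.decode())
```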

- Retrieval-Augmented Generation (RAG)

a) Endpoint: https://gaiaio.ai/rag/v1/rag/index_file
Method: POST
Type: Multipart Form-Data

| Name            | Type | Description                   | Default |
| --------------- | ---- | ----------------------------- | ------- |
| collection_name | str  | collection name in the store  |         |
| file            | file | the document to be indexed    |         |
| metadata        | dict | metadata tied to the document |         |

b) Endpoint: https://gaiaio.ai/rag/v1/rag/retrieve
Method: POST
Type: Body

| Name            | Type | Description                     | Default |
| --------------- | ---- | ------------------------------- | ------- |
| collection_name | str  | collection name in the store    |         |
| query_texts     | str  | query                           |         |
| n_results       | int  | no. of nearest results returned |         |
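A sketch of both calls with the requests library; sending metadata as a JSON-encoded form field is an assumption:

```python
import json
import requests

# a) index a document (multipart form-data)
with open("pm_long_speech_2023.pdf", "rb") as f:
    resp = requests.post(
        "https://gaiaio.ai/rag/v1/rag/index_file",
        data={
            "collection_name": "demo",
            "metadata": json.dumps({"title": "2023 National Day Rally Speech"}),
        },
        files={"file": f},
    )
print(resp.status_code)

# b) retrieve the nearest chunks (JSON body)
resp = requests.post(
    "https://gaiaio.ai/rag/v1/rag/retrieve",
    json={"collection_name": "demo",
          "query_texts": "Who are the young seniors?",
          "n_results": 3},
)
print(resp.text)
```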
