Large Language Model DevOps - a package to help deploy LLMs to the Cloud.
Project description
Introduction
This is the Sunholo Python project, a comprehensive toolkit for working with language models and vector stores on Google Cloud Platform. It provides a wide range of functionalities and utilities to facilitate the development and deployment of language model applications.
This is the sunholo-py project. It provides a collection of useful tools and functionalities for working with language models, databases, and various bots.
sunholo-py
(draft release https://pypi.org/project/sunholo/ )
Table of Contents
pip install sunholo
A python library to enable LLMOps within cloud environments
sunholo
provides utilities to help manage LLM operations on Google Cloud Platform at first, but it is hoped that making it open source will help it support other clouds in the future. A lot of the functionality is not Google Cloud Platform specific, so still may be helpful.
It is derived from the Edmonbrain project, the original blog post you can read here: https://code.markedmondson.me/running-llms-on-gcp/ and owes a lot to Langchain ( https://github.com/langchain-ai/langchain )
The package includes:
agents/
- functions for working with agents, including easy flask apps, parsing chat history and dispatching requests to different agent endpointsarchive/
- functions to record all Q&A activity to BigQuery via PubSubbots/
- functions for special cases regarding frontend bots such as GChat, Web Apps, Discord and Slackchunker/
- functions to slice up documents for sending into vector storescomponents/
- functions to help configure which LLM, prompt, vectorstore or document retriever you will use based on a yaml config filedatabase/
- database setup functions and SQL to run on those sources such as Supabaseembedder/
- functions to send chunks into embedding vector storespubsub/
- use of PubSub for a message queue between componentsqna/
- utilities for running agents such as retry strats and parsing of output/inputstreaming/
- creation of streaming responses from LLM botssummarise/
- creation of summaries of large documentsutils/
- reading configuration files, Google Cloud Platform metadata
Configuration
The library uses the config specifications that some examples are given in the config/
folder.
When using the functions, make sure to have the config/
folder in the root of where your application is running (usually $HOME/config
)
Copyright [2024] [Holosun ApS]
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for sunholo-0.21.15-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2cd79e772e89b80fbac354865f949e3b8d0d1fc5eab6652dbfa0d648ce496884 |
|
MD5 | c7ac6f8ea6c45022301f05b45402e25f |
|
BLAKE2b-256 | 08ae606cf7325c72c0b0441f9cd6ecc212839d1a06e7c99502f2eb0b4dfd776c |