A tool for building gRPC-based model backends for LeapfrogAI

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Project description

LeapfrogAI Logo

Project Goal
Why Host Your Own LLM?
Features
Getting Started
Usage
License
Community

Project Goal

LeapfrogAI is designed to provide AI-as-a-service in egress limited environments. This project aims to bridge the gap between resource-constrained environments and the growing demand for sophisticated AI solutions, by enabling the hosting of APIs that provide AI-related services.

Our services include vector databases, completions with models like Large Language Models (LLMs), and the creation of embeddings. These AI capabilities can be easily accessed and integrated with your existing infrastructure, ensuring the power of AI can be harnessed irrespective of your environment's limitations.

Why Host Your Own LLM?

Large Language Models (LLMs) are a powerful resource for AI-driven decision making, content generation, and more. However, the use of cloud-based LLMs can introduce limitations such as:

Data Privacy and Security: Sending sensitive information to a third-party service may not be suitable or permissible for all types of data or organizations. By hosting your own LLM, you retain full control over your data.
Cost: Pay-as-you-go AI services can become expensive, especially when large volumes of data are involved. Running your own LLM can often be a more cost-effective solution in the long run.
Customization and Control: By hosting your own LLM, you have the ability to customize the model's parameters, training data, and more, tailoring the AI to your specific needs.
Latency: If your application requires real-time or near-real-time responses, hosting the model locally can significantly reduce latency compared to making a round trip to a remote API.

Features

LeapfrogAI provides an API that closely matches that of OpenAI's. This feature allows tools that have been built with OpenAI/ChatGPT to function seamlessly with LeapfrogAI as a backend. This compatibility greatly simplifies the transition process for developers familiar with OpenAI's API, and facilitates easy integration with existing systems.
Vector Databases: Our vector database service allows you to perform efficient similarity searches on large scale databases. This feature can be utilized to augment prompts with responses from VectorDBs, enhancing the contextual awareness of the model.
Fine-Tuning Models: One of the key strengths of LeapfrogAI is its ability to leverage customer specific data. We provide capabilities to fine-tune models with your data, enabling the AI to better understand your domain and provide more accurate and contextually relevant outputs.
Embeddings Creation: Embeddings are fundamental to the working of many AI algorithms. LeapfrogAI provides services to generate embeddings which can be used for a variety of tasks such as semantic similarity, clustering, and more.

Getting Started

Setting up the Kubernetes Cluster

K3d

There's a Zarf package that deploys a k3d cluster with GPU support here. To deploy the zarf package simply:

zarf package deploy oci://ghcr.io/runyontr/zarf-package-k3d/k3d-local:v1.26.0-amd64

on a node with at least 1 GPU

EKSCTL

eckctl create cluster -f config.yaml
zarf init -a amd64
zarf package deploy oci://ghcr.io/defenseunicorns/packages/big-bang-distro-k3d/big-bang-distro-k3d:0.0.1-amd64

Deploy

zarf package create
zarf package deploy zarf-package-leapfrogai-amd64-0.1.1.tar.zst

Configure DNS

Ensure that the DNS record for *.bigbang.dev points to the load balancer for Istio. By default this DNS record points at localhost, so for the k3d deployment, this should work out of the box with the load balancers configured. For a remote EKS deployment, you may need to

The OpenAI API service is hosted and then uses GRPC to talk to the embedding server and the alpaca-lora-7B instance

Usage

Reference one of the ipythonnotebooks that showcase a simple getting started.

Leapfrog AI

Leapfrog AI is a deployable AI-as-a-service that brings the capabilities of AI models to egress limited environments by allowing teams to deploy APIs that mirror OpenAI's spec. Teams are able to use tools built around OpenAIs models in their own environment, preventing the release of proprietary and sensitive data to SaaS tools.

In addition, tools like Weaviate are deployed to allow for the creation of content augmented applications.

Create the API Server

See the Getting Started Notebook for example of using the API with the OpenAI python module.

Contributing

Building `leapfrogai` and updating PyPi

Change the version in pyproject.toml
python3 -m pip install --upgrade build hatchling twine
python3 -m build
python3 -m twine upload dist/*

Project details

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 3 - Alpha
License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

0.4.0

Nov 30, 2023

0.4.0rc1 pre-release

Nov 28, 2023

0.3.3

Sep 15, 2023

0.3.0

Jul 10, 2023

This version

0.2.0

Jun 28, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

leapfrogai-0.2.0.tar.gz (17.8 kB view hashes)

Uploaded Jun 28, 2023 Source

Built Distribution

leapfrogai-0.2.0-py3-none-any.whl (30.9 kB view hashes)

Uploaded Jun 28, 2023 Python 3

Hashes for leapfrogai-0.2.0.tar.gz

Hashes for leapfrogai-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`5f84d7a0c3a7188cc9c39e59fb305b378a23fefb6b606628d5aa1c5105be9e5b`
MD5	`1f860519a1a316d47b55ebcaf75e5091`
BLAKE2b-256	`db2a608b736960c95118a1cfb5e0b68275e39b9a38edf17956f6bd94ff5cec43`

Hashes for leapfrogai-0.2.0-py3-none-any.whl

Hashes for leapfrogai-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6edfc92660d25d9711f715a62993de8f63accc504ed3be080274400385e42393`
MD5	`a4c0431c5dc51164f09478cbb6996601`
BLAKE2b-256	`9870a9866deae6572a9529881515253239d04a74d248d302bd921ca7722dbdb5`

leapfrogai 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Table of Contents

Project Goal

Why Host Your Own LLM?

Features

Getting Started

Setting up the Kubernetes Cluster

K3d

EKSCTL

Deploy

Configure DNS

Usage

Leapfrog AI

Create the API Server

Contributing

Building `leapfrogai` and updating PyPi

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

leapfrogai 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Table of Contents

Project Goal

Why Host Your Own LLM?

Features

Getting Started

Setting up the Kubernetes Cluster

K3d

EKSCTL

Deploy

Configure DNS

Usage

Leapfrog AI

Create the API Server

Contributing

Building leapfrogai and updating PyPi

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

Building `leapfrogai` and updating PyPi