LLM Proxy to reduce cost and complexity of using multiple LLMs

LLM Proxy

A low-code solution to efficiently manage multiple large language models
View Demo · Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Roadmap
  5. Contributing
  6. License

What is LLM Proxy?

LLM Proxy is a tool that sits between your application and different LLM providers. Its goal is to simplify the use of multiple LLMs through a TUI while providing cost and response optimization.

(back to top)

Getting Started

There are two ways to get started: clone the repo directly into your project, or install it as a library.

Prerequisites

  • Python 3.11+

Installation

With pip:

pip install proxyllm

With poetry:

poetry add proxyllm

Run the provided config script to set up the default configuration file:

config --default-config

If you prefer poetry:

poetry run config --default-config

If the scripts do not work, you can visit the repo and grab a copy of the configuration file manually.

Note:

  • Ensure that you have the API key for each respective provider in the .env file (you can use .env.example for reference; an illustrative sketch follows this list)
  • For Google's models, you will also need the path to your application credentials and the project ID in the .env file
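
For illustration only, a .env file might look like the sketch below; the variable names here are placeholders, so check .env.example for the names the proxy actually reads:

# Placeholder variable names -- consult .env.example for the real ones
OPENAI_API_KEY=sk-...
MISTRAL_API_KEY=...
# Google models also need application credentials and a project ID
GOOGLE_APPLICATION_CREDENTIALS=/path/to/credentials.json
GOOGLE_PROJECT_ID=your-project-id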

(back to top)

Usage

Currently, LLM Proxy provides two route types: Cost and Category.

To get started, import the LLMProxy client:

from proxyllm import LLMProxy

After setup is complete, you only need one line of code to get started:

llmproxy_client = LLMProxy()

Note: you will need to specify the path to your YAML configuration file if you did not use the default name:

llmproxy_client = LLMProxy(path_to_user_configuration="llmproxy.config.yml")

To use the proxy, simply call the route function with your prompt:

output = llmproxy_client.route(prompt=prompt)

The route function will return a CompletionResponse:

print("RESPONSE MODEL: ", output.response_model)
print("RESPONSE: ", output.response)
print("ERRORS: ", output.errors)
  • response_model: the model used to serve the request
  • response: the string response from the model
  • errors: a list of models that failed to complete the request, along with their respective errors (a combined example follows below)
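
Putting the pieces together, a minimal end-to-end sketch might look like the following. The prompt string and the error-handling loop are illustrative; only LLMProxy, route, and the CompletionResponse fields listed above come from this documentation:

from proxyllm import LLMProxy

# Create the client; the default configuration file is used unless a path is given
llmproxy_client = LLMProxy()

# Route a prompt and inspect the returned CompletionResponse
output = llmproxy_client.route(prompt="Summarize the benefits of an LLM proxy.")

print("RESPONSE MODEL:", output.response_model)
print("RESPONSE:", output.response)

# Each entry in errors describes a model that failed to complete the request
for failure in output.errors or []:
    print("FAILED:", failure)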

Important Note: Although parameters can be changed programmatically, it is best to favor the YAML configuration file. Only use the constructor parameters when you must override the YAML configuration.

(back to top)

Roadmap

  • Support for more providers
    • Replicate
    • Claude
  • Support for multimodal models
  • Custom, optimized model for category routing
  • Effectiveness Routing
  • Context Injection

See the open issues for a full list of proposed features (and known issues).

(back to top)

Contributing

LLM Proxy is open source, so we are open to, and grateful for, contributions. Open-source communities are what make software great, so feel free to fork the repo and create a pull request with the feature tag. Thanks!

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

(back to top)

License

Distributed under the MIT License. See LICENSE.txt for more information.

(back to top)

