Switch your current LLM with a finetuned one automatically, no additional latency
Logos Shift
Replace expensive GPT/Claude calls with smaller, faster finetuned Llama/Mistral automatically
Integrating Large Language Models (LLMs) into production systems can be a convoluted process, with myriad challenges to overcome. While several tools offer call instrumentation, Logos Shift sets itself apart with its game-changing feature: automated rollout of your fine-tuned model. Just integrate with a single line of code and let us notify you when your fine-tuned model is ready for deployment.
You can also do this yourself for free: Logos Shift is the simplest way for hackers to do it.
Pssst: Can do this for any API, not just LLMs
Key Feature
- Effortless A/B Rollout: Once your fine-tuned model achieves readiness, it's rolled out as an A/B test. No manual intervention, no complex configurations. Just simplicity.
Other Features
- No Proxying: Direct calls, eliminating the latency of proxying.
- Retain Your OpenAI Key: Your OpenAI key remains yours, safeguarding confidentiality. No leaked keys.
- Feedback-Driven Finetuning: Refine model performance with feedback based on unique result IDs.
- Open Source: Flexibility to modify and adapt as needed.
- Dynamic Configuration: Synchronize with server configurations on-the-fly.
- Simplicity: Simple is beautiful. Minimal dependencies.
- No lock-in: Optionally save data to a local drive if you want to train the model yourself.
- Upcoming:
- Error fallback mechanisms
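The automated A/B rollout can be pictured as a weighted router in front of the two models. The sketch below is purely illustrative (the function names and the `rollout_fraction` parameter are hypothetical, not the library's API):

```python
import random

def ab_route(call_expensive, call_cheap, rollout_fraction, rng=random.random):
    """Route one request: send `rollout_fraction` of traffic to the cheap
    fine-tuned model and the rest to the expensive one."""
    if rng() < rollout_fraction:
        return call_cheap()
    return call_expensive()

# Example: start by sending 20% of calls to the fine-tuned model.
result = ab_route(lambda: "gpt-4 answer", lambda: "finetuned answer", 0.2)
```

In practice the rollout fraction would be ramped up as feedback confirms the fine-tuned model performs well.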
Why
At Bohita, our pioneering efforts in deploying Large Language Models (LLMs) in production environments have brought forth unique challenges, especially concerning cost management, latency reduction, and optimization. The solutions available in the market weren't adequate for our needs, prompting us to develop and subsequently open-source some of our bespoke tools.
On the subject of proxying: We prioritize the reliability and uptime of our services. By introducing an additional domain as a dependency, we'd inherently be reducing our uptime. Specifically, the combined uptime would be (1 - (P_A_up_B_down + P_B_up_A_down + P_both_down)), which is necessarily less than the uptime of either individual service. Given the inherent unpredictability of APIs in today's landscape, compromising our reliability in this manner is not a trade-off we're willing to make.
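The arithmetic can be checked directly. Assuming independent failures and, say, 99.9% uptime for each service:

```python
u_a, u_b = 0.999, 0.999          # 99.9% uptime each

p_a_up_b_down = u_a * (1 - u_b)
p_b_up_a_down = u_b * (1 - u_a)
p_both_down = (1 - u_a) * (1 - u_b)

combined = 1 - (p_a_up_b_down + p_b_up_a_down + p_both_down)
# With independent failures this reduces to the simple product u_a * u_b,
# which is always below either service's individual uptime.
assert abs(combined - u_a * u_b) < 1e-12
print(f"{combined:.4%}")          # ~99.80%, less than either 99.9%
```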
Getting Started
Prerequisites
- Obtain an API key from Bohita Logos Shift Portal.
Installation
pip install logos_shift_client
Basic Usage
from logos_shift_client import LogosShift
# Initialize with your API key (omit it if you only want a local copy)
logos_shift = LogosShift(api_key="YOUR_API_KEY")
# Instrument your function
@logos_shift()
def add(x, y):
    return x + y

result = add(1, 2)
# Optionally, provide feedback
logos_shift.provide_feedback(result['bohita_logos_shift_id'], "success")
How It Works
Here's a high-level overview:
graph LR
A["Client Application"]
B["Logos Shift Client"]
C["Buffer Manager Thread"]
D["Logos Server"]
E["Expensive LLM Client"]
F["Cheap LLM Client"]
G["API Router A/B Test Rollout"]
A -->|"Function Call"| B
B -->|"Capture & Buffer Data"| C
C -->|"Send Data"| D
B -->|"Route API Call"| G
G -->|"Expensive API Route"| E
G -->|"Cheap API Route"| F
classDef mainClass fill:#0077b6,stroke:#004c8c,stroke-width:2px,color:#fff;
classDef api fill:#90e0ef,stroke:#0077b6,stroke-width:2px,color:#333;
classDef buffer fill:#48cae4,stroke:#0077b6,stroke-width:2px,color:#333;
classDef expensive fill:#d00000,stroke:#9d0208,stroke-width:2px,color:#fff;
classDef cheap fill:#52b788,stroke:#0077b6,stroke-width:2px,color:#333;
class A,B,D,G mainClass
class E expensive
class F cheap
class C buffer
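A minimal sketch of the capture-and-buffer pattern in the diagram (all names here are hypothetical illustrations, not the actual client internals): the decorator returns the result immediately, and the record merely waits in a queue for a background thread to ship to the server, which is why instrumentation adds no latency to the call itself.

```python
import functools
import queue

buffer = queue.Queue()  # stand-in for the Buffer Manager Thread's queue

def instrument(dataset="default"):
    """Sketch of a capture decorator: record inputs and outputs without
    blocking the call (sending happens on another thread)."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            result = fn(*args, **kwargs)
            buffer.put({"dataset": dataset, "args": args,
                        "kwargs": kwargs, "result": result})
            return result
        return wrapper
    return decorator

@instrument(dataset="demo")
def add(x, y):
    return x + y

add(1, 2)  # returns immediately; the record sits in the buffer
```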
Dataset
All function calls are grouped into datasets. Think of a dataset as the use case those calls serve. If you are instrumenting just one call, you don't need to do anything; the default dataset is 'default'.
If you have different use cases in your application (e.g., a chatbot for sales vs. a chatbot for help), you should separate them out.
@logos_shift(dataset="sales")
def add_sales(x, y):
    return x + y

@logos_shift(dataset="help")
def add_help(x, y):
    return x + y
This lets you track and fine-tune each use case separately.
Metadata
You can provide additional metadata, including user_id, which can be used for routing decisions based on user-specific details.
@logos_shift()
def multiply(x, y, logos_shift_metadata={"user_id": "12345"}):
    return x * y
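The decorator presumably strips logos_shift_metadata out of the call before your function runs, so the metadata reaches the router without changing your function's signature. A hypothetical sketch of that pattern (`with_metadata` and `captured` are illustrative names, not the library's API):

```python
import functools

captured = []   # stands in for the library's routing/metadata sink

def with_metadata(fn):
    """Sketch: pop `logos_shift_metadata` from kwargs so the wrapped
    function never receives it, while the router can still read it."""
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        metadata = kwargs.pop("logos_shift_metadata", {})
        captured.append(metadata)   # e.g. user_id could drive routing here
        return fn(*args, **kwargs)
    return wrapper

@with_metadata
def multiply(x, y):
    return x * y

multiply(2, 3, logos_shift_metadata={"user_id": "12345"})
```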
Feedback
With feedback, you get models that are cheaper and more effective. If you don't provide feedback, fine-tuning proceeds auto-regressively on the captured calls as usual.
Configuration Retrieval
The library will also support retrieving configurations every few minutes, ensuring your logos_shift adapts to dynamic environments.
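Periodic retrieval like this is commonly implemented with a background polling thread. The sketch below is a generic illustration under that assumption, not the library's actual mechanism; `fetch_config` and `apply_config` are hypothetical callables:

```python
import threading

def start_config_poller(fetch_config, apply_config, interval_s=300.0):
    """Sketch: periodically fetch server-side configuration and apply it."""
    stop = threading.Event()

    def poll():
        while not stop.is_set():
            apply_config(fetch_config())
            stop.wait(interval_s)   # sleep, but wake immediately on stop

    threading.Thread(target=poll, daemon=True).start()
    return stop  # call stop.set() to shut the poller down
```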
Local Copy
Initialize with a filename to keep a local copy. You can also run it without Bohita; just set api_key to None.
logos_shift = LogosShift(api_key="YOUR_API_KEY", filename="api_calls.log")
# You can also disable sending data to Bohita. However, you will lose the automatic routing.
logos_shift = LogosShift(api_key=None, filename="api_calls.log")
Best Practices
When using Logos Shift to integrate Large Language Models (LLMs) into your applications, it’s crucial to tailor the integration to the specific outputs and outcomes that are most relevant to your use case. Below are some best practices to help you maximize the effectiveness of Logos Shift.
Focus on Relevant Outputs
When instrumenting your functions with Logos Shift, aim to return the specific parts of the output that are most pertinent to your application’s needs.
Not Recommended:
@logos_shift(dataset="story_raw")
def get_story(model, messages):
    """Generates a story"""
    completion = openai.ChatCompletion.create(model=model, messages=messages)
    return completion
In the above example, the entire completion object is returned, which might include a lot of information that is not directly relevant to your application.
Recommended:
@logos_shift(dataset="story")
def get_story(model, messages):
    """Generates a story"""
    completion = openai.ChatCompletion.create(model=model, messages=messages)
    result = completion.to_dict_recursive()
    story_d = {'story': result['choices'][0]['message']}
    return story_d
In this improved version, the function returns a dictionary with just the story part of the completion. This approach ensures that the data captured by Logos Shift is more concise and directly related to your application's primary function.
Providing Feedback
Providing feedback on specific outcomes is crucial for fine-tuning your models and ensuring accurate A/B test rollouts.
story_d = get_story(model, messages)
# ... your application logic ...
logos_shift.provide_feedback(story_d['bohita_logos_shift_id'], "success")
In this example, provide_feedback is called with the result ID and an outcome string ("success" in this case). This helps in two ways:
- Accurate A/B Test Measurements: The feedback ensures that the A/B test rollouts are based on actual outcomes, providing a true measure of the model’s performance in real-world scenarios.
- Targeted Fine-Tuning: By providing feedback on specific outcomes, you help in fine-tuning the model to better suit your application’s needs, leading to more effective and efficient model performance over time.
Adopting these best practices will help you leverage the full potential of Logos Shift, ensuring that your integration with LLMs is not just seamless but also highly effective and outcome-driven.
Contribute
Feel free to fork, open issues, and submit PRs. For major changes, please open an issue first to discuss what you'd like to change.
License
This project is licensed under the MIT License.