Skip to main content

Python SDK for Laminar AI

Project description

Laminar AI

This repo provides core for code generation, Laminar CLI, and Laminar SDK.

Quickstart

python3 -m venv .myenv
source .myenv/bin/activate  # or use your favorite env management tool

pip install lmnr

Create .env file at the root and add LMNR_PROJECT_API_KEY value to it.

Read more here on how to get LMNR_PROJECT_API_KEY.

Sending events

You can send events in two ways:

  • .event(name, value) – for a pre-defined event with one of possible values.
  • .evaluate_event(name, data) – for an event that our agent checks for and assigns a value from possible values.

There are 3 types of events:

  • Number - Numeric value.
  • String - Arbitrary string.
  • Boolean - Convenient to classify if something has took place or not.

Important notes:

  • If event name does not match anything pre-defined in the UI, the event won't be saved.

Instrumentation

We provide two ways to instrument your python code:

  • With @observe() decorators and wrap_llm_call helpers
  • Manually

It is important to not mix the two styles of instrumentation, this can lead to unpredictable results.

Decorator instrumentation example

For easy automatic instrumentation, we provide you two simple primitives:

  • observe - a multi-purpose automatic decorator that starts traces and spans when functions are entered, and finishes them when functions return
  • wrap_llm_call - a function that takes in your LLM call and return a "decorated" version of it. This does all the same things as observe, plus a few utilities around LLM-specific things, such as counting tokens and recording model params.

You can also import lmnr_context in order to interact and have more control over the context of the current span.

import os
from openai import OpenAI

from lmnr import observe, wrap_llm_call, lmnr_context
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

@observe()  # annotate all functions you want to trace
def poem_writer(topic="turbulence"):
    prompt = f"write a poem about {topic}"

    # wrap the actual final call to LLM with `wrap_llm_call`
    response = wrap_llm_call(client.chat.completions.create)(
        model="gpt-4o",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
    )

    poem = response.choices[0].message.content

    if topic in poem:
        # send an event with a pre-defined name
        lmnr_context.event("topic_alignment", "good")
    
    # to trigger an automatic check for a possible event do:
    lmnr_context.evaluate_event("excessive_wordiness", poem)

    return poem

if __name__ == "__main__":
    print(poem_writer(topic="laminar flow"))

This gives an advantage of quick instrumentation, but is somewhat limited in flexibility + doesn't really work as expected with threading. This is due to the fact that we use contextvars.ContextVar for this, and how Python manages them between threads.

If you want to instrument your code manually, follow on to the next section

Manual instrumentation example

For manual instrumetation you will need to import the following:

  • trace - this is a function to start a trace. It returns a TraceContext
  • TraceContext - a pointer to the current trace that you can pass around functions as you want.
  • SpanContext - a pointer to the current span that you can pass around functions as you want

Both TraceContext and SpanContext expose the following interfaces:

  • span(name: str, **kwargs) - create a child span within the current context. Returns SpanContext
  • update(**kwargs) - update the current trace or span and return it. Returns TraceContext or SpanContext. Useful when some metadata becomes known later during the program execution

In addition, SpanContext allows you to:

  • event(name: str, value: str | int) - emit a custom event at any point
  • evaluate_event(name: str, data: str) - register a possible event for automatic checking by Laminar.
  • end(**kwargs) – update the current span, and terminate it

Example:

import os
from openai import OpenAI

from lmnr import trace, TraceContext, SpanContext, EvaluateEvent
from lmnr.semantic_conventions.gen_ai_spans import INPUT_TOKEN_COUNT, OUTPUT_TOKEN_COUNT, RESPONSE_MODEL, PROVIDER, STREAM
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

def poem_writer(t: TraceContext, topic = "turbulence"):
    span: SpanContext = t.span(name="poem_writer", input=topic)

    prompt = f"write a poem about {topic}"
    messages = [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": prompt},
    ]
    # create a child span within the current `poem_writer` span.
    llm_span = span.span(name="OpenAI completion", input=messages, span_type="LLM")
    
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Hello. What is the capital of France?"},
        ],
    )
    poem = response.choices[0].message.content
    if topic in poem:
        llm_span.event("topic_alignment", "good")  # send an event with a pre-defined name

    # note that you can register possible events here as well,
    # not only `llm_span.evaluate_event()`
    llm_span.end(
        output=poem,
        evaluate_events=[EvaluateEvent(name="excessive_wordines", data=poem)],
        attributes={
            INPUT_TOKEN_COUNT: response.usage.prompt_tokens,
            OUTPUT_TOKEN_COUNT: response.usage.completion_tokens,
            RESPONSE_MODEL: response.model,
            PROVIDER: 'openai',
            STREAM: False
        }
    )
    span.end(output=poem)
    return poem


t: TraceContext = trace(user_id="user123", session_id="session123", release="release")
main(t, topic="laminar flow")

Manual attributes

You can specify span attributes when creating/updating/ending spans.

If you use decorator instrumentation, wrap_llm_call handles all of this for you.

Example usage:

from lmnr.semantic_conventions.gen_ai_spans import REQUEST_MODEL

# span_type = LLM is important for correct attribute semantics
llm_span = span.span(name="OpenAI completion", input=messages, span_type="LLM")
llm_span.update(
    attributes={REQUEST_MODEL: "gpt-4o-mini"}
)
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello. What is the capital of France?"},
    ],
)

Semantics:

Check for available semantic conventions in lmnr.semantic_conventions.gen_ai_spans.

You can specify the cost with COST. Otherwise, the cost will be calculated on the Laminar servers, given the following are specified:

  • span_type is "LLM"
  • Model provider: PROVIDER, e.g. 'openai', 'anthropic'
  • Output tokens: OUTPUT_TOKEN_COUNT
  • Input tokens: INPUT_TOKEN_COUNT*
  • Model. We look at RESPONSE_MODEL first, and then, if it is not present, we take the value of REQUEST_MODEL

* Also, for the case when PROVIDER is "openai", the STREAM is set to True, and INPUT_TOKEN_COUNT is not set, we will calculate the number of input tokens, and the cost on the server using tiktoken and use it in cost calculation. This is done because OpenAI does not stream the usage back when streaming is enabled. Output token count is (approximately) equal to the number of streaming events sent by OpenAI, but there is no way to calculate the input token count, other than re-tokenizing.

Making Laminar pipeline calls

After you are ready to use your pipeline in your code, deploy it in Laminar by selecting the target version for the pipeline.

Once your pipeline target is set, you can call it from Python in just a few lines.

Example use:

from lmnr import Laminar 

# for decorator instrumentation, do: `from lmnr inport lmnr_context`

l = Laminar('<YOUR_PROJECT_API_KEY>')
result = l.run(  # lmnr_context.run( for decorator instrumentation
    pipeline = 'my_pipeline_name',
    inputs = {'input_node_name': 'some_value'},
    # all environment variables
    env = {'OPENAI_API_KEY': 'sk-some-key'},
    # any metadata to attach to this run's trace
    metadata = {'session_id': 'your_custom_session_id'}
)

Resulting in:

>>> result
PipelineRunResponse(
    outputs={'output': {'value': [ChatMessage(role='user', content='hello')]}},
    # useful to locate your trace
    run_id='53b012d5-5759-48a6-a9c5-0011610e3669'
)

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lmnr-0.3.6.tar.gz (26.1 kB view details)

Uploaded Source

Built Distribution

lmnr-0.3.6-py3-none-any.whl (31.2 kB view details)

Uploaded Python 3

File details

Details for the file lmnr-0.3.6.tar.gz.

File metadata

  • Download URL: lmnr-0.3.6.tar.gz
  • Upload date:
  • Size: 26.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.7 Darwin/23.0.0

File hashes

Hashes for lmnr-0.3.6.tar.gz
Algorithm Hash digest
SHA256 f4001842ac943c3a4c59893fdb66c5887675eed431e70063c1658702dade078d
MD5 cf27834fb34984d355ab1bf763c38a97
BLAKE2b-256 c0754ed58d4df3b33e6ac1760d30642b50d8ad9897531716c121b6d2f6541873

See more details on using hashes here.

File details

Details for the file lmnr-0.3.6-py3-none-any.whl.

File metadata

  • Download URL: lmnr-0.3.6-py3-none-any.whl
  • Upload date:
  • Size: 31.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.11.7 Darwin/23.0.0

File hashes

Hashes for lmnr-0.3.6-py3-none-any.whl
Algorithm Hash digest
SHA256 c0260a8b82401c782072dd572b729291b81a8ec091c470885a7ac33a498b1c5b
MD5 56c07c346dd53b218806a4e10b20216f
BLAKE2b-256 83015338cc3c2f3ada076c5f7c1ad4da76de199be88f7425e7815a79cb04b4f6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page