Client library to connect to the LangSmith LLM Tracing and Evaluation Platform.

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

LangSmith Client SDK

This package contains the Python client for interacting with the LangSmith platform.

To install:

pip install -U langsmith
export LANGCHAIN_TRACING_V2=true
export LANGCHAIN_API_KEY=ls_...

Then trace:

import openai
from langsmith.wrappers import wrap_openai

client = wrap_openai(openai.Client())

client.chat.completions.create(
    messages=[{"role": "user", "content": "Hello, world"}],
    model="gpt-3.5-turbo"
)

LangSmith helps you and your team develop and evaluate language models and intelligent agents. It is compatible with any LLM application.

Cookbook: For tutorials on how to get more value out of LangSmith, check out the Langsmith Cookbook repo.

A typical workflow looks like:

Set up an account with LangSmith.
Log traces while debugging and prototyping.
Run benchmark evaluations and continuously improve with the collected data.

We'll walk through these steps in more detail below.

1. Connect to LangSmith

Sign up for LangSmith using your GitHub, Discord accounts, or an email address and password. If you sign up with an email, make sure to verify your email address before logging in.

Then, create a unique API key on the Settings Page, which is found in the menu at the top right corner of the page.

Note: Save the API Key in a secure location. It will not be shown again.

2. Log Traces

You can log traces natively using the LangSmith SDK or within your LangChain application.

Logging Traces with LangChain

LangSmith seamlessly integrates with the Python LangChain library to record traces from your LLM applications.

Copy the environment variables from the Settings Page and add them to your application.

Tracing can be activated by setting the following environment variables or by manually specifying the LangChainTracer.

import os
os.environ["LANGCHAIN_TRACING_V2"] = "true"
os.environ["LANGCHAIN_ENDPOINT"] = "https://api.smith.langchain.com"
os.environ["LANGCHAIN_API_KEY"] = "<YOUR-LANGSMITH-API-KEY>"
# os.environ["LANGCHAIN_PROJECT"] = "My Project Name" # Optional: "default" is used if not set

Tip: Projects are groups of traces. All runs are logged to a project. If not specified, the project is set to default.

Run an Agent, Chain, or Language Model in LangChain

If the environment variables are correctly set, your application will automatically connect to the LangSmith platform.

from langchain_core.runnables import chain

@chain
def add_val(x: dict) -> dict:
    return {"val": x["val"] + 1}

add_val({"val": 1})

Logging Traces Outside LangChain

You can still use the LangSmith development platform without depending on any LangChain code.

Copy the environment variables from the Settings Page and add them to your application.

import os
os.environ["LANGCHAIN_ENDPOINT"] = "https://api.smith.langchain.com"
os.environ["LANGCHAIN_API_KEY"] = "<YOUR-LANGSMITH-API-KEY>"
# os.environ["LANGCHAIN_PROJECT"] = "My Project Name" # Optional: "default" is used if not set

Log traces

The easiest way to log traces using the SDK is via the @traceable decorator. Below is an example.

from datetime import datetime
from typing import List, Optional, Tuple

import openai
from langsmith import traceable
from langsmith.wrappers import wrap_openai

client = wrap_openai(openai.Client())

@traceable
def argument_generator(query: str, additional_description: str = "") -> str:
    return client.chat.completions.create(
        [
            {"role": "system", "content": "You are a debater making an argument on a topic."
             f"{additional_description}"
             f" The current time is {datetime.now()}"},
            {"role": "user", "content": f"The discussion topic is {query}"}
        ]
    ).choices[0].message.content



@traceable
def argument_chain(query: str, additional_description: str = "") -> str:
    argument = argument_generator(query, additional_description)
    # ... Do other processing or call other functions...
    return argument

argument_chain("Why is blue better than orange?")

Alternatively, you can manually log events using the Client directly or using a RunTree, which is what the traceable decorator is meant to manage for you!

A RunTree tracks your application. Each RunTree object is required to have a name and run_type. These and other important attributes are as follows:

name: str - used to identify the component's purpose
run_type: str - Currently one of "llm", "chain" or "tool"; more options will be added in the future
inputs: dict - the inputs to the component
outputs: Optional[dict] - the (optional) returned values from the component
error: Optional[str] - Any error messages that may have arisen during the call

from langsmith.run_trees import RunTree

parent_run = RunTree(
    name="My Chat Bot",
    run_type="chain",
    inputs={"text": "Summarize this morning's meetings."},
    # project_name= "Defaults to the LANGCHAIN_PROJECT env var"
)
parent_run.post()
# .. My Chat Bot calls an LLM
child_llm_run = parent_run.create_child(
    name="My Proprietary LLM",
    run_type="llm",
    inputs={
        "prompts": [
            "You are an AI Assistant. The time is XYZ."
            " Summarize this morning's meetings."
        ]
    },
)
child_llm_run.post()
child_llm_run.end(
    outputs={
        "generations": [
            "I should use the transcript_loader tool"
            " to fetch meeting_transcripts from XYZ"
        ]
    }
)
child_llm_run.patch()
# ..  My Chat Bot takes the LLM output and calls
# a tool / function for fetching transcripts ..
child_tool_run = parent_run.create_child(
    name="transcript_loader",
    run_type="tool",
    inputs={"date": "XYZ", "content_type": "meeting_transcripts"},
)
child_tool_run.post()
# The tool returns meeting notes to the chat bot
child_tool_run.end(outputs={"meetings": ["Meeting1 notes.."]})
child_tool_run.patch()

child_chain_run = parent_run.create_child(
    name="Unreliable Component",
    run_type="tool",
    inputs={"input": "Summarize these notes..."},
)
child_chain_run.post()

try:
    # .... the component does work
    raise ValueError("Something went wrong")
    child_chain_run.end(outputs={"output": "foo"}
    child_chain_run.patch()
except Exception as e:
    child_chain_run.end(error=f"I errored again {e}")
    child_chain_run.patch()
    pass
# .. The chat agent recovers

parent_run.end(outputs={"output": ["The meeting notes are as follows:..."]})
res = parent_run.patch()
res.result()

Create a Dataset from Existing Runs

Once your runs are stored in LangSmith, you can convert them into a dataset. For this example, we will do so using the Client, but you can also do this using the web interface, as explained in the LangSmith docs.

from langsmith import Client

client = Client()
dataset_name = "Example Dataset"
# We will only use examples from the top level AgentExecutor run here,
# and exclude runs that errored.
runs = client.list_runs(
    project_name="my_project",
    execution_order=1,
    error=False,
)

dataset = client.create_dataset(dataset_name, description="An example dataset")
for run in runs:
    client.create_example(
        inputs=run.inputs,
        outputs=run.outputs,
        dataset_id=dataset.id,
    )

Evaluating Runs

Check out the LangSmith Testing & Evaluation dos for up-to-date workflows.

For generating automated feedback on individual runs, you can run evaluations directly using the LangSmith client.

from typing import Optional
from langsmith.evaluation import StringEvaluator


def jaccard_chars(output: str, answer: str) -> float:
    """Naive Jaccard similarity between two strings."""
    prediction_chars = set(output.strip().lower())
    answer_chars = set(answer.strip().lower())
    intersection = prediction_chars.intersection(answer_chars)
    union = prediction_chars.union(answer_chars)
    return len(intersection) / len(union)


def grader(run_input: str, run_output: str, answer: Optional[str]) -> dict:
    """Compute the score and/or label for this run."""
    if answer is None:
        value = "AMBIGUOUS"
        score = 0.5
    else:
        score = jaccard_chars(run_output, answer)
        value = "CORRECT" if score > 0.9 else "INCORRECT"
    return dict(score=score, value=value)

evaluator = StringEvaluator(evaluation_name="Jaccard", grading_function=grader)

runs = client.list_runs(
    project_name="my_project",
    execution_order=1,
    error=False,
)
for run in runs:
    client.evaluate_run(run, evaluator)

Additional Documentation

To learn more about the LangSmith platform, check out the docs.

Project details

These details have not been verified by PyPI

Project links

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.1.59

May 16, 2024

0.1.58

May 15, 2024

0.1.57

May 11, 2024

0.1.56

May 8, 2024

0.1.55

May 7, 2024

0.1.54

May 4, 2024

0.1.53

May 2, 2024

0.1.52

Apr 29, 2024

0.1.51

Apr 25, 2024

0.1.50

Apr 23, 2024

0.1.49

Apr 19, 2024

0.1.48

Apr 15, 2024

0.1.47

Apr 13, 2024

0.1.46

Apr 12, 2024

0.1.46rc1 pre-release

Apr 12, 2024

0.1.45

Apr 10, 2024

0.1.45rc1 pre-release

Apr 11, 2024

0.1.44

Apr 10, 2024

0.1.43

Apr 10, 2024

0.1.42

Apr 9, 2024

0.1.41

Apr 9, 2024

0.1.40

Apr 4, 2024

0.1.39

Apr 3, 2024

0.1.38

Mar 29, 2024

0.1.37

Mar 28, 2024

0.1.36

Mar 28, 2024

0.1.35

Mar 27, 2024

0.1.34

Mar 27, 2024

0.1.33

Mar 27, 2024

0.1.32rc8 pre-release

Mar 26, 2024

0.1.32rc7 pre-release

Mar 26, 2024

0.1.32rc6 pre-release

Mar 26, 2024

0.1.32rc5 pre-release

Mar 26, 2024

0.1.32rc4 pre-release

Mar 26, 2024

0.1.32rc3 pre-release

Mar 26, 2024

0.1.32rc2 pre-release

Mar 26, 2024

0.1.32rc1 pre-release

Mar 25, 2024

0.1.31

Mar 19, 2024

0.1.30

Mar 19, 2024

0.1.29

Mar 19, 2024

0.1.28

Mar 18, 2024

0.1.27

Mar 16, 2024

0.1.26

Mar 14, 2024

0.1.25

Mar 14, 2024

0.1.24

Mar 12, 2024

0.1.23

Mar 8, 2024

0.1.22

Mar 6, 2024

0.1.21

Mar 5, 2024

0.1.20

Mar 5, 2024

0.1.19

Mar 5, 2024

0.1.18

Mar 5, 2024

0.1.17

Mar 4, 2024

0.1.16

Mar 4, 2024

0.1.15

Mar 4, 2024

0.1.14

Mar 3, 2024

0.1.13

Mar 2, 2024

0.1.12

Mar 1, 2024

0.1.11

Mar 1, 2024

0.1.10

Feb 27, 2024

0.1.9

Feb 26, 2024

0.1.8

Feb 25, 2024

0.1.7

Feb 24, 2024

0.1.6

Feb 23, 2024

0.1.5

Feb 21, 2024

0.1.4

Feb 21, 2024

0.1.3

Feb 20, 2024

0.1.2

Feb 16, 2024

0.1.1

Feb 15, 2024

0.1.0

Feb 15, 2024

This version

0.0.92

Feb 14, 2024

0.0.91

Feb 13, 2024

0.0.90

Feb 10, 2024

0.0.89

Feb 9, 2024

0.0.88

Feb 9, 2024

0.0.87

Feb 7, 2024

0.0.86

Feb 2, 2024

0.0.86rc1 pre-release

Jan 30, 2024

0.0.85

Jan 30, 2024

0.0.84

Jan 29, 2024

0.0.84rc5 pre-release

Jan 26, 2024

0.0.84rc4 pre-release

Jan 25, 2024

0.0.84rc3 pre-release

Jan 25, 2024

0.0.84rc2 pre-release

Jan 23, 2024

0.0.84rc1 pre-release

Jan 22, 2024

0.0.83

Jan 18, 2024

0.0.82

Jan 18, 2024

0.0.81

Jan 16, 2024

0.0.80

Jan 11, 2024

0.0.79

Jan 9, 2024

0.0.78

Jan 8, 2024

0.0.77

Jan 3, 2024

0.0.76

Jan 3, 2024

0.0.75

Dec 22, 2023

0.0.74

Dec 21, 2023

0.0.73

Dec 21, 2023

0.0.72

Dec 18, 2023

0.0.71

Dec 15, 2023

0.0.70

Dec 14, 2023

0.0.69

Dec 3, 2023

0.0.68

Dec 1, 2023

0.0.67

Nov 28, 2023

0.0.66

Nov 20, 2023

0.0.65

Nov 17, 2023

0.0.64

Nov 14, 2023

0.0.63

Nov 9, 2023

0.0.62 yanked

Nov 8, 2023

Reason this release was yanked:

Dev dep was allowed to be added to the prod requiremenents

0.0.61

Nov 8, 2023

0.0.60

Nov 7, 2023

0.0.59

Nov 6, 2023

0.0.58

Nov 6, 2023

0.0.57 yanked

Nov 3, 2023

Reason this release was yanked:

issue with feedback

0.0.56

Nov 1, 2023

0.0.55

Nov 1, 2023

0.0.54

Oct 30, 2023

0.0.53

Oct 27, 2023

0.0.52

Oct 25, 2023

0.0.51

Oct 25, 2023

0.0.50

Oct 24, 2023

0.0.49

Oct 20, 2023

0.0.48

Oct 20, 2023

0.0.47

Oct 19, 2023

0.0.46

Oct 18, 2023

0.0.45

Oct 18, 2023

0.0.44

Oct 16, 2023

0.0.43

Oct 6, 2023

0.0.42

Oct 5, 2023

0.0.41

Sep 27, 2023

0.0.40

Sep 21, 2023

0.0.39

Sep 21, 2023

0.0.38

Sep 18, 2023

0.0.37

Sep 14, 2023

0.0.36

Sep 12, 2023

0.0.35

Sep 8, 2023

0.0.34

Sep 8, 2023

0.0.33

Sep 2, 2023

0.0.32

Sep 1, 2023

0.0.31

Aug 31, 2023

0.0.30

Aug 30, 2023

0.0.29

Aug 30, 2023

0.0.28

Aug 30, 2023

0.0.27

Aug 27, 2023

0.0.26

Aug 22, 2023

0.0.25

Aug 18, 2023

0.0.24

Aug 17, 2023

0.0.23

Aug 16, 2023

0.0.22

Aug 11, 2023

0.0.21

Aug 10, 2023

0.0.20

Aug 8, 2023

0.0.19

Aug 5, 2023

0.0.18

Aug 3, 2023

0.0.16

Aug 1, 2023

0.0.15

Jul 27, 2023

0.0.14

Jul 21, 2023

0.0.13

Jul 21, 2023

0.0.12

Jul 21, 2023

0.0.11

Jul 19, 2023

0.0.10

Jul 18, 2023

0.0.9

Jul 17, 2023

0.0.8

Jul 17, 2023

0.0.7

Jul 15, 2023

0.0.6

Jul 14, 2023

0.0.5

Jul 12, 2023

0.0.4

Jul 12, 2023

0.0.3

Jul 11, 2023

0.0.2

Jul 8, 2023

0.0.1

Jun 26, 2023

0.0.0rc0 pre-release

Jun 26, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langsmith-0.0.92.tar.gz (53.2 kB view hashes)

Uploaded Feb 14, 2024 Source

Built Distribution

langsmith-0.0.92-py3-none-any.whl (56.5 kB view hashes)

Uploaded Feb 14, 2024 Python 3

Hashes for langsmith-0.0.92.tar.gz

Hashes for langsmith-0.0.92.tar.gz
Algorithm	Hash digest
SHA256	`61a3a502222bdd221b7f592b6fc14756d74c4fc088aa6bd8834b92adfe9ee583`
MD5	`9e3c9076c771578c05be3ec143b88363`
BLAKE2b-256	`148242ed4a902ad1ed9a7d0d29ee78e7e1b6b1aea0b1bb66be3d4c9a3f2543a6`

Hashes for langsmith-0.0.92-py3-none-any.whl

Hashes for langsmith-0.0.92-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ddcf65e3b5ca11893ae8ef9816ce2a11a089d051be491886e43a2c4556b88fd0`
MD5	`01a81ac5ff1346b607d7c44bff84b870`
BLAKE2b-256	`97cd1c618f89d3fcbb375c99a3ea950bffba8a01862cc0f0ab5032dfb95e8d1e`