Skip to main content

A language toolkit for monitoring LLM interactions

Project description

LangKit is an open-source text metrics toolkit for monitoring language models. It offers an array of methods for extracting relevant signals from the input and/or output text, which are compatible with the open-source data logging library whylogs.

💡 Want to experience LangKit? Go to this notebook!

Table of Contents 📖

Motivation 🎯

Productionizing language models, including LLMs, comes with a range of risks due to the infinite amount of input combinations, which can elicit an infinite amount of outputs. The unstructured nature of text poses a challenge in the ML observability space - a challenge worth solving, since the lack of visibility on the model's behavior can have serious consequences.

Features 🛠️

The currently supported metrics include:

  • Text Quality
    • readability score
    • complexity and grade scores
  • Text Relevance
    • Similarity scores between prompt/responses
    • Similarity scores against user-defined themes
  • Security and Privacy
    • patterns - count of strings matching a user-defined regex pattern group
    • jailbreaks - similarity scores with respect to known jailbreak attempts
    • prompt injection - similarity scores with respect to known prompt injection attacks
    • refusals - similarity scores with respect to known LLM refusal of service responses
  • Sentiment and Toxicity
    • sentiment analysis
    • toxicity analysis

Installation 💻

To install LangKit, use the Python Package Index (PyPI) as follows:

pip install langkit[all]

Usage 🚀

LangKit modules contain UDFs that automatically wire into the collection of UDFs on String features provided by whylogs by default. All we have to do is import the LangKit modules and then instantiate a custom schema as shown in the example below.

import whylogs as why
from langkit import llm_metrics

results = why.log({"prompt": "Hello!", "response": "World!"}, schema=llm_metrics.init())

The code above will produce a set of metrics comprised of the default whylogs metrics for text features and all the metrics defined in the imported modules. This profile can be visualized and monitored in the WhyLabs platform or they can be further analyzed by the user on their own accord.

More examples are available here.

Modules 📦

You can have more information about the different modules and their metrics here.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langkit-0.0.35.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

langkit-0.0.35-py3-none-any.whl (1.2 MB view details)

Uploaded Python 3

File details

Details for the file langkit-0.0.35.tar.gz.

File metadata

  • Download URL: langkit-0.0.35.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.12 Linux/6.5.0-1025-azure

File hashes

Hashes for langkit-0.0.35.tar.gz
Algorithm Hash digest
SHA256 22a0a15f825d286ba9b7b060f1ce28753558fa02242631bb57208e5be0b46056
MD5 e6f548ab33903dbae5c62fdd1ff3a1d1
BLAKE2b-256 e1ca6c0c3d088663a8e7c0df5a6bd618b5f24348d212c607ecb7ea05c8cfde25

See more details on using hashes here.

File details

Details for the file langkit-0.0.35-py3-none-any.whl.

File metadata

  • Download URL: langkit-0.0.35-py3-none-any.whl
  • Upload date:
  • Size: 1.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.12 Linux/6.5.0-1025-azure

File hashes

Hashes for langkit-0.0.35-py3-none-any.whl
Algorithm Hash digest
SHA256 602cb673af70138691cd7709fb451af62a0927f1c396ff2cbbcba86dae886305
MD5 e667daec164a0f41cd5b61a0250e330a
BLAKE2b-256 778fa7b4f01f018e8f0fd0c435672ee35b9fede344baaa64eae0e5ab7a9ffc00

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page