Library with langchain instrumentation to evaluate LLM based applications.

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Welcome to TruLens-Eval!

TruLens

Evaluate and track your LLM experiments with TruLens. As you work on your models and prompts TruLens-Eval supports the iterative development and of a wide range of LLM applications by wrapping your application to log key metadata across the entire chain (or off chain if your project does not use chains) on your local machine.

Using feedback functions, you can objectively evaluate the quality of the responses provided by an LLM to your requests. This is completed with minimal latency, as this is achieved in a sequential call for your application, and evaluations are logged to your local machine. Finally, we provide an easy to use Streamlit dashboard run locally on your machine for you to better understand your LLM’s performance.

Value Propositions

TruLens-Eval has two key value propositions:

Evaluation:
- TruLens supports the the evaluation of inputs, outputs and internals of your LLM application using any model (including LLMs).
- A number of feedback functions for evaluation are implemented out-of-the-box such as groundedness, relevance and toxicity. The framework is also easily extensible for custom evaluation requirements.
Tracking:
- TruLens contains instrumentation for any LLM application including question answering, retrieval-augmented generation, agent-based applications and more. This instrumentation allows for the tracking of a wide variety of usage metrics and metadata. Read more in the instrumentation overview.
- TruLens' instrumentation can be applied to any LLM application without being tied down to a given framework. Additionally, deep integrations with LangChain and Llama-Index allow the capture of internal metadata and text.
- Anything that is tracked by the instrumentation can be evaluated!

The process for building your evaluated and tracked LLM application with TruLens is below 👇

Architecture Diagram

Installation and Setup

Install the trulens-eval pip package from PyPI.

    pip install trulens-eval

Setting Keys

In any of the quickstarts, you will need OpenAI and Huggingface keys. You can add keys by setting the environmental variables:

import os
os.environ["OPENAI_API_KEY"] = "..."
os.environ["HUGGINGFACE_API_KEY"] = "..."

Quick Usage

TruLens supports the evaluation of tracking for any LLM app framework. Choose a framework below to get started:

Langchain

langchain_quickstart.ipynb.

langchain_quickstart.py.

Llama-Index

llama_index_quickstart.ipynb.

llama_index_quickstart.py

No Framework

no_framework_quickstart.ipynb.

no_framework_quickstart.py

💡 Contributing

Interested in contributing? See our contribution guide for more details.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.5.3

Jul 1, 2025

1.5.2

Jun 18, 2025

1.5.1

Jun 5, 2025

1.5.0

Jun 2, 2025

1.4.9

Apr 10, 2025

1.4.8

Apr 3, 2025

1.4.7

Mar 20, 2025

1.4.6

Mar 13, 2025

1.4.5

Mar 6, 2025

1.4.4

Feb 27, 2025

1.4.3

Feb 26, 2025

1.4.2

Feb 20, 2025

1.4.1

Feb 20, 2025

1.4.1a1 pre-release

Feb 18, 2025

1.4.1a0 pre-release

Feb 18, 2025

1.4.0

Feb 13, 2025

1.4.0a0 pre-release

Feb 12, 2025

1.3.5

Feb 6, 2025

1.3.4

Feb 6, 2025

1.3.3

Jan 27, 2025

1.3.3a0 pre-release

Jan 24, 2025

1.3.2

Jan 16, 2025

1.3.1

Jan 16, 2025

1.3.0

Jan 10, 2025

1.2.11

Dec 16, 2024

1.2.10

Dec 5, 2024

1.2.9

Nov 27, 2024

1.2.8

Nov 19, 2024

1.2.7

Nov 14, 2024

1.2.6

Nov 6, 2024

1.2.5

Nov 6, 2024

1.2.4

Oct 31, 2024

1.2.3

Oct 31, 2024

1.2.2

Oct 30, 2024

1.2.1

Oct 29, 2024

1.2.0

Oct 28, 2024

1.1.0

Oct 10, 2024

1.1.0a4 pre-release

Oct 8, 2024

1.1.0a3 pre-release

Oct 8, 2024

1.1.0a2 pre-release

Oct 8, 2024

1.1.0a1 pre-release

Oct 3, 2024

1.1.0a0 pre-release

Sep 21, 2024

1.0.11

Oct 9, 2024

1.0.10

Oct 7, 2024

1.0.9

Oct 7, 2024

1.0.8

Oct 4, 2024

1.0.7

Oct 2, 2024

1.0.6

Sep 26, 2024

1.0.5

Sep 26, 2024

1.0.4

Sep 25, 2024

1.0.3

Sep 21, 2024

1.0.2

Sep 18, 2024

1.0.1

Aug 30, 2024

1.0.1a6 pre-release

Aug 30, 2024

1.0.1a5 pre-release

Aug 29, 2024

1.0.1a4 pre-release

Aug 28, 2024

1.0.1a1 pre-release

Aug 28, 2024

0.33.1

Aug 28, 2024

0.33.0

Jul 16, 2024

0.32.1

Aug 28, 2024

0.32.0

Jun 24, 2024

0.31.1

Aug 28, 2024

0.31.0

Jun 10, 2024

0.30.1

May 25, 2024

0.30.0

May 25, 2024

0.29.0

May 16, 2024

0.28.2

Apr 24, 2024

0.28.1

Apr 22, 2024

0.28.0

Apr 17, 2024

0.27.2

Apr 4, 2024

0.27.1

Apr 4, 2024

0.27.0

Mar 23, 2024

0.26.0

Mar 15, 2024

0.25.1

Mar 8, 2024

0.25.0

Mar 7, 2024

0.24.1

Feb 23, 2024

0.24.0

Feb 23, 2024

0.23.0

Feb 16, 2024

0.22.2

Feb 13, 2024

0.22.1

Feb 9, 2024

0.22.0

Feb 3, 2024

0.21.0

Jan 26, 2024

0.20.3

Jan 10, 2024

0.20.2

Jan 9, 2024

0.20.1

Jan 5, 2024

0.20.0

Dec 23, 2023

0.19.2

Dec 18, 2023

0.19.1

Dec 15, 2023

0.19.0

Dec 15, 2023

0.18.3

Dec 7, 2023

0.18.2

Dec 1, 2023

0.18.1

Nov 23, 2023

0.18.0

Nov 16, 2023

0.17.0

Nov 2, 2023

0.17.0b0 pre-release

Nov 2, 2023

0.17.0a0 pre-release

Nov 2, 2023

0.16.0

Oct 20, 2023

0.15.3

Oct 11, 2023

0.15.1 yanked

Oct 6, 2023

Reason this release was yanked:

Unstable release

0.15.0 yanked

Oct 6, 2023

Reason this release was yanked:

Unstable release

0.14.0

Sep 28, 2023

0.14.0b0 pre-release

Sep 28, 2023

0.14.0a0 pre-release

Sep 28, 2023

0.13.0

Sep 22, 2023

0.13.0a0 pre-release

Sep 22, 2023

0.12.0

Sep 7, 2023

0.12.0a0 pre-release

Sep 7, 2023

This version

0.11.0

Aug 31, 2023

0.11.0b0 pre-release

Aug 31, 2023

0.11.0a0 pre-release

Aug 31, 2023

0.10.0

Aug 18, 2023

0.9.0

Aug 10, 2023

0.9.0a0 pre-release

Aug 10, 2023

0.8.0

Aug 3, 2023

0.8.0a0 pre-release

Aug 3, 2023

0.7.0

Jul 27, 2023

0.7.0a0 pre-release

Jul 27, 2023

0.6.0

Jul 21, 2023

0.6.0a0 pre-release

Jul 21, 2023

0.5.0

Jul 12, 2023

0.5.0a0 pre-release

Jul 12, 2023

0.4.1b0 pre-release

Jun 30, 2023

0.4.1a0 pre-release

Jun 30, 2023

0.4.0

Jun 29, 2023

0.4.0a0 pre-release

Jun 29, 2023

0.3.0

Jun 23, 2023

0.3.0rc0 pre-release

Jun 23, 2023

0.3.0b0 pre-release

Jun 22, 2023

0.3.0a0 pre-release

Jun 22, 2023

0.2.2

Jun 15, 2023

0.2.2b0 pre-release

Jun 15, 2023

0.2.2a0 pre-release

Jun 15, 2023

0.2.1

Jun 14, 2023

0.2.1a0 pre-release

Jun 14, 2023

0.2.0

Jun 14, 2023

0.2.0a0 pre-release

Jun 14, 2023

0.1.2

Jun 7, 2023

0.1.2a0 pre-release

Jun 2, 2023

0.1.1

May 24, 2023

0.1.1a0 pre-release

May 24, 2023

0.0.1

May 23, 2023

0.0.1a0 pre-release

May 24, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

trulens_eval-0.11.0-py3-none-any.whl (302.5 kB view details)

Uploaded Aug 31, 2023 Python 3

File details

Details for the file trulens_eval-0.11.0-py3-none-any.whl.

File metadata

Download URL: trulens_eval-0.11.0-py3-none-any.whl
Upload date: Aug 31, 2023
Size: 302.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for trulens_eval-0.11.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`58697d614640b26fa51d8e317542efc8777d7e1cec5a2a175e8e2b43eb486bd8`
MD5	`a7f38493fe970f56666397846b6f9877`
BLAKE2b-256	`84868df74dd925ef090cabd812c69ff08f7b52a515cded10ddd317185f42f78d`

See more details on using hashes here.

trulens-eval 0.11.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Welcome to TruLens-Eval!

Value Propositions

Installation and Setup

Setting Keys

Quick Usage

💡 Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes