LangFair is a Python library for conducting use-case level LLM bias and fairness assessments

LangFair: Use-Case Level LLM Bias and Fairness Assessments

LangFair is a comprehensive Python library designed for conducting bias and fairness assessments of large language model (LLM) use cases. This repository includes several supporting resources:

Documentation site with complete API reference
Comprehensive framework for choosing bias and fairness metrics
Demo notebooks providing illustrative examples
LangFair tutorial on Medium
Software paper on how LangFair compares to other toolkits
Research paper on our evaluation approach

🚀 Why Choose LangFair?

Static benchmark assessments, which are typically assumed to be sufficiently representative, often fall short in capturing the risks associated with all possible use cases of LLMs. These models are increasingly used in various applications, including recommendation systems, classification, text generation, and summarization. However, evaluating these models without considering use-case-specific prompts can lead to misleading assessments of their performance, especially regarding bias and fairness risks.

LangFair addresses this gap by adopting a Bring Your Own Prompts (BYOP) approach, allowing users to tailor bias and fairness evaluations to their specific use cases. This ensures that the metrics computed reflect the true performance of the LLMs in real-world scenarios, where prompt-specific risks are critical. Additionally, LangFair's focus is on output-based metrics that are practical for governance audits and real-world testing, without needing access to internal model states.

Note: This diagram illustrates the workflow for assessing bias and fairness in text generation and summarization use cases.

⚡ Quickstart Guide

(Optional) Create a virtual environment for using LangFair

We recommend creating a new virtual environment using venv before installing LangFair. To do so, please follow instructions here.

Installing LangFair

The latest version can be installed from PyPI:

pip install langfair

Usage Examples

Below are code samples illustrating how to use LangFair to assess bias and fairness risks in text generation and summarization use cases. The examples assume the user has already defined a list of prompts from their use case, stored in a variable named prompts.

Generate LLM responses

To generate responses, we can use LangFair's ResponseGenerator class. First, we must create a LangChain LLM object. Below we use ChatVertexAI, but any of LangChain's LLM classes may be used instead. Note that InMemoryRateLimiter is used to avoid rate limit errors.

from langchain_google_vertexai import ChatVertexAI
from langchain_core.rate_limiters import InMemoryRateLimiter
rate_limiter = InMemoryRateLimiter(
    requests_per_second=4.5, check_every_n_seconds=0.5, max_bucket_size=280,  
)
llm = ChatVertexAI(
    model_name="gemini-pro", temperature=0.3, rate_limiter=rate_limiter
)

We can use ResponseGenerator.generate_responses to generate 25 responses for each prompt, as is convention for toxicity evaluation.

from langfair.generator import ResponseGenerator
rg = ResponseGenerator(langchain_llm=llm)
generations = await rg.generate_responses(prompts=prompts, count=25)
responses = generations["data"]["response"]
duplicated_prompts = generations["data"]["prompt"] # so prompts correspond to responses
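
The snippets in this guide use top-level await, which works in a Jupyter notebook. In a plain Python script the coroutine needs an event loop, for example via asyncio.run. The following is a minimal sketch of that pattern, not LangFair code: FakeGenerator is a hypothetical stand-in that only mimics the return shape used above, and the assumption that repeated prompts are grouped prompt by prompt is ours.

```python
import asyncio


class FakeGenerator:
    """Hypothetical stand-in that mimics the return shape used above."""

    async def generate_responses(self, prompts, count=25):
        # Assumption: each prompt is repeated `count` times, grouped by prompt,
        # so that prompts and responses line up index by index.
        repeated = [p for p in prompts for _ in range(count)]
        responses = [f"response {i} to: {p}" for i, p in enumerate(repeated)]
        return {"data": {"prompt": repeated, "response": responses}}


async def main(prompts, count):
    # In real use, replace FakeGenerator() with ResponseGenerator(langchain_llm=llm).
    rg = FakeGenerator()
    generations = await rg.generate_responses(prompts=prompts, count=count)
    return generations["data"]["prompt"], generations["data"]["response"]


# asyncio.run starts an event loop, runs the coroutine, and returns its result.
duplicated_prompts, responses = asyncio.run(
    main(["Summarize A", "Summarize B"], count=3)
)
```

Inside a notebook, an event loop is already running, so the plain await form shown above is the one to use there.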

Compute toxicity metrics

Toxicity metrics can be computed with ToxicityMetrics. Passing a torch.device is optional; if a GPU is available, using it speeds up toxicity computation.

# import torch # uncomment if GPU is available
# device = torch.device("cuda") # uncomment if GPU is available
from langfair.metrics.toxicity import ToxicityMetrics
tm = ToxicityMetrics(
    # device=device,  # uncomment if GPU is available
)
tox_result = tm.evaluate(
    prompts=duplicated_prompts, 
    responses=responses, 
    return_data=True
)
tox_result['metrics']
# # Output is below
# {'Toxic Fraction': 0.0004,
# 'Expected Maximum Toxicity': 0.013845130120171235,
# 'Toxicity Probability': 0.01}
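
To make the three reported numbers easier to interpret, here is a small sketch of how they are commonly defined in the cited literature, computed from per-response toxicity scores that have already been grouped by prompt. This is an illustration and not LangFair's implementation; the function name, the input layout, and the 0.5 threshold are assumptions.

```python
def toxicity_summary(scores_by_prompt, threshold=0.5):
    """scores_by_prompt: one list of toxicity scores in [0, 1] per prompt."""
    all_scores = [s for scores in scores_by_prompt for s in scores]
    max_per_prompt = [max(scores) for scores in scores_by_prompt]
    n_prompts = len(max_per_prompt)
    return {
        # Share of all responses scored at or above the threshold.
        "Toxic Fraction": sum(s >= threshold for s in all_scores) / len(all_scores),
        # Worst score per prompt, averaged over prompts.
        "Expected Maximum Toxicity": sum(max_per_prompt) / n_prompts,
        # Share of prompts with at least one response at or above the threshold.
        "Toxicity Probability": sum(m >= threshold for m in max_per_prompt) / n_prompts,
    }


# Two prompts with three scored responses each (made-up scores).
example_scores = [[0.1, 0.7, 0.2], [0.0, 0.1, 0.3]]
summary = toxicity_summary(example_scores)
```

The last two metrics depend on how many responses are generated per prompt, which is why a fixed count such as 25 matters when comparing results.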

Compute stereotype metrics

Stereotype metrics can be computed with StereotypeMetrics.

from langfair.metrics.stereotype import StereotypeMetrics
sm = StereotypeMetrics()
stereo_result = sm.evaluate(responses=responses, categories=["gender"])
stereo_result['metrics']
# # Output is below
# {'Stereotype Association': 0.3172750176745329,
# 'Cooccurrence Bias': 0.44766333654278373,
# 'Stereotype Fraction - gender': 0.08}

Generate counterfactual responses and compute metrics

We can generate counterfactual responses with CounterfactualGenerator.

from langfair.generator.counterfactual import CounterfactualGenerator
cg = CounterfactualGenerator(langchain_llm=llm)
cf_generations = await cg.generate_responses(
    prompts=prompts, attribute='gender', count=25
)
male_responses = cf_generations['data']['male_response']
female_responses = cf_generations['data']['female_response']
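
One common way to build counterfactual inputs is to swap protected attribute words in a prompt while leaving everything else unchanged. The sketch below illustrates that idea only; it does not describe how CounterfactualGenerator works internally, and the word map and function name are hypothetical.

```python
import re

# Hypothetical, deliberately tiny word map; a real list would be much larger.
MALE_TO_FEMALE = {
    "he": "she",
    "him": "her",
    "his": "her",
    "man": "woman",
    "father": "mother",
}


def swap_gender_words(text, mapping=MALE_TO_FEMALE):
    """Replace whole-word matches, preserving a leading capital letter."""
    pattern = re.compile(
        r"\b(" + "|".join(map(re.escape, mapping)) + r")\b", re.IGNORECASE
    )

    def replace(match):
        word = match.group(0)
        swapped = mapping[word.lower()]
        return swapped.capitalize() if word[0].isupper() else swapped

    return pattern.sub(replace, text)


male_prompt = "He asked his father for advice."
female_prompt = swap_gender_words(male_prompt)
```

Sending both versions of each prompt to the same LLM yields paired responses that can then be compared.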

Counterfactual metrics can be easily computed with CounterfactualMetrics.

from langfair.metrics.counterfactual import CounterfactualMetrics
cm = CounterfactualMetrics()
cf_result = cm.evaluate(
    texts1=male_responses, 
    texts2=female_responses,
    attribute='gender'
)
cf_result['metrics']
# # Output is below
# {'Cosine Similarity': 0.8318708,
# 'RougeL Similarity': 0.5195852482361165,
# 'Bleu Similarity': 0.3278433712872481,
# 'Sentiment Bias': 0.0009947145187601957}
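
The similarity scores above compare each response in one group with its counterpart in the other group and average over pairs. As a rough illustration of one such score, the sketch below computes a ROUGE-L style F1 from the longest common subsequence of whitespace tokens. It is a simplification and not LangFair's implementation: the function names are ours, and real ROUGE implementations typically add tokenization and stemming.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(text1, text2):
    """F1 of LCS-based precision and recall over lowercase whitespace tokens."""
    t1, t2 = text1.lower().split(), text2.lower().split()
    lcs = lcs_length(t1, t2)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(t2), lcs / len(t1)
    return 2 * precision * recall / (precision + recall)


def mean_pairwise_similarity(texts1, texts2):
    """Average the score over aligned response pairs."""
    scores = [rouge_l_f1(a, b) for a, b in zip(texts1, texts2)]
    return sum(scores) / len(scores)


pair_score = rouge_l_f1(
    "the doctor said he would call", "the doctor said she would call"
)
```

Values near 1 indicate that the paired responses are nearly identical; lower values indicate that changing the attribute changed the output more.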

Alternative approach: Semi-automated evaluation with `AutoEval`

To streamline assessments for text generation and summarization use cases, the AutoEval class conducts a multi-step process that completes all of the aforementioned steps with two lines of code.

from langfair.auto import AutoEval
auto_object = AutoEval(
    prompts=prompts, 
    langchain_llm=llm,
    # toxicity_device=device # uncomment if GPU is available
)
results = await auto_object.evaluate()
results['metrics']
# # Output is below
# {'Toxicity': {'Toxic Fraction': 0.0004,
#   'Expected Maximum Toxicity': 0.013845130120171235,
#   'Toxicity Probability': 0.01},
#  'Stereotype': {'Stereotype Association': 0.3172750176745329,
#   'Cooccurrence Bias': 0.44766333654278373,
#   'Stereotype Fraction - gender': 0.08,
#   'Expected Maximum Stereotype - gender': 0.60355167388916,
#   'Stereotype Probability - gender': 0.27036},
#  'Counterfactual': {'male-female': {'Cosine Similarity': 0.8318708,
#    'RougeL Similarity': 0.5195852482361165,
#    'Bleu Similarity': 0.3278433712872481,
#    'Sentiment Bias': 0.0009947145187601957}}}
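
The nested dictionary above can be awkward to log or compare across runs. A small helper like the following flattens it into one level, keyed by the path to each value. This is a generic sketch that assumes only the nested shape shown in the sample output; flatten_metrics is a hypothetical name and not part of LangFair.

```python
def flatten_metrics(metrics, prefix=()):
    """Flatten nested metric dictionaries into {"A / B / C": value} form."""
    flat = {}
    for key, value in metrics.items():
        path = prefix + (key,)
        if isinstance(value, dict):
            flat.update(flatten_metrics(value, path))
        else:
            flat[" / ".join(path)] = value
    return flat


# A subset of the sample output shown above.
sample = {
    "Toxicity": {"Toxic Fraction": 0.0004, "Toxicity Probability": 0.01},
    "Counterfactual": {"male-female": {"Cosine Similarity": 0.8318708}},
}
flat = flatten_metrics(sample)
for name, value in flat.items():
    print(f"{name}: {value}")
```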

📚 Example Notebooks

Explore the following demo notebooks to see how to use LangFair for various bias and fairness evaluation metrics:

Toxicity Evaluation: A notebook demonstrating toxicity metrics.
Counterfactual Fairness Evaluation: A notebook illustrating how to generate counterfactual datasets and compute counterfactual fairness metrics.
Stereotype Evaluation: A notebook demonstrating stereotype metrics.
AutoEval for Text Generation / Summarization (Toxicity, Stereotypes, Counterfactual): A notebook illustrating how to use LangFair's AutoEval class for a comprehensive fairness assessment of text generation / summarization use cases. This assessment includes toxicity, stereotype, and counterfactual metrics.
Classification Fairness Evaluation: A notebook demonstrating classification fairness metrics.
Recommendation Fairness Evaluation: A notebook demonstrating recommendation fairness metrics.

🛠 Choosing Bias and Fairness Metrics for an LLM Use Case

Selecting the appropriate bias and fairness metrics is essential for accurately assessing the performance of large language models (LLMs) in specific use cases. Instead of attempting to compute all possible metrics, practitioners should focus on a relevant subset that aligns with their specific goals and the context of their application.

Our decision framework for selecting appropriate evaluation metrics is illustrated in the diagram below. For more details, refer to our research paper detailing the evaluation approach.

Note: Fairness through unawareness means none of the prompts for an LLM use case include any mention of protected attribute words.
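
The definition in the note can be checked mechanically by scanning prompts for protected attribute words. The sketch below shows the idea in plain Python; the word list is a hypothetical, deliberately short example, and the function name is ours, not a LangFair API.

```python
import re

# Hypothetical word list for illustration; a real check needs a fuller list.
PROTECTED_WORDS = {"he", "she", "man", "woman", "male", "female"}


def prompts_mentioning_protected_words(prompts, words=PROTECTED_WORDS):
    """Return (index, matched words) for every prompt that mentions a listed word."""
    flagged = []
    for i, prompt in enumerate(prompts):
        tokens = set(re.findall(r"[a-z']+", prompt.lower()))
        hits = sorted(tokens & set(words))
        if hits:
            flagged.append((i, hits))
    return flagged


example_prompts = ["Summarize this claim note.", "She asked about a refund."]
flagged = prompts_mentioning_protected_words(example_prompts)
# Fairness through unawareness holds only if no prompt was flagged.
satisfies_ftu = not flagged
```

If any prompt is flagged, the use case does not satisfy fairness through unawareness with respect to that word list.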

📊 Supported Bias and Fairness Metrics

Bias and fairness metrics offered by LangFair are grouped into several categories. The full suite of metrics is displayed below.

Toxicity Metrics

Expected Maximum Toxicity (Gehman et al., 2020)
Toxicity Probability (Gehman et al., 2020)
Toxic Fraction (Liang et al., 2023)

Counterfactual Fairness Metrics

Strict Counterfactual Sentiment Parity (Huang et al., 2020)
Weak Counterfactual Sentiment Parity (Bouchard, 2024)
Counterfactual Cosine Similarity Score (Bouchard, 2024)
Counterfactual BLEU (Bouchard, 2024)
Counterfactual ROUGE-L (Bouchard, 2024)

Stereotype Metrics

Stereotypical Associations (Liang et al., 2023)
Co-occurrence Bias Score (Bordia & Bowman, 2019)
Stereotype classifier metrics (Zekun et al., 2023; Bouchard, 2024)

Recommendation (Counterfactual) Fairness Metrics

Jaccard Similarity (Zhang et al., 2023)
Search Result Page Misinformation Score (Zhang et al., 2023)
Pairwise Ranking Accuracy Gap (Zhang et al., 2023)
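
As an illustration of the simplest of these metrics, Jaccard similarity compares the sets of items recommended for two counterfactual versions of the same prompt. The sketch below is a plain Python version under our own naming, with made-up recommendation lists; it is not LangFair's implementation.

```python
def jaccard_similarity(list1, list2):
    """Size of the intersection divided by size of the union (order ignored)."""
    set1, set2 = set(list1), set(list2)
    if not set1 and not set2:
        return 1.0  # convention chosen here: two empty lists count as identical
    return len(set1 & set2) / len(set1 | set2)


# Recommendations returned for two counterfactual prompts (made-up data).
recs_group_a = ["Inception", "Heat", "Alien"]
recs_group_b = ["Inception", "Alien", "Amelie"]
score = jaccard_similarity(recs_group_a, recs_group_b)
```

Because it works on sets, this score ignores ranking; the ranking-aware metrics in the list above address that.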

Classification Fairness Metrics

Predicted Prevalence Rate Disparity (Feldman et al., 2015; Bellamy et al., 2018; Saleiro et al., 2019)
False Negative Rate Disparity (Bellamy et al., 2018; Saleiro et al., 2019)
False Omission Rate Disparity (Bellamy et al., 2018; Saleiro et al., 2019)
False Positive Rate Disparity (Bellamy et al., 2018; Saleiro et al., 2019)
False Discovery Rate Disparity (Bellamy et al., 2018; Saleiro et al., 2019)
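
Each of these metrics compares an error rate or prediction rate between two groups. The sketch below works through one of them, the false positive rate, on binary labels. Expressing the disparity as a simple difference is an assumption made for illustration (toolkits also report ratios or absolute differences), and the function names are ours.

```python
def false_positive_rate(y_true, y_pred):
    """Share of actual negatives (0) that were predicted positive (1)."""
    negatives = sum(1 for t in y_true if t == 0)
    false_pos = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return false_pos / negatives if negatives else 0.0


def fpr_disparity(y_true, y_pred, groups, group_a, group_b):
    """Difference in false positive rate between two groups (a minus b)."""

    def rate(group):
        idx = [i for i, g in enumerate(groups) if g == group]
        return false_positive_rate(
            [y_true[i] for i in idx], [y_pred[i] for i in idx]
        )

    return rate(group_a) - rate(group_b)


# Made-up labels: four records per group.
y_true = [0, 0, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 1, 1, 0]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]
gap = fpr_disparity(y_true, y_pred, groups, "a", "b")
```

Here group a has one false positive among three negatives and group b has two, so the gap is negative: group b is wrongly flagged more often.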

📖 Associated Research

A technical description and a practitioner's guide for selecting evaluation metrics are contained in this paper. If you use our evaluation approach, we would appreciate citations to the following paper:

@misc{bouchard2024actionableframeworkassessingbias,
      title={An Actionable Framework for Assessing Bias and Fairness in Large Language Model Use Cases}, 
      author={Dylan Bouchard},
      year={2024},
      eprint={2407.10853},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2407.10853}, 
}

A high-level description of LangFair's functionality is contained in this paper. If you use LangFair, we would appreciate citations to the following paper:

@misc{bouchard2025langfairpythonpackageassessing,
      title={LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases}, 
      author={Dylan Bouchard and Mohit Singh Chauhan and David Skarbrevik and Viren Bajaj and Zeya Ahmad},
      year={2025},
      eprint={2501.03112},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2501.03112}, 
}

📄 Code Documentation

Please refer to our documentation site for more details on how to use LangFair.

🤝 Development Team

The open-source version of LangFair is the culmination of extensive work carried out by a dedicated team of developers. While the internal commit history will not be made public, we believe it's essential to acknowledge the significant contributions of our development team who were instrumental in bringing this project to fruition:

🤗 Contributing

Contributions are welcome. Please refer here for instructions on how to contribute to LangFair.
