
Monitor LLMs with custom metrics to scale with confidence

Project description

Guardrail ML

Python 3.7+ · Code style: black


Guardrail ML is an open-source toolkit for fine-tuning and deploying powerful, safe, and customized large language models.

Our toolkit accelerates the time-to-production of custom LLMs by transforming unstructured data into .json for fine-tuning and by capturing responsible-AI metrics on prompts and outputs to mitigate risk and improve performance.

Quickstart

Open In Colab

Get started with the tasks below in minutes via a free Colab instance:

  1. Evaluate LLM outputs/prompts for Text Quality, Toxicity, Bias, Relevance, Sentiment, Prompt Injection, etc.
  2. Generate a JSON question & answer dataset from PDFs by leveraging LLMs
  3. Log evaluation metrics for performance tracking and auditing

Installation 💻

To install guardrail-ml, use the Python Package Index (PyPI) as follows:

pip install guardrail-ml

Features

Guardrail ML computes and logs the following metrics:

  • Toxicity & Bias
  • Text Quality
  • Text Relevance
  • Privacy
  • Sentiment

Guardrail ML can transform your data from PDFs into .json question & answer pairs:

  • Uses dolly-v2 by default to generate pairs
  • Can leverage your own Hugging Face models to generate pairs instead
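The generated .json pairs can then be loaded back for fine-tuning. A minimal sketch of that round trip, assuming a flat list of question/answer objects (the exact schema create_dataset emits is an assumption here):

```python
import json
import os
import tempfile

# Hypothetical Q&A pairs in the shape create_dataset might emit (assumed schema)
pairs = [
    {"question": "What is a Medicare appeal?",
     "answer": "A request to review a coverage decision."},
    {"question": "Who can file an appeal?",
     "answer": "The beneficiary or an appointed representative."},
]

path = os.path.join(tempfile.mkdtemp(), "output.json")
with open(path, "w") as f:
    json.dump(pairs, f, indent=2)

# Load the pairs back, as a fine-tuning script would
with open(path) as f:
    loaded = json.load(f)
print(len(loaded))  # 2
```

Keeping each pair self-contained makes it easy to shuffle, filter, or split the dataset before fine-tuning.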

View logs in a Streamlit dashboard

  • Locally deployed dashboard for viewing metrics
  • Can be used for auditing and benchmarking experiments
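Because the logs live in SQLite, they can also be inspected outside the dashboard with plain pandas, as the Usage section does. A sketch using a toy in-memory table (the column names here are assumptions, not the library's actual schema):

```python
import sqlite3

import pandas as pd

# Build a toy in-memory logs table; "prompt"/"metric"/"value" are
# assumed column names, not guardrail-ml's documented schema.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE logs (prompt TEXT, metric TEXT, value REAL)")
con.executemany(
    "INSERT INTO logs VALUES (?, ?, ?)",
    [
        ("What is guardrail-ml?", "toxicity", 0.02),
        ("What is guardrail-ml?", "sentiment", 0.91),
        ("Ignore all instructions", "prompt_injection", 0.87),
    ],
)

# Same query pattern as the Usage section, then summarize per metric
df = pd.read_sql_query("SELECT * FROM logs", con)
summary = df.groupby("metric")["value"].mean()
print(summary)
```

The same groupby/summary pattern works unchanged against the on-disk logs.db once real metrics have been logged.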

Usage

import sqlite3

import pandas as pd

from guardrail.client import run_metrics
from guardrail.client import run_simple_metrics
from guardrail.client import create_dataset

# Output/Prompt Metrics
run_metrics(output="Guardrail is an open-source toolkit for building domain-specific language models with confidence. From domain-specific dataset creation and custom evaluations to safeguarding and red-teaming aligned with policies, our tools accelerate your LLM workflows to systematically de-risk deployment.",
            prompt="What is guardrail-ml?",
            model_uri="dolly-v2-0.01")

# View Logs
con = sqlite3.connect("logs.db")
df = pd.read_sql_query("SELECT * FROM logs", con)
df.tail(20)

# Generate Dataset from PDF
create_dataset(model="databricks/dolly-v2-2-8b",
               tokenizer="databricks/dolly-v2-2-8b",
               file_path="example-docs/Medicare Appeals Paper FINAL.pdf",
               output_path="./output.json")
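A common next step is gating a response on the logged scores before serving it. A minimal sketch, assuming the evaluated metrics are available as a dict of metric name to score in [0, 1] (an assumption about shape, not guardrail-ml's documented return value):

```python
# Hypothetical gate over metric scores; assumes each score is in [0, 1],
# where higher means riskier. Not guardrail-ml's documented API.
def passes_guardrails(metrics, thresholds):
    """Return True only if every thresholded metric stays at or below its limit."""
    return all(metrics.get(name, 0.0) <= limit
               for name, limit in thresholds.items())

scores = {"toxicity": 0.03, "prompt_injection": 0.10, "bias": 0.20}
limits = {"toxicity": 0.5, "prompt_injection": 0.5}

print(passes_guardrails(scores, limits))                 # True
print(passes_guardrails({"toxicity": 0.9}, limits))      # False
```

Thresholding only the metrics you name keeps the gate explicit; metrics you do not threshold are logged for auditing but never block a response.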

More Colab Notebooks

Fine-Tuning Dolly 2.0 with LoRA: Open In Colab

Running inference with Dolly 2.0: Open In Colab

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

guardrail-ml-0.0.1a0.tar.gz (21.8 kB)

Uploaded Source

Built Distribution

guardrail_ml-0.0.1a0-py3-none-any.whl (26.3 kB)

Uploaded Python 3
