
Adding guardrails to large language models.

Project description

🛤️ Guardrails

Guardrails is an open-source Python package for specifying structure and types, and for validating and correcting the outputs of large language models (LLMs).

Docs

Note: Guardrails is an alpha release, so expect sharp edges and bugs.

Guardrails provides a format (.rail) for enforcing a specification on an LLM output, and a lightweight wrapper around LLM API calls to implement this spec.

  1. .rail (Reliable AI Markup Language) files for specifying structure and type information, validators, and corrective actions over LLM outputs.
  2. gd.Guard, a lightweight wrapper around LLM API calls that structures, validates, and corrects the outputs.

📦 Installation

pip install guardrails-ai
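
Note that the distribution name on PyPI (guardrails-ai) differs from the import name. A quick, minimal way to confirm the install worked is to import the package and resolve the Guard class used throughout the examples below:

# The PyPI distribution is `guardrails-ai`, but the module is imported as `guardrails`.
import guardrails as gd

print(gd.Guard)  # resolving the Guard class confirms the install; an ImportError means it failed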

🛣️ Roadmap

  • adding more examples, new use cases and domains
  • integrations with langchain, gpt-index, minichain, manifest
  • expanding offered validators
  • more transpilers from .rail -> LLM prompt (e.g. .rail -> TypeScript)
  • informative logging
  • improving reasking logic
  • a guardrails.js implementation
  • VSCode extension for .rail files
  • next version of .rail format

🚀 Getting Started

Let's go through an example where we ask an LLM to explain what a "bank run" is, and generate URL links to relevant news articles. We'll generate a .rail spec for this and then use Guardrails to enforce it. You can see more examples in the docs.

📝 Creating a RAIL spec

RAIL (Reliable AI Markup Language, with file extension .rail) is a dialect of XML that describes the expected structure and types of the LLM output, the quality criteria for the output to be considered valid, and the corrective actions to take if the output is invalid.

  • Create a RAIL spec that requests the LLM to generate an object with two fields: explanation and follow_up_url.
  • For the explanation field, the generated string should be between 200 and 280 characters long. If the explanation is not of valid length, reask the LLM.
  • For the follow_up_url field, the URL should be reachable. If the URL is not reachable, we will filter it out of the response.
<rail version="0.1">
<output>
    <object name="bank_run" format="length: 2">
        <string
            name="explanation"
            description="A paragraph about what a bank run is."
            format="length: 200 280"
            on-fail-length="reask"
        />
        <url
            name="follow_up_url"
            description="A web URL where I can read more about bank runs."
            required="true"
            format="valid-url"
            on-fail-valid-url="filter"
        />
    </object>
</output>

<prompt>
Explain what a bank run is in a tweet.

@xml_prefix_prompt

{output_schema}

@json_suffix_prompt_v2_wo_none
</prompt>
</rail>
"""

We specify our quality criteria (generated length, URL reachability) in the format fields of the RAIL spec above. We reask the LLM if explanation is not valid, and filter out follow_up_url if it is not valid.

🛠️ Using Guardrails to enforce the RAIL spec

Next, we'll use the RAIL spec to create a Guard object. The Guard object will wrap the LLM API call and enforce the RAIL spec on its output.
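
The example below assumes the RAIL spec has been saved to a file whose handle is f. One way to do that (a minimal sketch, assuming the XML above is held in a string named rail_spec) is to write it to a temporary .rail file:

import tempfile

# Assumption: `rail_spec` is a Python string containing the RAIL XML shown above.
f = tempfile.NamedTemporaryFile(mode="w", suffix=".rail", delete=False)
f.write(rail_spec)
f.flush()  # make sure the spec is on disk before handing `f.name` to Guard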

import guardrails as gd

# `f` is the file containing the RAIL spec (see the sketch above)
guard = gd.Guard.from_rail(f.name)

The Guard object compiles the RAIL specification and adds it to the prompt. (Right now this is a pass-through operation; more compilers are planned to find better ways to express the spec in a prompt.)

print(guard.base_prompt)
Explain what a bank run is in a tweet.

Given below is XML that describes the information to extract from this document and the tags to extract it into.

<output>
    <object name="bank_run" format="length: 2">
        <string name="explanation" description="A paragraph about what a bank run is." format="length: 200 280" on-fail-length="reask" />
        <url name="follow_up_url" description="A web URL where I can read more about bank runs." required="true" format="valid-url" on-fail-valid-url="filter" />
    </object>
</output>

ONLY return a valid JSON object (no other text is necessary). The JSON MUST conform to the XML format, including any types and format requests e.g. requests for lists, objects and specific types. Be correct and concise.

JSON Output:

Call the Guard object with the LLM API callable as the first argument, and pass any additional arguments for the LLM API call as the remaining keyword arguments.

import openai

# Wrap the OpenAI API call with the `guard` object
raw_llm_output, validated_output = guard(
    openai.Completion.create,
    engine="text-davinci-003",
    max_tokens=1024,
    temperature=0.3,
)

print(validated_output)
{
    'bank_run': {
        'explanation': 'A bank run is when a large number of people withdraw their deposits from a bank due to concerns about its solvency. This can cause a financial crisis if the bank is unable to meet the demand for withdrawals.',
        'follow_up_url': 'https://www.investopedia.com/terms/b/bankrun.asp'
    }
}
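
As printed above, validated_output is a plain dictionary that mirrors the structure declared in the RAIL spec, so the individual fields can be read with ordinary indexing (a small illustrative snippet, not part of the original example):

# Pull the validated fields out of the nested dictionary
explanation = validated_output["bank_run"]["explanation"]
follow_up_url = validated_output["bank_run"]["follow_up_url"]
print(len(explanation), follow_up_url)  # the length should fall within the 200-280 character bound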


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

guardrails-ai-0.1.0rc2.tar.gz (23.2 kB)

Uploaded: Source

Built Distribution

guardrails_ai-0.1.0rc2-py2.py3-none-any.whl (23.7 kB)

Uploaded: Python 2, Python 3

File details

Details for the file guardrails-ai-0.1.0rc2.tar.gz.

File metadata

  • Download URL: guardrails-ai-0.1.0rc2.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for guardrails-ai-0.1.0rc2.tar.gz

  • SHA256: 8a90b7af91056ebe381c140d614debe48bdf3cb9c4033d6d0bbd73d01295c790
  • MD5: fee23d8c384cbf223cafd439b4438c25
  • BLAKE2b-256: 17dd3767da868dbe6cd6d7e3c245dff96695bff704ff9f62bb19428a71546bfc

See more details on using hashes here.
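
For example, a downloaded sdist can be checked against the SHA256 digest above with a few lines of Python (assuming the file is in the current directory):

import hashlib

# Published SHA256 digest for guardrails-ai-0.1.0rc2.tar.gz (from the list above)
expected = "8a90b7af91056ebe381c140d614debe48bdf3cb9c4033d6d0bbd73d01295c790"
with open("guardrails-ai-0.1.0rc2.tar.gz", "rb") as fh:
    actual = hashlib.sha256(fh.read()).hexdigest()
print("OK" if actual == expected else "hash mismatch")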

File details

Details for the file guardrails_ai-0.1.0rc2-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for guardrails_ai-0.1.0rc2-py2.py3-none-any.whl

  • SHA256: 179526fdf1890c60c57f11e8d7edd15b70e0451e8ea985f99c6ac3c122d7f2c7
  • MD5: 85c7abeca0fa3c31f1802e119d506d99
  • BLAKE2b-256: 4a79a4782456dc7397da19198ee3471ae23dc05b9a693f15b7cf2ef03a0bd234

See more details on using hashes here.
