LangCore guardrail provider — validates, retries, and self-corrects LLM output against schemas

Project description

LangCore Guardrails

Provider plugin for LangCore — output validation, automatic retry with corrective prompts, and a rich set of pluggable validators.

Overview

langcore-guardrails is a provider plugin for LangCore that validates LLM output against schemas, confidence thresholds, regex patterns, and custom business rules — automatically retrying with corrective prompts when validation fails. It wraps any BaseLanguageModel and acts as a self-correcting layer between the LLM and your application.

Features

Corrective retry loop — when validation fails, constructs a corrective prompt with the original request, invalid output, and error details, then re-prompts the LLM (up to max_retries)
7 built-in validators:
- JsonSchemaValidator — validates output against a JSON Schema with auto-repair and markdown fence stripping
- RegexValidator — matches output against regular expression patterns
- SchemaValidator — validates against Pydantic BaseModel classes (strict or lenient mode)
- ConfidenceThresholdValidator — rejects extractions below a confidence score threshold
- FieldCompletenessValidator — ensures required fields are present and non-empty
- ConsistencyValidator — cross-checks extracted values using custom business rules
- GroundingValidator — rejects hallucinated or poorly-grounded extractions using LangCore's alignment engine
4 on-fail actions — EXCEPTION (raise immediately), REASK (re-prompt LLM), FILTER (silently discard), NOOP (log and continue)
Validator chaining — compose multiple validators with per-validator failure actions via ValidatorChain
Validator registry — register custom validators by name with @register_validator for discovery and reuse
Error-only correction mode — omit invalid output from retry prompts to save tokens on large payloads
Correction prompt truncation — bound prompt and output length in corrective prompts
Custom correction templates — fully customizable retry prompt format
Markdown fence stripping — automatic via json-repair for LLMs that wrap JSON in code fences
Batch-independent retries — each prompt in a batch retries independently
Async concurrency control — max_concurrency semaphore for async batches
Zero-config plugin — auto-registered via Python entry points

Installation

pip install langcore-guardrails

Quick Start

Integration with LangCore

langcore-guardrails integrates with LangCore through the decorator provider pattern. Wrap any LangCore model to add output validation:

import langcore as lx
from pydantic import BaseModel, Field
from langcore_guardrails import (
    GuardrailLanguageModel,
    SchemaValidator,
    ConfidenceThresholdValidator,
    OnFailAction,
)

# Define expected output schema
class ContractEntity(BaseModel):
    party_name: str = Field(description="Name of the contracting party")
    role: str = Field(description="Role in the contract (buyer, seller, etc.)")
    effective_date: str = Field(description="Contract effective date (YYYY-MM-DD)")

# Create the base LLM provider
inner_model = lx.factory.create_model(
    lx.factory.ModelConfig(model_id="litellm/gpt-4o", provider="LiteLLMLanguageModel")
)

# Wrap with guardrails
guard_model = GuardrailLanguageModel(
    model_id="guardrails/gpt-4o",
    inner=inner_model,
    validators=[
        SchemaValidator(ContractEntity, on_fail=OnFailAction.REASK),
        ConfidenceThresholdValidator(min_confidence=0.7, on_fail=OnFailAction.FILTER),
    ],
    max_retries=3,
)

# Use as a drop-in replacement — validation and retries happen automatically
result = lx.extract(
    text_or_documents="This Agreement is entered into by Acme Corp (Seller) and Beta LLC (Buyer), effective January 15, 2025.",
    model=guard_model,
    prompt_description="Extract contracting parties with roles and dates.",
    examples=[
        lx.data.ExampleData(
            text="Agreement between Alpha Inc (Lessor) and Omega Ltd (Lessee), effective March 1, 2024.",
            extractions=[
                lx.data.Extraction("party", "Alpha Inc", attributes={"role": "Lessor", "effective_date": "2024-03-01"}),
                lx.data.Extraction("party", "Omega Ltd", attributes={"role": "Lessee", "effective_date": "2024-03-01"}),
            ],
        )
    ],
)

Built-in Validators

JsonSchemaValidator

Validates LLM output against a JSON Schema. Automatically strips markdown fences and repairs common JSON issues:

from langcore_guardrails import JsonSchemaValidator

schema = {
    "type": "object",
    "properties": {
        "parties": {"type": "array", "items": {"type": "string"}},
        "effective_date": {"type": "string"},
    },
    "required": ["parties", "effective_date"],
}

validator = JsonSchemaValidator(schema=schema, strict=True)

RegexValidator

Validates that output matches a regular expression pattern:

from langcore_guardrails import RegexValidator

validator = RegexValidator(r'\d{4}-\d{2}-\d{2}', description="ISO date format")

SchemaValidator

Validates extraction output against a Pydantic BaseModel. Supports strict mode (no type coercion) and lenient mode:

from pydantic import BaseModel, Field
from langcore_guardrails import SchemaValidator, OnFailAction

class Invoice(BaseModel):
    invoice_number: str = Field(description="Invoice ID")
    amount: float = Field(description="Total amount in USD")
    due_date: str = Field(description="Due date (YYYY-MM-DD)")

validator = SchemaValidator(
    Invoice,
    on_fail=OnFailAction.REASK,  # Re-prompt LLM on failure
    strict=False,                 # Allow type coercion (e.g., "100" → 100.0)
)

ConfidenceThresholdValidator

Rejects extractions whose confidence_score falls below a threshold. Works with LangCore's built-in multi-pass confidence scoring:

from langcore_guardrails import ConfidenceThresholdValidator, OnFailAction

validator = ConfidenceThresholdValidator(
    min_confidence=0.7,
    score_key="confidence_score",
    on_fail=OnFailAction.FILTER,  # Silently discard low-confidence results
)

FieldCompletenessValidator

Ensures all required Pydantic fields are present and non-empty (rejects empty strings, empty lists, and None):

from langcore_guardrails import FieldCompletenessValidator

validator = FieldCompletenessValidator(Invoice, on_fail=OnFailAction.REASK)

ConsistencyValidator

Cross-checks extracted values using user-supplied business rules. Each rule returns None on success or an error string on failure:

from langcore_guardrails import ConsistencyValidator

def dates_ordered(data: dict) -> str | None:
    start = data.get("start_date", "")
    end = data.get("end_date", "")
    if start and end and start > end:
        return "start_date must be before end_date"
    return None

def positive_amount(data: dict) -> str | None:
    if data.get("amount", 0) < 0:
        return "amount must be non-negative"
    return None

validator = ConsistencyValidator(
    rules=[dates_ordered, positive_amount],
    on_fail=OnFailAction.REASK,
)

GroundingValidator

Rejects extractions that are not grounded in the source text. Works post-alignment — uses LangCore's alignment engine signals (alignment_status and char_interval) to detect hallucinated or weakly-grounded extractions:

Unaligned — no alignment_status (hallucinated text not found in source)
Below minimum alignment quality — e.g. only fuzzy-matched when MATCH_LESSER or better is required
Below minimum character coverage — the overlapping span in the source text is too short relative to the extraction

from langcore_guardrails import GroundingValidator, OnFailAction

validator = GroundingValidator(
    min_alignment_quality="MATCH_FUZZY",  # Accepts all aligned, rejects unaligned
    min_coverage=0.5,                      # At least 50% character overlap
    on_fail=OnFailAction.FILTER,           # Silently remove hallucinated extractions
)

# Use after alignment (post-extraction)
passed, filtered = validator.validate_extractions(
    extractions=result.extractions,
    source_text=result.text,
)

# Or convenience method for AnnotatedDocument
passed, filtered = validator.validate_document(result)

Alignment quality levels (best → worst):

Level	Meaning
`MATCH_EXACT`	Extraction text found verbatim in source
`MATCH_GREATER`	Source span is larger than extraction text
`MATCH_LESSER`	Source span is smaller than extraction text
`MATCH_FUZZY`	Approximate match only

Note: GroundingValidator is a post-alignment validator. The validate() method (raw LLM output) always passes. Use validate_extractions() or validate_document() after LangCore's alignment step for meaningful grounding checks.

On-Fail Actions

The OnFailAction enum controls what happens when a validator fails:

Action	Behaviour
`EXCEPTION`	Raise a `GuardrailValidationError` immediately
`REASK`	Re-prompt the LLM with the validation error as feedback
`FILTER`	Silently discard the failing output
`NOOP`	Log the failure but return the output as-is

Validator Chaining

Compose multiple validators with per-validator failure actions using ValidatorChain:

from langcore_guardrails import (
    ValidatorChain,
    ValidatorEntry,
    SchemaValidator,
    ConfidenceThresholdValidator,
    FieldCompletenessValidator,
    OnFailAction,
)

chain = ValidatorChain([
    ValidatorEntry(SchemaValidator(Invoice), OnFailAction.REASK),
    ValidatorEntry(FieldCompletenessValidator(Invoice), OnFailAction.REASK),
    ValidatorEntry(
        ConfidenceThresholdValidator(min_confidence=0.7),
        OnFailAction.FILTER,
    ),
])

result = chain.run(llm_output)
if result.should_reask:
    print("Re-prompting LLM:", result.error_messages)
elif result.should_filter:
    print("Filtering low-confidence extractions")

Custom Validators

Implementing a Validator

Create custom validators by implementing the GuardrailValidator interface:

from langcore_guardrails import GuardrailValidator, ValidationResult

class MaxLengthValidator(GuardrailValidator):
    def __init__(self, max_chars: int) -> None:
        self._max = max_chars

    def validate(self, output: str) -> ValidationResult:
        if len(output) <= self._max:
            return ValidationResult(valid=True)
        return ValidationResult(
            valid=False,
            error_message=f"Output exceeds {self._max} characters ({len(output)} chars)",
        )

Registering for Discovery

from langcore_guardrails import register_validator, get_validator

@register_validator(name="max_length")
class MaxLengthValidator(GuardrailValidator):
    def __init__(self, max_chars: int = 5000) -> None:
        self._max = max_chars

    def validate(self, output: str) -> ValidationResult:
        if len(output) <= self._max:
            return ValidationResult(valid=True)
        return ValidationResult(
            valid=False,
            error_message=f"Output exceeds {self._max} chars ({len(output)})",
        )

# Retrieve by name anywhere in your codebase
cls = get_validator("max_length")
validator = cls(max_chars=10000)

Advanced Configuration

Error-Only Correction Mode

Omit the invalid output from the correction prompt to save tokens on large payloads:

guard_model = GuardrailLanguageModel(
    model_id="guardrails/gpt-4o",
    inner=inner_model,
    validators=[SchemaValidator(Invoice)],
    include_output_in_correction=False,
    max_retries=3,
)

Correction Prompt Truncation

Limit the size of corrective prompts to control token costs:

guard_model = GuardrailLanguageModel(
    model_id="guardrails/gpt-4o",
    inner=inner_model,
    validators=[JsonSchemaValidator()],
    max_correction_prompt_length=2000,
    max_correction_output_length=1000,
)

Custom Correction Template

Provide your own template for corrective retry prompts:

template = (
    "The output was invalid.\n"
    "Original request: {original_prompt}\n"
    "Your response: {invalid_output}\n"
    "Error: {error_message}\n"
    "Please fix the output."
)

guard_model = GuardrailLanguageModel(
    model_id="guardrails/gpt-4o",
    inner=inner_model,
    validators=[JsonSchemaValidator()],
    correction_template=template,
)

Async Usage

Each prompt in a batch retries independently:

results = await guard_model.async_infer(["prompt1", "prompt2"])

How It Works

The original prompt is sent to the inner provider
The response is validated against all configured validators (in order)
If validation passes, the result is returned unchanged
If validation fails:
- A corrective prompt is constructed with the original prompt, invalid output, and error message
- The corrective prompt is sent to the inner provider
- Steps 2–4 repeat up to max_retries times
If all retries are exhausted, the last result is returned with score=0.0

Composing with Other Plugins

langcore-guardrails composes naturally with other LangCore decorator providers:

import langcore as lx
from langcore_audit import AuditLanguageModel, JsonFileSink
from langcore_guardrails import GuardrailLanguageModel, SchemaValidator, OnFailAction
from langcore_hybrid import HybridLanguageModel, RegexRule, RuleConfig

# Base provider
llm = lx.factory.create_model(
    lx.factory.ModelConfig(model_id="litellm/gpt-4o", provider="LiteLLMLanguageModel")
)

# Layer 1: Rule-based extraction for known patterns
hybrid = HybridLanguageModel(
    model_id="hybrid/gpt-4o", inner=llm,
    rule_config=RuleConfig(rules=[RegexRule(r"Date:\s*(?P<date>\d{4}-\d{2}-\d{2})")]),
)

# Layer 2: Output validation
guarded = GuardrailLanguageModel(
    model_id="guardrails/gpt-4o", inner=hybrid,
    validators=[SchemaValidator(MySchema, on_fail=OnFailAction.REASK)],
    max_retries=3,
)

# Layer 3: Audit logging
audited = AuditLanguageModel(
    model_id="audit/gpt-4o", inner=guarded,
    sinks=[JsonFileSink("./audit.jsonl")],
)

result = lx.extract(text_or_documents="...", model=audited, ...)

Development

pip install -e ".[dev]"
pytest

Requirements

Python ≥ 3.12
langcore
jsonschema ≥ 4.26.0
json-repair ≥ 0.58.0
pydantic ≥ 2.12.5

License

Apache License 2.0 — see LICENSE for details.

Project details

Release history Release notifications | RSS feed

This version

1.3.1

Feb 24, 2026

1.3.0

Feb 24, 2026

1.2.1

Feb 23, 2026

1.2.0

Feb 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

langcore_guardrails-1.3.1.tar.gz (37.6 kB view details)

Uploaded Feb 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

langcore_guardrails-1.3.1-py3-none-any.whl (26.6 kB view details)

Uploaded Feb 24, 2026 Python 3

File details

Details for the file langcore_guardrails-1.3.1.tar.gz.

File metadata

Download URL: langcore_guardrails-1.3.1.tar.gz
Upload date: Feb 24, 2026
Size: 37.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for langcore_guardrails-1.3.1.tar.gz
Algorithm	Hash digest
SHA256	`73edd7f092c6fd82191d89b78d474a0322782b9cd956ff5e1ae42420384c7706`
MD5	`317dbcace122d1c594e4b22939d51052`
BLAKE2b-256	`82bda17c4d62366fddda3ad5dace82fb64a3e2be3d8ae9dc19222f69bbddc95f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for langcore_guardrails-1.3.1.tar.gz:

Publisher: release.yml on IgnatG/langcore-guardrails

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langcore_guardrails-1.3.1.tar.gz
- Subject digest: 73edd7f092c6fd82191d89b78d474a0322782b9cd956ff5e1ae42420384c7706
- Sigstore transparency entry: 987120459
- Sigstore integration time: Feb 24, 2026
Source repository:
- Permalink: IgnatG/langcore-guardrails@e08b9bd219258a37ab2d5b918568f7d8251bafbf
- Branch / Tag: refs/heads/main
- Owner: https://github.com/IgnatG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@e08b9bd219258a37ab2d5b918568f7d8251bafbf
- Trigger Event: push

File details

Details for the file langcore_guardrails-1.3.1-py3-none-any.whl.

File metadata

Download URL: langcore_guardrails-1.3.1-py3-none-any.whl
Upload date: Feb 24, 2026
Size: 26.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for langcore_guardrails-1.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8b29a1d61c27d532dfee9db59fc82259b13ad4090744d823358b4fc5ab0e1184`
MD5	`e86971e38445be421060dda6f44be8f8`
BLAKE2b-256	`8a5b8ad9f31f38ff514d1e472cf24891722425d90f584816016ed1137552ab8d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for langcore_guardrails-1.3.1-py3-none-any.whl:

Publisher: release.yml on IgnatG/langcore-guardrails

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: langcore_guardrails-1.3.1-py3-none-any.whl
- Subject digest: 8b29a1d61c27d532dfee9db59fc82259b13ad4090744d823358b4fc5ab0e1184
- Sigstore transparency entry: 987120506
- Sigstore integration time: Feb 24, 2026
Source repository:
- Permalink: IgnatG/langcore-guardrails@e08b9bd219258a37ab2d5b918568f7d8251bafbf
- Branch / Tag: refs/heads/main
- Owner: https://github.com/IgnatG
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@e08b9bd219258a37ab2d5b918568f7d8251bafbf
- Trigger Event: push

langcore-guardrails 1.3.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

LangCore Guardrails

Overview

Features

Installation

Quick Start

Integration with LangCore

Built-in Validators

JsonSchemaValidator

RegexValidator

SchemaValidator

ConfidenceThresholdValidator

FieldCompletenessValidator

ConsistencyValidator

GroundingValidator

On-Fail Actions

Validator Chaining

Custom Validators

Implementing a Validator

Registering for Discovery

Advanced Configuration

Error-Only Correction Mode

Correction Prompt Truncation

Custom Correction Template

Async Usage

How It Works

Composing with Other Plugins

Development

Requirements

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance