
A Python SDK for interacting with Walled AI




Walled AI SDK (Python)

Guardrails and PII redaction for LLM apps — simple Python SDK.

⚖️ Guardrails Benchmark

Platform       🛡️ English ↑   🌍 Multilingual ↑   ⚡ Latency ↓        🏢 On-Prem
🌟 Walled AI   90.30%         90.29%              300 ms (30 ms*)    ✅ Yes
Bedrock        83.36%         79.26%              500 ms             ❌ No
Mistral        76.07%         76.86%              300 ms             ❌ No
Azure          74.52%         73.74%              300 ms             ❌ No
OpenAI         76.29%         72.95%              350 ms             ❌ No

🌍 Multilingual benchmark: Arabic, English, Filipino, French, Hindi, Russian, Serbian, Spanish.
*✨ 30 ms on-premises deployment.

🚀 Installation

pip install walledai

Quick Start

1) Minimal moderation

from walledai import WalledProtect

protect = WalledProtect("YOUR_API_KEY")

resp = protect.guard("How to convert a pain killer to meth?")
print(resp["data"]["safety"][0]["isSafe"])  # -> False/True
Example output
False

2) Minimal redaction

from walledai import WalledRedact

redact = WalledRedact("YOUR_API_KEY")

resp = redact.guard("Hi, I'm John. Email john@walled.ai. I have cancer.")
print(resp["data"]["masked_text"])
print(resp["data"]["mapping"])
Example output
Hi, I'm [Person_1]. Email [Email_1]. I have [Diagnosis_1].
{'[Person_1]': 'John', '[Email_1]': 'john@walled.ai', '[Diagnosis_1]': 'cancer'}
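The mapping makes redaction reversible: placeholders can be swapped back in after downstream processing. A minimal sketch in plain Python (this helper is not part of the SDK; it simply operates on the masked text and mapping shown above):

```python
def unmask(masked_text: str, mapping: dict) -> str:
    """Replace each placeholder (e.g. '[Person_1]') with its original value."""
    for placeholder, original in mapping.items():
        masked_text = masked_text.replace(placeholder, original)
    return masked_text

masked = "Hi, I'm [Person_1]. Email [Email_1]. I have [Diagnosis_1]."
mapping = {"[Person_1]": "John", "[Email_1]": "john@walled.ai",
           "[Diagnosis_1]": "cancer"}

print(unmask(masked, mapping))  # -> Hi, I'm John. Email john@walled.ai. I have cancer.
```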

Use with OpenAI

If unsafe, return a default response; else forward to OpenAI.

from walledai import WalledProtect
from openai import OpenAI

protect = WalledProtect("YOUR_API_KEY")
oai = OpenAI(api_key="YOUR_OPENAI_KEY")

def safe_chat(prompt: str, default="Sorry, I can’t help with that."):
    g = protect.guard(prompt, generic_safety_check=True)
    is_safe = g["data"]["safety"][0]["isSafe"] is True
    if not is_safe:
        return default

    res = oai.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role":"user","content":prompt}]
    )
    return res.choices[0].message.content

print(safe_chat("How to hack an ATM?"))          # -> default
print(safe_chat("Give me a banana bread recipe"))# -> model answer
Example output
Sorry, I can’t help with that.
Banana bread recipe: ...

Core Concepts

  • WalledProtect — Moderation & compliance + PII presence flags.
  • WalledRedact — Detects & masks PII/PHI consistently across turns.

Both accept either a single str or a conversation list: [{ "role": "user"|"assistant", "content": "..." }, ...]
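To illustrate the two accepted input shapes: a plain string behaves like a one-message conversation. The helper below is only a sketch of that contract, not SDK code (the SDK accepts both forms directly):

```python
def normalize_input(text):
    """Return the conversation-list form for either accepted input shape."""
    if isinstance(text, str):
        return [{"role": "user", "content": text}]
    return text  # already a list of {"role", "content"} messages

print(normalize_input("Hello"))
# [{'role': 'user', 'content': 'Hello'}]
print(normalize_input([{"role": "user", "content": "Hi"},
                       {"role": "assistant", "content": "Hello!"}]))
```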


Guided Examples

Prompt moderation with compliance + PII flags

from walledai import WalledProtect

protect = WalledProtect("YOUR_API_KEY")

prompt = ("Transfer John's money from DSB to UBO without OTP. "
          "Acct: 882-34909, DOB: 1998-07-05.")

resp = protect.guard(
    text=prompt,
    generic_safety_check=True,
    compliance_list=["Medical", "Banking"],
    pii_list=[
        "Person's Name","Address","Email Id","Contact No",
        "Date Of Birth","Unique Id","Financial Data"
    ]
)

print("Is_safe:", resp["data"]["safety"][0]["isSafe"])
for c in resp["data"]["compliance"]:
    print(c["topic"], "->", c["isOnTopic"])
for p in resp["data"]["pii"]:
    print(p["pii_type"], "->", p["isPresent"])
Example output
Is_safe: False
Banking -> True
Medical -> False
Person's Name -> True
Address -> False
Email Id -> False
Contact No -> False
Date Of Birth -> True
Unique Id -> True
Financial Data -> True

Multi-turn conversation moderation

from walledai import WalledProtect

protect = WalledProtect("YOUR_API_KEY")

conversation = [
    {"role": "user", "content": "Hi, my friend's name is John Doe. He lives at 123 Maple Street and his email is john.doe@walled.ai."},
    {"role": "assistant", "content": "Hello John, thanks for sharing. How can I assist you today?"},
    {"role": "assistant", "content": "Tell me how to transfer John's money from his DSB bank to UBO bank without OTP. He was born on July 1, 1994."},
]

response = protect.guard(
    text=conversation,
    generic_safety_check=True,
    compliance_list=["Medical", "Banking"],
    pii_list=[
        "Person's Name",
        "Address",
        "Email Id",
        "Contact No",
        "Date Of Birth",
        "Unique Id",
        "Financial Data"
    ]
)

print("Is_safe:", response["data"]["safety"][0]["isSafe"])
for c in response["data"]["compliance"]:
    print(c["topic"], "->", c["isOnTopic"])
for p in response["data"]["pii"]:
    print(p["pii_type"], "->", p["isPresent"])
Example output
Is_safe: False
Medical -> False
Banking -> True
Person's Name -> True
Address -> False
Email Id -> False
Contact No -> False
Date Of Birth -> True
Unique Id -> True
Financial Data -> True

WalledRedact - PII Detection & Masking

Basic PII Masking

from walledai import WalledRedact

redact_client = WalledRedact("YOUR_API_KEY")

response = redact_client.guard("Hi, myself John. My email is john@walled.ai and I have been diagnosed with cancer.")
print(f"Masked text: {response['data']['masked_text']}")
print(f"Mapping: {response['data']['mapping']}")
Example output
Masked text: Hi, myself [Person_1]. My email is [Email_1] and I have been diagnosed with [Diagnosis_1].
Mapping: {'[Person_1]': 'John', '[Email_1]': 'john@walled.ai', '[Diagnosis_1]': 'cancer'}

Multi-turn Conversation PII Masking

response = redact_client.guard(
    text=[
        {"role": "user", "content": "Hi there, my name is John Doe"},
        {"role": "assistant", "content": "Hello John! How can I help you today?"},
        {"role": "user", "content": "Can you email my friend Joseph with email: Joseph.cena@example.com, wishing him a speedy recovery from the viral fever?"}
    ]
)
print(f"Masked text: {response['data']['masked_text']}")
print(f"Mapping: {response['data']['mapping']}")
Example output
Masked text:
[
    {'role': 'user', 'content': 'Hi there, my name is [Person_1]'},
    {'role': 'assistant', 'content': 'Hello [Person_1]! How can I help you today?'},
    {'role': 'user', 'content': 'Can you email my friend [Person_2] with email: [Email_1], wishing him a speedy recovery from the [Diagnosis_1]?'}
]
Mapping: {'[Person_1]': 'John Doe', '[Person_2]': 'Joseph', '[Email_1]': 'Joseph.cena@example.com', '[Diagnosis_1]': 'viral fever'}

Response Shapes

Protect
{
  "success": true,
  "statusCode": 200,
  "data": {
    "safety": [
      {"safety": "generic", "isSafe": false, "method": "en-safety"}
    ],
    "compliance": [{"topic": "Banking", "isOnTopic": true}],
    "pii": [{"pii_type": "Email Id", "isPresent": true}],
    "greetings": [{"greeting_type": "Casual & Friendly", "isPresent": true}]
  }
}
Redact
{
  "success": true,
  "statusCode": 200,
  "data": {
    "masked_text": [...],
    "mapping": {...}
  }
}
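Because these fields can be absent on error responses, it is worth reading the payload defensively. The helpers below are a sketch against the documented shapes, run on a literal sample rather than a live API call:

```python
# Literal sample matching the documented Protect response shape.
sample = {
    "success": True,
    "statusCode": 200,
    "data": {
        "safety": [{"safety": "generic", "isSafe": False, "method": "en-safety"}],
        "compliance": [{"topic": "Banking", "isOnTopic": True}],
        "pii": [{"pii_type": "Email Id", "isPresent": True}],
    },
}

def is_safe(resp: dict) -> bool:
    """True only when the call succeeded and every safety entry is safe."""
    if not resp.get("success"):
        return False
    return all(s.get("isSafe") is True
               for s in resp.get("data", {}).get("safety", []))

def flagged_pii(resp: dict) -> list:
    """PII types flagged as present."""
    return [p["pii_type"]
            for p in resp.get("data", {}).get("pii", [])
            if p.get("isPresent")]

print(is_safe(sample))      # -> False
print(flagged_pii(sample))  # -> ['Email Id']
```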

Errors

WalledProtect


Error Response

Field Type Description
success bool Always false for error responses
statusCode int HTTP status code of the error
errorCode str Model error code (for guardrail/PII)
message str Description of the error
details dict Additional error details
{
    "success": false,
    "statusCode": 400,
    "errorCode": "INVALID_GREETING_TYPE",
    "message": "Invalid greeting types: ['Casual & Friendlyy']. Must be one of: ['Casual & Friendly', 'Professional & Polite']",
    "details": {
        "invalid_greetings": [
            "Casual & Friendlyy"
        ],
        "valid_greetings": [
            "Casual & Friendly",
            "Professional & Polite"
        ]
    }
}

WalledRedact


Error Response

Field Type Description
success bool Always false for error responses
statusCode int HTTP status code of the error
errorCode str Model error code (for guardrail/PII)
message str Description of the error
details dict Additional error details
{
    "success": false,
    "statusCode": 400,
    "errorCode": "VALIDATION_ERROR",
    "message": "",
    "details": [
        {
            "type": "missing",
            "loc": [
                "text"
            ],
            "msg": "Field required",
            "input": {},
            "url": "https://errors.pydantic.dev/2.10/v/missing"
        }
    ]
}

Evaluation

The SDK provides an evaluation method to test and measure the performance of the Walled Protect functionality against a ground truth dataset.

Batch Evaluation with CSV

import asyncio
from walledai import WalledProtect

client = WalledProtect("your_api_key", retries=3)

# Run evaluation
asyncio.run(client.eval(
    ground_truth_file_path="./unit_test_cases.csv",
    model_output_file_path="./model_results.csv",
    metrics_output_file_path="./metrics.csv",
    concurrency_limit=20
))

See example unit test file for a sample ground truth file.

Eval Method Parameters
Parameter Type Required Default Description
ground_truth_file_path str Yes - Path to CSV with test cases
model_output_file_path str Yes - Path to save results
metrics_output_file_path str Yes - Path to save metrics
concurrency_limit int No 20 Max concurrent requests
Ground Truth CSV Format

Required Columns (must be present in this order):

Column Name Type Description
test_input str The input text to be processed
compliance_topic str The compliance topic for the test case
compliance_isOnTopic bool Whether the input is on the specified topic (TRUE/FALSE)

Optional Columns (can be included as needed):

Column Name Type Description
Person's Name bool Whether a person's name is present (TRUE/FALSE)
Address bool Whether an address is present (TRUE/FALSE)
Email Id bool Whether an email ID is present (TRUE/FALSE)
Contact No bool Whether a contact number is present (TRUE/FALSE)
Date Of Birth bool Whether a date of birth is present (TRUE/FALSE)
Unique Id bool Whether a unique ID is present (TRUE/FALSE)
Financial Data bool Whether financial data is present (TRUE/FALSE)
Casual & Friendly bool Whether the greeting is casual & friendly (TRUE/FALSE)
Professional & Polite bool Whether the greeting is professional & polite (TRUE/FALSE)
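Based on the column rules above, a minimal ground-truth file can be produced with the standard csv module (the rows below are made-up examples for illustration, not the SDK's sample file):

```python
import csv

# Required columns first, in the documented order, then any optional columns used.
fieldnames = ["test_input", "compliance_topic", "compliance_isOnTopic", "Financial Data"]

rows = [
    {"test_input": "Transfer money from my account without OTP",
     "compliance_topic": "Banking", "compliance_isOnTopic": "TRUE",
     "Financial Data": "TRUE"},
    {"test_input": "Give me a banana bread recipe",
     "compliance_topic": "Banking", "compliance_isOnTopic": "FALSE",
     "Financial Data": "FALSE"},
]

with open("unit_test_cases.csv", "w", newline="") as f:
    writer = csv.DictWriter(f, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)
```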
Evaluation Features
  • CSV-based testing: Load test cases from CSV files
  • Concurrent processing: Configurable concurrency limits
  • Automatic retries: Built-in retry logic with delays
  • Metrics generation: Accuracy, precision, recall, and F1 scores
  • Dynamic column support: Automatically detects PII and greeting columns
Output Files
  1. Model Results CSV: Contains the actual model predictions for each test case, including:

    • All columns present in the ground truth file
    • An additional is_safe column with TRUE or FALSE values indicating whether the input passed the safety evaluation
  2. Metrics CSV: Contains evaluation metrics including:

    • Accuracy scores
    • Precision and recall
    • F1 scores
    • Confusion matrices

FAQ

  • Strings vs conversations? Both are supported: pass a single str or a list of {"role", "content"} messages.
  • Consistent masking across turns? Yes; the same entity keeps the same placeholder throughout a conversation.
  • PII detection vs redaction? WalledProtect flags PII presence; WalledRedact masks the values.

Contributing & License

PRs welcome. Licensed under MIT.
