Skip to main content

Inline PII, credential, and regulated-data classifier for AI agent I/O. Dual-layer: deterministic regex + optional semantic Guard. Library + CLI. Python port of @opena2a/aicomply.

Project description

aicomply (Python)

Inline PII, credential, and regulated-data classifier for AI agent I/O. Catch sensitive content before your agent forwards it to a hosted LLM.

Dual-layer by design: a deterministic regex layer (PII, credentials, controlled markings) that runs always, plus an optional semantic Guard layer when a local nanomind-daemon is reachable. The regex layer sees through common evasions — Unicode homoglyphs (NFKC), zero-width characters, intra-token whitespace, and bounded Base64 / URL-encoded payloads — by normalizing the input before matching.

This is the Python port of @opena2a/aicomply. It reproduces the TypeScript detection baseline against the same shared corpus (bench/corpus), so verdicts agree across languages.

Install

pip install aicomply

Try it (CLI)

No integration code required — point it at a file or pipe content in:

echo "My SSN is 123-45-6789" | aicomply scan
aicomply scan ./support-ticket.txt
cat transcript.log | aicomply scan --json

Exit codes make it a drop-in CI gate: 0 CLEAN, 1 findings present, 2 usage error.

Library API

from aicomply import comply

result = comply("Customer SSN is 516-81-3086, card 5544939082323438.")

print(result.verdict)                       # "VIOLATION"
for v in result.violations:
    print(v.type, v.value, v.confidence)     # SSN 516...86 0.95  (value is masked)

comply() returns a ComplyResult with:

  • verdict"CLEAN", "VIOLATION", or "DENY"
  • violations — each with type, masked value, confidence, classifier, view (which content view caught it), and best-effort original_start/end
  • original_content / normalized_content / normalizations — an audit trail (omitted on DENY, where the input is treated as untrusted bytes)
  • .to_dict() — camelCase JSON wire-compatible with the npm package

Empty string short-circuits to CLEAN; non-str input raises TypeError.

Guard an agent's output

Drop one decorator above any function that emits text bound for an LLM or a user:

from aicomply.integrations import guard_output, ComplianceViolation

@guard_output()                       # raise on any PII/credential egress
def answer(user_msg: str) -> str:
    return call_llm(user_msg)

@guard_output(on_violation="redact")  # or mask findings in place
def answer_redacted(user_msg: str) -> str:
    return call_llm(user_msg)

guard_io() additionally scans string inputs on the way in.

LangChain

pip install 'aicomply[langchain]'
from langchain_openai import ChatOpenAI
from aicomply.integrations.langchain import AIComplyCallbackHandler

llm = ChatOpenAI(callbacks=[AIComplyCallbackHandler()])
llm.invoke("Summarize this support ticket: ...")   # raises if the LLM emits PII

Semantic Guard layer

The regex layer is deterministic and always on. For prompt-injection / exfiltration patterns that regex cannot see, run a local nanomind-daemon; when it is reachable on 127.0.0.1:47200 the dual-layer classifier consults it and merges the verdict (highest severity wins). Its absence never fails a request — the classifier silently falls back to regex-only.

Detection classes

SSN, PAN (Luhn + IIN), credentials (AWS keys, GitHub tokens, Bearer tokens, api_key= patterns), CUI / controlled markings, IBAN (mod-97), passport numbers, MRN, NPI (Luhn with 80840 prefix).

Scope

This port covers the deterministic detection layer (regex + normalization + dual-layer merge + verdict) plus the daemon Guard client. The TypeScript package's Registry-L2, ARP-signature, policy-pack, and session-vault features are not yet ported.

License

Apache-2.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aicomply-0.1.0.tar.gz (25.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aicomply-0.1.0-py3-none-any.whl (33.9 kB view details)

Uploaded Python 3

File details

Details for the file aicomply-0.1.0.tar.gz.

File metadata

  • Download URL: aicomply-0.1.0.tar.gz
  • Upload date:
  • Size: 25.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aicomply-0.1.0.tar.gz
Algorithm Hash digest
SHA256 bf01b88625560458bf82e503e14c8dae90af5d1a7d571afc25b5f19640fdd587
MD5 92673b75a1f31d303b3030a0538722cd
BLAKE2b-256 f140c8c463de9f1e8da2704fd089faa51ed3c3181c1eced7e546c0193bf00752

See more details on using hashes here.

Provenance

The following attestation bundles were made for aicomply-0.1.0.tar.gz:

Publisher: release-python.yml on opena2a-org/aicomply

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aicomply-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: aicomply-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 33.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aicomply-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 72fb356b69560d3a294250d443953b993cee1153e3ecde727c004238d1caa68a
MD5 3e456d8931ff380fc8b4e55271c71080
BLAKE2b-256 edef7f65927d4beb38164466292402b7881d393e5d6aafdb0c65ef52eb15de6e

See more details on using hashes here.

Provenance

The following attestation bundles were made for aicomply-0.1.0-py3-none-any.whl:

Publisher: release-python.yml on opena2a-org/aicomply

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page