Lightning-fast PII detection and anonymization library with 190x performance advantage
Project description
DataFog: PII Detection & Anonymization
Fast processing • Production-ready • Simple configuration
Overview
DataFog provides efficient PII detection using a pattern-first approach that processes text significantly faster than traditional NLP methods while maintaining high accuracy.
```python
# Basic usage example
from datafog import DataFog

results = DataFog().scan_text("John's email is john@example.com and SSN is 123-45-6789")
```
Performance Comparison
| Engine | 10KB Text Processing | Relative Speed | Accuracy |
|---|---|---|---|
| DataFog (Regex) | ~2.4ms | 190x faster | High (structured) |
| DataFog (GLiNER) | ~15ms | 32x faster | Very High |
| DataFog (Smart) | ~3-15ms | 60x faster | Highest |
| spaCy | ~459ms | baseline | Good |
Performance measured on a 13.3 KB business document. GLiNER provides excellent accuracy for named entities while maintaining a clear speed advantage over spaCy.
Supported PII Types
| Type | Examples | Use Cases |
|---|---|---|
| Email | john@company.com | Contact scrubbing |
| Phone | (555) 123-4567 | Call log anonymization |
| SSN | 123-45-6789 | HR data protection |
| Credit Cards | 4111-1111-1111-1111 | Payment processing |
| IP Addresses | 192.168.1.1 | Network log cleaning |
| Dates | 01/01/1990 | Birthdate removal |
| ZIP Codes | 12345-6789 | Location anonymization |
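Detection can also be narrowed to just the types relevant to a workflow. Below is a minimal sketch using the entities option shown in the Anonymization Options section further down; the exact entity labels are assumed to match that section, and the output format may vary by version.

```python
from datafog import DataFog

# Redact only the types you care about; other findings are left alone.
# Entity labels are assumed to match the Anonymization Options section.
hr_redactor = DataFog(operations=["scan", "redact"], entities=["EMAIL", "SSN"])

text = "Jane Doe, jane@corp.com, SSN 123-45-6789, ZIP 12345-6789"
print(hr_redactor.process_text(text))
# Expected: the email and SSN are redacted, the ZIP code is kept.
```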
Quick Start
Installation
```bash
# Lightweight core (fast regex-based PII detection)
pip install datafog

# With advanced ML models for better accuracy
pip install datafog[nlp]           # spaCy for advanced NLP
pip install datafog[nlp-advanced]  # GLiNER for modern NER
pip install datafog[ocr]           # Image processing with OCR
pip install datafog[all]           # Everything included
```
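Not sure which extras are present in a given environment? A quick check like the one below can help; the underlying package names (spacy, gliner, pytesseract) are assumptions about what each extra installs.

```python
import importlib.util

# Report which optional engines are importable in this environment.
# The module names here are assumptions about what each extra pulls in.
for extra, module in [("nlp", "spacy"), ("nlp-advanced", "gliner"), ("ocr", "pytesseract")]:
    status = "installed" if importlib.util.find_spec(module) else "missing"
    print(f"datafog[{extra}] -> {module}: {status}")
```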
Basic Usage
Detect PII in text:
```python
from datafog import DataFog

# Simple detection (uses fast regex engine)
detector = DataFog()
text = "Contact John Doe at john.doe@company.com or (555) 123-4567"
results = detector.scan_text(text)
print(results)
# Finds: emails, phone numbers, and more

# Modern NER with GLiNER (requires: pip install datafog[nlp-advanced])
from datafog.services import TextService

gliner_service = TextService(engine="gliner")
result = gliner_service.annotate_text_sync("Dr. John Smith works at General Hospital")
# Detects: PERSON, ORGANIZATION with high accuracy

# Best of both worlds: smart cascading (recommended for production)
smart_service = TextService(engine="smart")
result = smart_service.annotate_text_sync("Contact john@company.com or call (555) 123-4567")
# Uses regex for structured PII (fast), GLiNER for entities (accurate)
```
Anonymize on the fly:
```python
# Redact sensitive data
redacted = DataFog(operations=["scan", "redact"]).process_text(
    "My SSN is 123-45-6789 and email is john@example.com"
)
print(redacted)
# Output: "My SSN is [REDACTED] and email is [REDACTED]"

# Replace with fake data
replaced = DataFog(operations=["scan", "replace"]).process_text(
    "Call me at (555) 123-4567"
)
print(replaced)
# Output: "Call me at [PHONE_A1B2C3]"
```
Process images with OCR:
```python
import asyncio
from datafog import DataFog

async def scan_document():
    ocr_scanner = DataFog(operations=["extract", "scan"])
    results = await ocr_scanner.run_ocr_pipeline([
        "https://example.com/document.png"
    ])
    return results

# Extract text and find PII in images
results = asyncio.run(scan_document())
```
Advanced Features
Engine Selection
Choose the appropriate engine for your needs:
```python
from datafog.services import TextService

# Regex: Fast, pattern-based (recommended for speed)
regex_service = TextService(engine="regex")

# spaCy: Traditional NLP with broad entity recognition
spacy_service = TextService(engine="spacy")

# GLiNER: Modern ML model optimized for NER (requires nlp-advanced extra)
gliner_service = TextService(engine="gliner")

# Smart: Cascading approach - regex → GLiNER → spaCy (best accuracy/speed balance)
smart_service = TextService(engine="smart")

# Auto: Regex → spaCy fallback (legacy)
auto_service = TextService(engine="auto")
```
Performance & Accuracy Guide:
| Engine | Speed | Accuracy | Use Case | Install Requirements |
|---|---|---|---|---|
| regex | 🚀 Fastest | Good | Structured PII (emails, phones) | Core only |
| gliner | ⚡ Fast | Better | Modern NER, custom entities | pip install datafog[nlp-advanced] |
| spacy | 🐌 Slower | Good | Traditional NLP entities | pip install datafog[nlp] |
| smart | ⚡ Balanced | Best | Combines all approaches | pip install datafog[nlp-advanced] |
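In services where the ML extras may or may not be installed, the engine choice can be made configurable with a regex fallback. The helper and the DATAFOG_ENGINE variable below are illustrative conventions, not part of the DataFog API, and the sketch assumes TextService raises when a selected engine's dependencies are missing.

```python
import os
from datafog.services import TextService

def build_text_service() -> TextService:
    """Pick the engine from configuration, falling back to regex."""
    preferred = os.getenv("DATAFOG_ENGINE", "smart")
    try:
        return TextService(engine=preferred)
    except Exception:
        # GLiNER/spaCy models may be unavailable; regex only needs the core install.
        return TextService(engine="regex")

service = build_text_service()
print(service.annotate_text_sync("Contact john@company.com"))
```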
Model Management:
```bash
# Download specific GLiNER models

# PII-specialized model (recommended)
datafog download-model urchade/gliner_multi_pii-v1 --engine gliner

# General-purpose model
datafog download-model urchade/gliner_base --engine gliner

# List available models
datafog list-models --engine gliner
```
Anonymization Options
```python
from datafog import DataFog
from datafog.models.anonymizer import AnonymizerType, HashType

# Hash with different algorithms
hasher = DataFog(
    operations=["scan", "hash"],
    hash_type=HashType.SHA256  # or MD5, SHA3_256
)

# Target specific entity types only
selective = DataFog(
    operations=["scan", "redact"],
    entities=["EMAIL", "PHONE"]  # Only process these types
)
```
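A quick usage sketch for the hashing configuration above; the exact output format is version-dependent, but detected values are replaced by their hashes rather than removed.

```python
from datafog import DataFog
from datafog.models.anonymizer import HashType

hasher = DataFog(operations=["scan", "hash"], hash_type=HashType.SHA256)

# Hashing keeps fields pseudonymous rather than removing them outright.
print(hasher.process_text("Email: john@company.com"))
```

If the hashing is unsalted, equal inputs produce equal hashes, so pseudonymized fields can still be joined or de-duplicated across records.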
Batch Processing
```python
from datafog import DataFog

documents = [
    "Document 1 with PII...",
    "Document 2 with more data...",
    "Document 3...",
]

# Process multiple documents efficiently
results = DataFog().batch_process(documents)
```
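For corpora too large to hold in memory, input can be streamed in fixed-size chunks. This sketch assumes batch_process returns one result per input document, in order; app.log is a placeholder path.

```python
from datafog import DataFog

def process_in_chunks(lines, chunk_size=1000):
    """Yield per-document results chunk by chunk instead of loading everything."""
    detector = DataFog()
    chunk = []
    for line in lines:
        chunk.append(line)
        if len(chunk) == chunk_size:
            yield from detector.batch_process(chunk)
            chunk = []
    if chunk:
        yield from detector.batch_process(chunk)

# Example: stream a large log file line by line.
with open("app.log", encoding="utf-8") as fh:
    for result in process_in_chunks(fh):
        ...  # persist or inspect each document's findings
```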
Performance Benchmarks
Performance comparison with alternatives:
Speed Comparison (10KB text)
```
DataFog Pattern:   4ms  ████████████████████████████████  123x faster
spaCy:           480ms  ██                                 baseline
```
Engine Selection Guide
| Scenario | Recommended Engine | Why |
|---|---|---|
| High-volume processing | regex | Maximum speed, consistent performance |
| Unknown entity types | spacy | Broader entity recognition |
| General purpose | auto | Smart fallback, best of both worlds |
| Real-time applications | regex | Sub-millisecond processing |
CLI Usage
DataFog includes a command-line interface:
```bash
# Scan text for PII
datafog scan-text "John's email is john@example.com"

# Process images
datafog scan-image document.png --operations extract,scan

# Anonymize data
datafog redact-text "My phone is (555) 123-4567"
datafog replace-text "SSN: 123-45-6789"
datafog hash-text "Email: john@company.com" --hash-type sha256

# Utility commands
datafog health
datafog list-entities
datafog show-config
```
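The same commands can be driven from scripts. A minimal sketch that shells out to the CLI shown above, assuming the redacted text is written to stdout:

```python
import subprocess

def redact_via_cli(text: str) -> str:
    """Run `datafog redact-text` and return its stdout."""
    completed = subprocess.run(
        ["datafog", "redact-text", text],
        capture_output=True,
        text=True,
        check=True,
    )
    return completed.stdout.strip()

print(redact_via_cli("My phone is (555) 123-4567"))
```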
Features
Security & Compliance
- Detection of regulated data types for GDPR/CCPA compliance
- Audit trails for tracking detection and anonymization
- Configurable detection thresholds
Scalability
- Batch processing for handling multiple documents
- Memory-efficient processing for large files
- Async support for non-blocking operations
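If an application is already async, the synchronous scanner can be kept off the event loop with a thread offload (Python 3.9+). This is a sketch using only scan_text from the core API:

```python
import asyncio
from datafog import DataFog

detector = DataFog()

async def scan_async(text: str):
    # Run the synchronous scanner in a worker thread so the event loop
    # stays responsive while documents are processed.
    return await asyncio.to_thread(detector.scan_text, text)

async def main():
    results = await asyncio.gather(
        scan_async("Email a@example.com"),
        scan_async("Call (555) 123-4567"),
    )
    print(results)

asyncio.run(main())
```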
Integration Example
```python
# FastAPI middleware example
from fastapi import FastAPI
from datafog import DataFog

app = FastAPI()
detector = DataFog()

@app.middleware("http")
async def redact_pii_middleware(request, call_next):
    # Inspect/redact request data here before it reaches your handlers.
    response = await call_next(request)
    return response
```
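A more concrete pattern is to redact at the endpoint level, where the request body is easy to read and rewrite. The /redact route and its payload model are illustrative, not part of DataFog:

```python
from fastapi import FastAPI
from pydantic import BaseModel

from datafog import DataFog

app = FastAPI()
redactor = DataFog(operations=["scan", "redact"])

class TextIn(BaseModel):
    text: str

@app.post("/redact")
async def redact(payload: TextIn):
    # Return the anonymized text instead of the original.
    return {"redacted": redactor.process_text(payload.text)}
```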
Common Use Cases
Enterprise
- Log sanitization
- Data migration with PII handling
- Compliance reporting and audits
Data Science
- Dataset preparation and anonymization
- Privacy-preserving analytics
- Research compliance
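For dataset preparation, a common pattern is to anonymize a free-text column before sharing it. A sketch assuming pandas is available and that process_text returns the redacted string:

```python
import pandas as pd
from datafog import DataFog

redactor = DataFog(operations=["scan", "redact"])

df = pd.DataFrame({"notes": [
    "Customer john@example.com called from (555) 123-4567",
    "Follow-up: SSN 123-45-6789 on file",
]})

# Replace the free-text column with a redacted copy before export.
df["notes"] = df["notes"].apply(redactor.process_text)
print(df)
```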
Development
- Test data generation
- Code review for PII detection
- API security validation
Installation & Setup
Basic Installation
```bash
pip install datafog
```
Development Setup
```bash
git clone https://github.com/datafog/datafog-python
cd datafog-python
python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
pip install -r requirements-dev.txt
just setup
```
Docker Usage
```dockerfile
FROM python:3.10-slim
WORKDIR /app
RUN pip install datafog
COPY . .
CMD ["python", "your_script.py"]
```
Contributing
Contributions are welcome in the form of:
- Bug reports
- Feature requests
- Documentation improvements
- New regex patterns for PII detection
- Performance improvements
Quick Contribution Guide
```bash
# Setup development environment
git clone https://github.com/datafog/datafog-python
cd datafog-python
just setup

# Run tests
just test

# Format code
just format

# Submit PR
git checkout -b feature/your-improvement
# Make your changes
git commit -m "Add your improvement"
git push origin feature/your-improvement
```
See CONTRIBUTING.md for detailed guidelines.
Benchmarking & Performance
Run Benchmarks Locally
```bash
# Install benchmark dependencies
pip install pytest-benchmark

# Run performance tests
pytest tests/benchmark_text_service.py -v

# Compare with baseline
bash scripts/run_benchmark_locally.sh
```
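A benchmark for your own corpus can be added in the same style. This sketch uses the standard pytest-benchmark fixture; the file path and sample text are illustrative:

```python
# tests/benchmark_my_corpus.py
from datafog import DataFog

SAMPLE = "Contact John Doe at john.doe@company.com or (555) 123-4567. " * 100

def test_scan_text_speed(benchmark):
    detector = DataFog()
    # pytest-benchmark runs the callable repeatedly and records timing stats.
    result = benchmark(detector.scan_text, SAMPLE)
    assert result is not None
```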
Continuous Performance Monitoring
Our CI pipeline:
- Runs benchmarks on every PR
- Compares against baseline performance
- Fails builds if performance degrades >10%
- Tracks performance trends over time
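The >10% gate can be approximated locally by comparing two pytest-benchmark JSON exports (saved with --benchmark-json). compare_benchmarks.py is a hypothetical helper, not part of the repository:

```python
# compare_benchmarks.py baseline.json current.json  (hypothetical helper)
import json
import sys

def mean_times(path):
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)
    return {b["name"]: b["stats"]["mean"] for b in data["benchmarks"]}

baseline = mean_times(sys.argv[1])
current = mean_times(sys.argv[2])

failed = False
for name, base_mean in baseline.items():
    if name in current and current[name] > base_mean * 1.10:
        print(f"REGRESSION: {name} is >10% slower than baseline")
        failed = True

sys.exit(1 if failed else 0)
```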
Documentation & Support
| Resource | Link |
|---|---|
| Documentation | docs.datafog.ai |
| Community Discord | Join here |
| Bug Reports | GitHub Issues |
| Feature Requests | GitHub Discussions |
| Support | hi@datafog.ai |
License & Acknowledgments
DataFog is released under the MIT License.
Built with:
- Pattern optimization for efficient processing
- spaCy integration for NLP capabilities
- Tesseract & Donut for OCR capabilities
- Pydantic for data validation
Project details
Download files
Source Distribution
Built Distribution
File details
Details for the file datafog-4.2.0.tar.gz.
File metadata
- Download URL: datafog-4.2.0.tar.gz
- Upload date:
- Size: 61.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 967b3e076d1762322ca3187cc97513ead4d2f080ed6c4d2725255221b02fb7d8 |
| MD5 | a22c04fe9f4c64545edadccbd8b8238f |
| BLAKE2b-256 | 762fe86d6ac3a11f943d7b0cd763826cfc483a632de23c53257400b7378d9737 |
File details
Details for the file datafog-4.2.0-py3-none-any.whl.
File metadata
- Download URL: datafog-4.2.0-py3-none-any.whl
- Upload date:
- Size: 54.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 48534790ab4b0ecd551a7c312b62163a0b97cd1a9dc1e3e2d69e7df183fff99b |
| MD5 | a1188ba055b560730b4b93eacfddfb4a |
| BLAKE2b-256 | edca4319ba57d5693d66a71c73ec5b426b8d770417bb0c26f65b61a066609f25 |