PIIFILL: Professional Local-Logic PII Sanitization CLI

🛡️ PIIFILL CLI v0.1.8

Secure your data. Mask sensitive information. Stay fully offline.

Built with precision by Bhavin Sachaniya

Overview | Installation | Basic Usage | Security Analytics | Supported Formats


📖 Overview

PIIFILL is a high-performance command-line utility designed to automatically detect and redact Personally Identifiable Information (PII) from your documents, datasets, and images.

Whether you are a developer sanitizing logs, a researcher preparing datasets, or a professional sharing sensitive files, PIIFILL provides a "security-first" workflow to ensure privacy compliance.

[!IMPORTANT] 100% Offline Processing: PIIFILL is built for privacy. All detection, masking, and OCR processing happen locally on your machine. Your data is never uploaded to any cloud service.

🛡️ What PII can it detect?

  • 📱 Personal Identifiers: Phone numbers, Email addresses.
  • 📄 Government IDs: Social Security Numbers (SSN), Aadhaar Card details.
  • 💳 Financial Data: Credit/Debit card numbers.
  • 🏠 Location Info: Physical addresses and ZIP codes.
  • ...and much more through intelligent pattern matching.
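
As a rough illustration of what pattern-based PII detection can look like, the sketch below combines Python regular expressions with a Luhn checksum for card numbers. The patterns and the names `PATTERNS`, `luhn_valid`, and `find_pii` are assumptions made for this example; they are not PIIFILL's actual detection rules or API.

```python
import re
from typing import List, Tuple

# Illustrative patterns only (assumed, not PIIFILL's own rules).
PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    # 13 to 19 digits, optionally separated by single spaces or hyphens.
    "CARD": re.compile(r"\b\d(?:[ -]?\d){12,18}\b"),
}


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    total = 0
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return len(digits) >= 13 and total % 10 == 0


def find_pii(text: str) -> List[Tuple[str, str]]:
    """Return (label, matched_text) pairs, grouped in pattern order."""
    findings = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            value = match.group(0)
            # Card-like digit runs are kept only if they pass the checksum.
            if label == "CARD" and not luhn_valid(value):
                continue
            findings.append((label, value))
    return findings
```

The checksum step shows one common way to cut false positives: a digit run that merely looks like a card number is dropped unless it also validates.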

🚀 Quick Start

Installation

Ensure you have Python 3.8+ installed. You can install PIIFILL directly via pip:

pip install piifill-cli==0.1.8

🛠️ Usage Guide

PIIFILL follows a simple two-phase workflow: scan (to identify PII) and mask (to protect it).

1. Identify Privacy Risks (scan)

Use the scan command to audit your files. This is a read-only operation that provides a detailed report of potential PII without modifying your source files.

# Scan a single document
piifill scan sensitive_data.pdf

# Perform a deep search in a folder (recursive)
piifill scan ./private_docs/ --recursive
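
The difference between a flat and a recursive scan can be pictured as a read-only directory walk. The sketch below is a minimal illustration in Python; `iter_files` is a hypothetical helper, and the assumption that a non-recursive scan looks only at top-level files is not confirmed by the PIIFILL documentation.

```python
from pathlib import Path
from typing import Iterator


def iter_files(target: str, recursive: bool = False) -> Iterator[Path]:
    """Yield the files a scan would visit, without opening or modifying them."""
    root = Path(target)
    if root.is_file():
        yield root
        return
    # rglob descends into subfolders; iterdir stays at the top level.
    candidates = root.rglob("*") if recursive else root.iterdir()
    for path in sorted(candidates):
        if path.is_file():
            yield path
```

Calling `list(iter_files("./private_docs/", recursive=True))` would list every file under the folder, while the default call stops at the first level.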

2. Protect Your Files (mask)

Once you have reviewed the scan report, use mask to generate sanitized versions of your files. By default, it creates an out/ directory containing the protected copies.

# Mask a single file
piifill mask user_records.csv

# Mask all files in a directory
piifill mask ./raw_logs/

📂 Working with Folders

Want to clean up an entire folder of data? PIIFILL makes it easy.

Example: Mask every file in a folder

piifill mask ./data_dump/

  • PIIFILL will scan every file in ./data_dump/.
  • It will create a new folder called ./data_dump/out/.
  • All your safe, cleaned-up files will be waiting for you inside the out folder!

Example: Save the safe files somewhere specific

piifill mask ./private_files/ -o ./safe_backup/

  • This takes everything from private_files and puts the safe versions in safe_backup.
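
The output-location rule described above can be sketched in a few lines of Python. `resolve_output_dir` is a hypothetical helper written for this example; placing out/ next to a single input file is an assumption, since the documentation only shows the folder case.

```python
from pathlib import Path
from typing import Optional


def resolve_output_dir(source: str, output: Optional[str] = None) -> Path:
    """Pick the folder where masked copies would be written."""
    if output is not None:
        # An explicit -o value takes priority over the default.
        return Path(output)
    src = Path(source)
    # Documented for folders: out/ inside the input folder.
    # Assumed for single files: out/ beside the input file.
    base = src if src.is_dir() else src.parent
    return base / "out"
```

With this rule, `resolve_output_dir("./private_files/", "./safe_backup/")` resolves to `safe_backup`, and an existing `./data_dump/` folder with no `-o` value resolves to `data_dump/out`.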

⚙️ Command Reference

| Command | Description | Key Options |
| --- | --- | --- |
| scan | Detects PII and generates a risk report. | --recursive, --format |
| mask | Redacts PII and creates safe file copies. | -o (output), --mode |
| config | Displays current PIIFILL configuration. | N/A |
| version | Displays version and environment info. | N/A |

🎭 Masking Modes

You can customize how PII is hidden using the --mode flag:

  • mask (Default): Replaces data with descriptive placeholders (e.g., [REDACTED]).
  • redact: Completely removes the sensitive data from the file.
  • tokenize: Replaces data with unique, trackable tokens (e.g., <EMAIL_123>).
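
To make the three modes concrete, here is a minimal Python sketch that applies each one to email addresses only. The function `apply_mode`, the email pattern, the sequential token numbering, and the choice to reuse a token for a repeated value are all assumptions for illustration; PIIFILL's real placeholders and token scheme may differ.

```python
import re
from itertools import count

# Illustrative email pattern (assumed, not PIIFILL's own rule).
EMAIL = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
MODES = {"mask", "redact", "tokenize"}


def apply_mode(text: str, mode: str = "mask") -> str:
    """Rewrite every email match according to the chosen mode."""
    if mode not in MODES:
        raise ValueError(f"unknown mode: {mode}")
    counter = count(1)
    tokens = {}

    def replace(match):
        value = match.group(0)
        if mode == "mask":
            return "[REDACTED]"  # descriptive placeholder
        if mode == "redact":
            return ""  # remove the value entirely
        # tokenize: the same value maps to the same token so it stays trackable
        if value not in tokens:
            tokens[value] = f"<EMAIL_{next(counter)}>"
        return tokens[value]

    return EMAIL.sub(replace, text)
```

For the input `"a@x.io wrote to b@y.org"`, the sketch produces `"[REDACTED] wrote to [REDACTED]"` in mask mode and `"<EMAIL_1> wrote to <EMAIL_2>"` in tokenize mode.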

📊 Security & Risk Analytics

PIIFILL doesn't just hide data. It also helps you understand your privacy posture through integrated analytics:

  • Security Grade: A standardized rating (A to F) based on PII density.
  • Risk Score (0-100): A quantitative metric representing the severity of data exposure.
  • Frequency Analysis: A detailed breakdown of detected entities (e.g., "5 Credit Cards, 12 Emails found").
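
The documentation does not state how these metrics are computed, so the following Python sketch only shows one plausible shape for them. The weights, the density thresholds, the A/B/C/D/F grade letters, and the function names are all assumptions, not PIIFILL's scoring formula.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical severity weights per entity type.
WEIGHTS = {"CARD": 10, "SSN": 10, "EMAIL": 3, "PHONE": 3}


def frequency(findings: List[Tuple[str, str]]) -> Dict[str, int]:
    """Count how many times each entity type was detected."""
    return dict(Counter(label for label, _ in findings))


def risk_score(counts: Dict[str, int]) -> int:
    """Weighted count of findings, capped at 100."""
    weighted = sum(WEIGHTS.get(label, 1) * n for label, n in counts.items())
    return min(100, weighted)


def security_grade(total_findings: int, total_chars: int) -> str:
    """Map PII density (findings per 1,000 characters) to a letter grade."""
    density = 1000 * total_findings / total_chars if total_chars else 0.0
    if density == 0:
        return "A"
    if density <= 1:
        return "B"
    if density <= 5:
        return "C"
    if density <= 10:
        return "D"
    return "F"
```

Under these assumed weights, a document with 5 credit cards and 12 emails would have a frequency table of `{"CARD": 5, "EMAIL": 12}` and a risk score of 86.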

📂 Supported File Formats

PIIFILL supports a wide range of formats, including advanced OCR support for image-based documents.

| Category | Extensions |
| --- | --- |
| Structured Data | .csv, .json, .sql, .xlsx |
| Documents | .txt, .pdf, .docx |
| Images (OCR) | .png, .jpg, .jpeg |

[!TIP] Image Detection: PIIFILL uses built-in OCR capabilities to detect and mask text hidden inside screenshots and scanned documents.
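
If you want to pre-sort files before running PIIFILL, the format table can be expressed as a small lookup. The Python sketch below is a hypothetical helper: `CATEGORIES` mirrors the table above, while `categorize` and its case-insensitive matching are assumptions for illustration.

```python
from pathlib import Path
from typing import Optional

# Extension groups copied from the supported-formats table.
CATEGORIES = {
    "Structured Data": {".csv", ".json", ".sql", ".xlsx"},
    "Documents": {".txt", ".pdf", ".docx"},
    "Images (OCR)": {".png", ".jpg", ".jpeg"},
}


def categorize(filename: str) -> Optional[str]:
    """Return the format category for a file, or None if it is not listed."""
    suffix = Path(filename).suffix.lower()
    for category, extensions in CATEGORIES.items():
        if suffix in extensions:
            return category
    return None
```

For example, `categorize("user_records.csv")` returns `"Structured Data"`, and a file such as `notes.md` returns `None` because its extension is not in the table.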


👤 Author

Bhavin Sachaniya


📜 License

This project is licensed under the MIT License. See the LICENSE file for details.
