Find PII data in databases
Project description
PII Catcher for Files and Databases
Overview
PIICatcher is a data catalog and scanner for PII and PHI information. It finds PII data in your databases and file systems and tracks critical data. The data catalog can be used as a foundation to build governance, compliance and security applications.
Check out AWS Glue & Lake Formation Privilege Analyzer for an example of how piicatcher is used in production.
Quick Start
PIICatcher is available as a command-line application.
To install use pip:
python3 -m venv .env
source .env/bin/activate
pip install piicatcher
# Install Spacy English package
python -m spacy download en_core_web_sm
# run piicatcher on a sqlite db and print report to console
piicatcher db -c '/db/sqlqb'
╭─────────────┬─────────────┬─────────────┬─────────────╮
│ schema │ table │ column │ has_pii │
├─────────────┼─────────────┼─────────────┼─────────────┤
│ main │ full_pii │ a │ 1 │
│ main │ full_pii │ b │ 1 │
│ main │ no_pii │ a │ 0 │
│ main │ no_pii │ b │ 0 │
│ main │ partial_pii │ a │ 1 │
│ main │ partial_pii │ b │ 0 │
╰─────────────┴─────────────┴─────────────┴─────────────╯
Supported Technologies
PIICatcher supports the following filesystems:
- POSIX
- AWS S3 (for files that are part of tables in AWS Glue and AWS Athena)
- Google Cloud Storage (Coming Soon)
- ADLS (Coming Soon)
PIICatcher supports the following databases:
- Sqlite3 v3.24.0 or greater
- MySQL 5.6 or greater
- PostgreSQL 9.4 or greater
- AWS Redshift
- Oracle
- AWS Glue/AWS Athena
Documentation
For advanced usage refer documentation PIICatcher Documentation.
Contributing
For Contribution guidelines, PIICatcher Developer documentation.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for piicatcher-0.9.6-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e990e71b294edad32fe0eaa0f42087d9b219101c31c867c293823022203e7fe6 |
|
MD5 | 28a3b5e431e3a664e7f7903f578ba75d |
|
BLAKE2b-256 | 5d416aa3d18034f0d922175f188634f41eefef283868f71ffb941867386a88ed |