Find PII data in databases
PiiCatcher for Files and Databases
Overview
PiiCatcher finds PII (personally identifiable information) in your data. It scans every column in your databases and the files in your storage, and reports which ones contain PII.
Quick Start
PiiCatcher is available as a command-line application.
To install, use pip:
python3 -m venv .env
source .env/bin/activate
pip install piicatcher
# Install the spaCy English model
python -m spacy download en_core_web_sm
# Run piicatcher on a SQLite database and print the report to the console
piicatcher db -c '/db/sqlqb'
╭─────────────┬─────────────┬─────────────┬─────────────╮
│ schema      │ table       │ column      │ has_pii     │
├─────────────┼─────────────┼─────────────┼─────────────┤
│ main        │ full_pii    │ a           │ 1           │
│ main        │ full_pii    │ b           │ 1           │
│ main        │ no_pii      │ a           │ 0           │
│ main        │ no_pii      │ b           │ 0           │
│ main        │ partial_pii │ a           │ 1           │
│ main        │ partial_pii │ b           │ 0           │
╰─────────────┴─────────────┴─────────────┴─────────────╯
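To try the scan on a throwaway database, something like the following sketch could build a SQLite file with the same tables and columns as the report above. Only the table names (`full_pii`, `no_pii`, `partial_pii`) and column names (`a`, `b`) come from the report. The helper names `build_sample_db` and `list_columns`, the column types, and every sample value are assumptions made for illustration. Whether PiiCatcher flags these particular values is not guaranteed.

```python
import sqlite3


def build_sample_db(path):
    """Create a small SQLite database shaped like the report above.

    Table and column names mirror the report; the rows are hypothetical.
    """
    conn = sqlite3.connect(path)
    try:
        with conn:  # commits on success, rolls back on error
            for table in ("full_pii", "no_pii", "partial_pii"):
                conn.execute(f"DROP TABLE IF EXISTS {table}")
                conn.execute(f"CREATE TABLE {table} (a TEXT, b TEXT)")
            # Hypothetical sample values, chosen only for illustration.
            conn.execute(
                "INSERT INTO full_pii VALUES (?, ?)",
                ("Jane Doe", "jane.doe@example.com"),
            )
            conn.execute("INSERT INTO no_pii VALUES (?, ?)", ("abc", "def"))
            conn.execute(
                "INSERT INTO partial_pii VALUES (?, ?)", ("John Smith", "xyz")
            )
    finally:
        conn.close()


def list_columns(path):
    """Return (table, column) pairs, ordered by table name then column position."""
    conn = sqlite3.connect(path)
    try:
        tables = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
            )
        ]
        # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk).
        return [
            (table, row[1])
            for table in tables
            for row in conn.execute(f"PRAGMA table_info({table})")
        ]
    finally:
        conn.close()
```

Calling `build_sample_db("sample.db")` writes the file, and `list_columns("sample.db")` lists the columns a scan would be expected to visit. The path can then be passed to the `piicatcher db` command shown above.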
Supported Technologies
PiiCatcher supports the following filesystems:
- POSIX
- AWS S3 (for files that are part of tables in AWS Glue and AWS Athena)
- Google Cloud Storage (Coming Soon)
- ADLS (Coming Soon)
PiiCatcher supports the following databases:
- SQLite 3.24.0 or greater
- MySQL 5.6 or greater
- PostgreSQL 9.4 or greater
- AWS Redshift
- SQL Server
- Oracle
- AWS Glue/AWS Athena
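The SQLite minimum version in the list above can be checked from Python. The sketch below compares the SQLite library linked into the running interpreter against 3.24.0. It assumes that this is the library used to read the database, which the text above does not confirm. The helper name `sqlite_is_supported` is hypothetical and is not part of PiiCatcher.

```python
import sqlite3

# Minimum SQLite version taken from the supported-databases list.
MIN_SQLITE = (3, 24, 0)


def sqlite_is_supported(version_info=None):
    """Return True if the given (or the linked) SQLite version is >= 3.24.0."""
    if version_info is None:
        # Version of the SQLite library linked into this Python build.
        version_info = sqlite3.sqlite_version_info
    return tuple(version_info) >= MIN_SQLITE


if __name__ == "__main__":
    print(sqlite3.sqlite_version, sqlite_is_supported())
```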
Documentation
For advanced usage, refer to the documentation on the project website.
Contributing
For contribution guidelines, refer to the developer documentation.
Download files
Source Distribution
piicatcher-0.7.1.tar.gz (13.0 kB)
Built Distribution
piicatcher-0.7.1-py2.py3-none-any.whl
Hashes for piicatcher-0.7.1-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 7566ca2dc92bf3b825362efd8bf4984e1d2b4b3995cd15e226a3e0425671eafc
MD5 | 36834cdba3b420d24c3cdef7bc96d7f5
BLAKE2b-256 | 341cd32c50882fa2969affe162df5322fee9875ff53442dee3f16882ba1c60d8
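A downloaded wheel can be checked against the published SHA256 digest before it is installed. The following is a minimal sketch using Python's standard `hashlib`. The digest is the one listed above for piicatcher-0.7.1-py2.py3-none-any.whl. The helper names `sha256_of` and `matches_published_hash` are hypothetical, and the location of the downloaded file is up to you.

```python
import hashlib

# SHA256 digest published above for piicatcher-0.7.1-py2.py3-none-any.whl.
EXPECTED_SHA256 = "7566ca2dc92bf3b825362efd8bf4984e1d2b4b3995cd15e226a3e0425671eafc"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected digest."""
    return sha256_of(path) == expected.lower()
```

For example, `matches_published_hash("piicatcher-0.7.1-py2.py3-none-any.whl")` returns `True` only if the file in the current directory is byte-for-byte the published wheel.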