Database anonymizer package
Project description
PGAnonymizer
PGAnonymizer is a tool for anonymizing database for testing purposes.
Installation
PGAnonymizer required Python 3.5 and above to run. PGAnonymizer can be installed using pip.
$ pip install pganonymizer
Background
PGAnonymizer will hash columns in table depending on the rules inside CSJ file.
CSJ file format:
"table", "column", "rule"
"schema.table_to_be_hased", "columned_to_be_hased", "hash"
"schema.table_to_be_hased", "columned_to_be_hased", "date"
"schema.table_to_be_hased", "columned_to_be_hased", "phone"
Example for rules for hashing
Hash
is for text type field such as name or address. The return value will be in base32 with length of atmost 16.
Example input:
Customer1
Example output:
EC35FEOXDMVR7Y5C
Date
is for date type field. The return value will keep the original year but hash the month and date field.
Example input:
2000-12-15
Example output:
2000-01-10
Phone
is for phone number. The return value will be of same length as the original number.
Example input:
+66932019385
Example output:
+66254818000
Instruction
Format:
$ pganonymizer [database environment] [option]
To use PGAnonymizer, you can call:
$ pganonymizer --schema path_to_schema_file.csj
By default, PGAnonymizer will connect to database using user's username as database and role.
Otherwise, please specify as follow:
$ pganonymizer -d postgres -U postgres --schema path_to_schema_file.csj
For more information on the usage, please use the following command:
$ pganonymizer --help
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pganonymizer-0.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8a48015ce228caa42fe2257e659753eb4d2ef38879221ca1832bc673ab8be556 |
|
MD5 | 43bf4725fb5d9919f63c21eac454623c |
|
BLAKE2b-256 | ce89afb9851f7e486cd6d8ca3a52af3435b01c15c35a8180b7f2d42f7805a08c |