Database anonymizer package
Project description
PGAnonymizer
PGAnonymizer is a tool for anonymizing database for testing purposes.
Installation
PGAnonymizer required Python 3.5 and above to run. PGAnonymizer can be installed using pip.
$ pip install pganonymizer
Background
PGAnonymizer will hash columns in table depending on the rules inside CSJ file.
CSJ file format:
"table", "column", "rule"
"schema.table_to_be_hased", "columned_to_be_hased", "hash"
"schema.table_to_be_hased", "columned_to_be_hased", "date"
"schema.table_to_be_hased", "columned_to_be_hased", "phone"
Example for rules for hashing
Hash
is for text type field such as name or address. The return value will be in base32 with length of atmost 16.
Example input:
Customer1
Example output:
EC35FEOXDMVR7Y5C
Date
is for date type field. The return value will keep the original year but hash the month and date field.
Example input:
2000-12-15
Example output:
2000-01-10
Phone
is for phone number. The return value will be of same length as the original number.
Example input:
+66932019385
Example output:
+66254818000
Instruction
Format:
$ pganonymizer [database environment] [option]
To use PGAnonymizer, you can call:
$ pganonymizer --schema path_to_schema_file.csj
By default, PGAnonymizer will connect to database using user's username as database and role.
Otherwise, please specify as follow:
$ pganonymizer -d postgres -U postgres --schema path_to_schema_file.csj
For more information on the usage, please use the following command:
$ pganonymizer --help
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pganonymizer-0.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d939197ef17d8d9aa4482c20ba9496f463fa1d299752df49fbde091f8b9a1f9c |
|
MD5 | a80547d3502e47fdee05d225e9abdd96 |
|
BLAKE2b-256 | 89423d8814c419a1e8b6e569fb325f6751a5a0d01d6c501ad6fdaaa38e904994 |