Database anonymizer package
Project description
PGAnonymizer
PGAnonymizer is a tool for anonymizing database for testing purposes.
Installation
PGAnonymizer required Python 3.5 and above to run. PGAnonymizer can be installed using pip.
$ pip install pganonymizer
Background
PGAnonymizer will hash columns in table depending on the rules inside CSJ file.
CSJ file format:
"table", "column", "rule"
"schema.table_to_be_hased", "columned_to_be_hased", "hash"
"schema.table_to_be_hased", "columned_to_be_hased", "date"
"schema.table_to_be_hased", "columned_to_be_hased", "phone"
Example for rules for hashing
Hash
is for text type field such as name or address. The return value will be in base32 with length of atmost 16.
Example input:
Customer1
Example output:
EC35FEOXDMVR7Y5C
Date
is for date type field. The return value will keep the original year but hash the month and date field.
Example input:
2000-12-15
Example output:
2000-01-10
Phone
is for phone number. The return value will be of same length as the original number.
Example input:
+66932019385
Example output:
+66254818000
Instruction
Format:
$ pganonymizer [database environment] [option]
To use PGAnonymizer, you can call:
$ pganonymizer --schema path_to_schema_file.csj
By default, PGAnonymizer will connect to database using user's username as database and role.
Otherwise, please specify as follow:
$ pganonymizer -d postgres -U postgres --schema path_to_schema_file.csj
For more information on the usage, please use the following command:
$ pganonymizer --help
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pganonymizer-0.0.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6c291df6643bea5f97f2d4e747cb989974ea75cb424189099f732b1e1bc354eb |
|
MD5 | bec1e356d98937c7e7a1f4ea930a7258 |
|
BLAKE2b-256 | e1b84a9cc0f817ce5595380a2598785dd0a785e3259d43f2c8cb6f8d45d97dc4 |