Skip to main content

grafton anonymises data in any input file using an encoding key and informed consent list.

Project description

Grafton

Downloads

Grafton is a GDPR anonymizer for any file using informed consent, encoding key and randomising numbers

It can anonymize various input files such as CSV, json, XML or other flat files. In line with the GDPR legislation, the mandatory fields have been anonymized, so that any detection of the subjects is prevented.

The encryption key for this anonymization process is provided in a separate file (CSV) and must be stored securely afterwards. In accordance with the GDPR, data is only retained when informed consent is given. Consent can be provided in a separate CSV file. The tracked numbers can also be randomized within a similar order of magnitude.

The use of this package does not guarantee compliance with the GDPR. This package only performs the steps described above.

This package is developed by the AI team at VIVES University of Applied Sciences and is used in our research on demand forecasting.


Grafton works with any extension, but is initially designed to anonymise the file flatfile.csv:

  1. with a list of replacement values in the key file: pseudonyms.csv
  2. retaining only users with consent in a list in the consent file: consent.csv

The anonymised file export.csv (any extension possible again) is then saved to the current directory.

The script has default names for all the lists and files to anonymise, but these can be altered if needed.

Getting started

Installation

  1. Install python3.7+

  2. Create a virtual env where you want to install:

    $> python3 -m venv grafton
    
  3. Activate the environment

    $> source grafton/bin/activate
    
  4. Install the package with pip

    $> pip install grafton
    

Randomize numeric entries

from grafton import randomise_number

int_number = 5
randomized_number = randomise_number(int_number)

Anonymize files of any format

from grafton import anonymise
import pandas as pd
# The pseudoynyms file
pseudonyms_url = 'https://raw.githubusercontent.com/yForecasting/grafton/main/grafton/pseudonyms.csv'
pseudonyms_file = pd.read_csv(pseudonyms_url, sep = ';')

# The consent file with the entries they gave their consent
consent_url = 'https://raw.githubusercontent.com/yForecasting/grafton/main/grafton/consent.csv'
consent_file = pd.read_csv(consent_url)

# The location of the flatfile to be anonymized
flat_file = 'https://raw.githubusercontent.com/yForecasting/grafton/main/grafton/flatfile.csv'

# The location of the export file
export_file = '/content/export.csv'

anonymise(pseudonyms_file = pseudonyms_url, consent_file = consent_url, flat_file = flat_file, export_file = export_file)
Original data read.
Export started.
Wait for the end of script signature!
Grafton complete.
---- end of script ----

Contributing

Contribution is welcomed!

Start by reviewing the contribution guidelines. After that, take a look at a good first issue.

Disclaimer

grafton does not save, publish or share with anyone any identifiable user information.
The use of this package does not guarantee compliance with the GDPR. This package only performs the steps described above.

Support

The AI team at VIVES University of Applied Sciences builds and maintains grafton to make it simple and accessible. We are using this software in our research on demand forecasting. A special thanks to Ruben Vanhecke and Filotas Theodosiou for their contribution.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

grafton-0.1.9.tar.gz (18.1 kB view details)

Uploaded Source

Built Distribution

grafton-0.1.9-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file grafton-0.1.9.tar.gz.

File metadata

  • Download URL: grafton-0.1.9.tar.gz
  • Upload date:
  • Size: 18.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.6

File hashes

Hashes for grafton-0.1.9.tar.gz
Algorithm Hash digest
SHA256 3c37ae157965a2b75d3d3ce9baf539fccbb9b7b4b6479a2c0320b08755cee8a8
MD5 ca7652b2447195ee1edbba1141ce7cda
BLAKE2b-256 ae3f88f9aa8206b5506210d98e02594fd92a530c22de5bb907e2ba9909aa21ea

See more details on using hashes here.

File details

Details for the file grafton-0.1.9-py3-none-any.whl.

File metadata

  • Download URL: grafton-0.1.9-py3-none-any.whl
  • Upload date:
  • Size: 18.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.8.6

File hashes

Hashes for grafton-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 031942437972ffd620aa6220996fd548e932b1ae97150070190c449d78db5b89
MD5 36c8b43202f50559d1a2b8f87a20f248
BLAKE2b-256 064f141781bac6dc13d3da4f2b8fc4eac285b8745ff5d863949ac062d5d03fa5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page