A Python library offering conventional anonymity functions, as well as Workload-Aware anonymity.
Project description
This is a Python library that offers conventional anonymization techniques, utility metrics, and verification methods.
Instalation
You can install the package via the command line:
pip install anonymity-api
Package Description
The package is divided into three different modules, as mentioned previously:
- anonymity
- utility
- verifier
Anonymity
This module contains the functions to anonymize data.
The conventional anonymization functions available are:
- k-anonymity
- distinct l-diversity
- entropy l-diversity
- recursive (c,l)-diversity
Another available function is the suggestion function that, given a dataset and its characteristics ( list with quasi-identifiers and sensitive attributes), suggests an anonymization to use, returning an anonymized dataset without choosing a technique. This is helpful for users who may not know how to anonymize data or aren't familiar with it.
We also offer Workload-Aware anonymization techniques. These take the usual anonymization parameters also present in the conventional anonymization techniques but, in addition to that, the user can give a query representing to work to be done on the dataset. This ensures higher utility over the tasks to be done.
Technique | Query |
---|---|
Simple query | (quasi-identifier (operation*) value ) |
Keeping correlation | corr( quasi-identifier, sensitive-attribute ) |
Grouping | group( quasi-identifier, value ) |
*operation can be: >, >=, =, < or <=
Utility
The utility module offers some utility techniques and a function that given an anonymized dataset, replaces the interval on the quasi-identifiers with a value comprehended in it.
The utility metrics available are:
- Discernibility Metric
- Average Equivalence Class Size Metric
- Normalized Certainty Penalty
Verifier
This module given an anonymized dataset offers funtions for each of the conventional techniques in the Anonymization module.
These functions will say which parameter was used to anonymize the dataset. For instance, it would give the K for k-anonymity, l for distinct l-diversity, and so forth.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file anonymity_api-0.5.1.tar.gz
.
File metadata
- Download URL: anonymity_api-0.5.1.tar.gz
- Upload date:
- Size: 15.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 37910ec9eec1c2d8904a552bd959b9e349371d07281c03b982e458cd602d8fc0 |
|
MD5 | bdec9ed5b61a56fd2e2f34477f65bf3d |
|
BLAKE2b-256 | 2a32c5dd5fd9f580f20869398ae37805507d5d01fe9a7daa5e3795e443ceefa9 |
File details
Details for the file anonymity_api-0.5.1-py3-none-any.whl
.
File metadata
- Download URL: anonymity_api-0.5.1-py3-none-any.whl
- Upload date:
- Size: 21.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3d4ae8f5335c308245374510b456d011004818027f5a588af370b8fcdef960ec |
|
MD5 | 1529453b6ca80a09770a1415826d079f |
|
BLAKE2b-256 | aa374e137d9f14270e763418d3a47d5c7e7170ce514fb245f6f313f5e6286694 |