Skip to main content

STRM Privacy Diagnostics for reporting privacy metrics on a dataset.

Project description

STRM Privacy Diagnostics

This package contains diagnostics for your data, by means of computing k-Anonymity, l-Diversity and t-Closeness.

You can compute the scores by passing your data and indicating which columns are quasi-identifiers and sensitive attributes.

A 'quasi identifier' is a data attribute on an individual that together with other attributes could identify them. E.g. your length probably doesn't discern you from a larger group of people, but the combination of your length, age and city of birth will if someone has some knowledge about you.

A 'sensitive attribute' is a sensitive data point, like a specific medical diagnosis or credit score.

Installation

Install the package via Pip:

pip install strmprivacy-diagnostics

Usage

Simply import the package and

  • point it to your input data
  • calculate the statistics by passing the quasi identifiers and sensitive attributes
  • print a report by passing the quasi identifiers and sensitive attributes
from strmprivacy.diagnostics import PrivacyDiagnostics

# create an instance of the diagnostics class
d = PrivacyDiagnostics("/path/to/csv")

# calculate the statistics
d.calculate_stats(
    qi=['qi1', 'qi2', ...],  # names of quasi identifier columns,
    sa=['sa1', 'sa2', ...],  # names of sensitive attributes
)

# create report
d.create_report(
    qi=['qi1', 'qi2', ...],  # names of quasi identifier columns,
    sa=['sa1', 'sa2', ...],  # names of sensitive attributes
)

d.stats
>>> {'k': xxx, 'l': {'col1': xxx, ...}, 't': xxx}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

strmprivacy-diagnostics-1.2.2.tar.gz (8.5 kB view details)

Uploaded Source

Built Distribution

strmprivacy_diagnostics-1.2.2-py3-none-any.whl (9.0 kB view details)

Uploaded Python 3

File details

Details for the file strmprivacy-diagnostics-1.2.2.tar.gz.

File metadata

File hashes

Hashes for strmprivacy-diagnostics-1.2.2.tar.gz
Algorithm Hash digest
SHA256 548a6896288ffc6e232cbb72b14d1808d45f55d491206cc5f96576501f5e091f
MD5 e043bc3fb36e36e9dd9330cd211b9b33
BLAKE2b-256 6464d4e5add793c085d29a7e3570d7f6fca16c8073894c852b90e849796e5f7a

See more details on using hashes here.

File details

Details for the file strmprivacy_diagnostics-1.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for strmprivacy_diagnostics-1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6251800ba6282cfe5a0f6e662fe4066fa58b95ed24815e9ac012817a9b46a415
MD5 e54ceddd4edb21f091715765d97dfec2
BLAKE2b-256 d517ee256caeb2c96cafe40cb1240f490beecb6090e2d99059b7121ad091577f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page