Skip to main content

Cosmian Cloudproof Anonymization library

Project description

Data Anonymization

Data anonymization is the process of transforming data in such a way that it can no longer be used to identify individuals without the use of additional information. This is often done to protect the privacy of individuals whose data is being collected or processed.

Anonymization techniques can include removing identifying information such as names and addresses, replacing identifying information with pseudonyms, and aggregating data so that individual data points cannot be distinguished. It's important to note that while anonymization can reduce the risk of re-identification, it is not foolproof and must be used in conjunction with other security measures to fully protect personal data.

Features

Cosmian anonymization provides multiple methods:

  • Hashing: transforms data into a fixed-length representation that is difficult to reverse and provides a high level of anonymity. Use anonymization::Hasher to apply the various hash functions.

  • Noise Addition: adds random noise to data in order to preserve privacy. Use anonymization::NoiseGenerator to apply various types of noise distributions to float, integer, and date.

  • Word Masking: hides sensitive words in a text. Use anonymization::WordMasker to mask a list of words.

  • Word Tokenization: removes sensitive words from text by replacing them with tokens. Use anonymization::WordTokenizer to replace a list of words.

  • Word Pattern Masking: replaces a sensitive pattern in text with specific characters or strings. Use anonymization::WordPatternMasker to replace specified pattern regex with a replacement string.

  • Number Aggregation: rounds numbers to a desired power of ten. This method is used to reduce the granularity of data and prevent re-identification of individuals. Use anonymization::NumberAggregator to round float and int values.

  • Date Aggregation: rounds dates based on the specified time unit. This helps to preserve the general time frame of the original data while removing specific details that could potentially identify individuals. Use anonymization::DateAggregator to round date.

  • Number Scaling: scales numerical data by a specified factor. This can be useful for anonymizing data while preserving its relative proportions. Use anonymization::NumberScaler to round float and int values.

Date Format

WARNING: The anonymization functions date input is in RFC3339 string format which is slightly different from ISO format.

ISO format RFC 3339
2023-04-07T12:34:56 2023-04-07T12:34:56Z
2023-04-27T16:23:00+00:00 2023-04-27T16:23:00+00:00
2023-04-27T16:23:00+05:00 2023-04-27T16:23:00+05:00
2023-04-27T16:23:00-05:00 2023-04-27T16:23:00-05:00

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

cloudproof_anonymization-0.1.3-cp37-abi3-win_amd64.whl (833.1 kB view details)

Uploaded CPython 3.7+ Windows x86-64

cloudproof_anonymization-0.1.3-cp37-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (882.6 kB view details)

Uploaded CPython 3.7+ manylinux: glibc 2.17+ x86-64

cloudproof_anonymization-0.1.3-cp37-abi3-macosx_10_12_x86_64.whl (820.5 kB view details)

Uploaded CPython 3.7+ macOS 10.12+ x86-64

File details

Details for the file cloudproof_anonymization-0.1.3-cp37-abi3-win_amd64.whl.

File metadata

File hashes

Hashes for cloudproof_anonymization-0.1.3-cp37-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 0c3e2ad858a163b53f77f1c72e3072ed2f64b556afb98be51f4a9c7a4f7207b4
MD5 08eb47681ceda8fa4933ec32a7b625ce
BLAKE2b-256 b07e9a7a94f11f068c3ca36d53df65e4485bcb7e248ed36d0f5f18c79aaffbe7

See more details on using hashes here.

File details

Details for the file cloudproof_anonymization-0.1.3-cp37-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for cloudproof_anonymization-0.1.3-cp37-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7a1f4867fe44b951b5f7079afac345238988fc4191006d174a36dcc0a3d430df
MD5 e345d2f23dd96f72a5b66809d862500b
BLAKE2b-256 22264ce959e6928350f4a222c80a982c73d95647e20c6495ea120c21e7bd9555

See more details on using hashes here.

File details

Details for the file cloudproof_anonymization-0.1.3-cp37-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for cloudproof_anonymization-0.1.3-cp37-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 dfa959ab2afbbb28e3dd82f4620f5e94f849e833cd8b9b0e68f550ebb3567fc7
MD5 e8e629d1d3a046382cd09a54dc42ffa1
BLAKE2b-256 7e81953cd99e888d9f7f1f7fe12abe8037755d6f974a3a053a785e3a9a6422b3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page