Skip to main content

High-performance data masking and pseudonymization for pandas/pyarrow DataFrames

Project description

DataForge Mask

High-performance data masking for Python.

Installation

pip install dataforge-mask

Usage

from dataforge_mask import mask
import pandas as pd

df = pd.DataFrame({
    "email": ["a@test.com", "b@test.com", "c@test.com"],
    "ssn": ["123-45-6789", "987-65-4321", "111-22-3333"]
})
transforms = [
    {"column": "email", "type": "hash"},
    {"column": "ssn", "type": "redact"}
]
result = mask(df, transforms, key="secret_key")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dataforge_mask-0.1.0-cp39-cp39-macosx_11_0_arm64.whl (674.3 kB view details)

Uploaded CPython 3.9macOS 11.0+ ARM64

File details

Details for the file dataforge_mask-0.1.0-cp39-cp39-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for dataforge_mask-0.1.0-cp39-cp39-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 c82d1ccef9aefc8cc1769e10045edb0fc4c212d90e82e70ad9342b3b5995565f
MD5 a67cc9bba6778f67bacac44f44c43808
BLAKE2b-256 0bfcc7b6ba447cdd08ce055bdd6d8565aafe00351bb92f7992d79a9c2c516d00

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page