Transform recognized PII instances in a document
pii-transform
This package takes a source document and a collection of detected PII instances, and transforms the document by replacing each PII instance with a different representation.
The type of substitution done is defined by transformation policies.
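As an illustration of the idea, here is a minimal sketch of policy-driven replacement. It is not the package's own implementation: the PiiSpan record, the substitute and transform helpers, and the policy names "redact", "label" and "hash" are all hypothetical names chosen for this example, and the sketch assumes the detected spans do not overlap.

```python
import hashlib
from dataclasses import dataclass
from typing import List


@dataclass
class PiiSpan:
    """Hypothetical record for one detected PII instance (not the package's own type)."""
    start: int  # offset of the first character of the instance
    end: int    # offset one past the last character
    kind: str   # PII type label, e.g. "EMAIL"


def substitute(value: str, kind: str, policy: str) -> str:
    """Return the replacement string for one PII value under an illustrative policy."""
    if policy == "redact":
        return "<PII>"
    if policy == "label":
        return f"<{kind}>"
    if policy == "hash":
        # Truncated SHA-256 digest: the same value always maps to the same token
        return hashlib.sha256(value.encode("utf-8")).hexdigest()[:12]
    raise ValueError(f"unknown policy: {policy}")


def transform(text: str, spans: List[PiiSpan], policy: str) -> str:
    """Rebuild the text, swapping each span for its replacement (spans must not overlap)."""
    out = []
    pos = 0
    for span in sorted(spans, key=lambda s: s.start):
        out.append(text[pos:span.start])  # untouched text before the span
        out.append(substitute(text[span.start:span.end], span.kind, policy))
        pos = span.end
    out.append(text[pos:])  # untouched text after the last span
    return "".join(out)


text = "Mail bob@example.com or call 555-0100."
email_at = text.index("bob@example.com")
phone_at = text.index("555-0100")
spans = [
    PiiSpan(email_at, email_at + len("bob@example.com"), "EMAIL"),
    PiiSpan(phone_at, phone_at + len("555-0100"), "PHONE"),
]
print(transform(text, spans, "label"))  # Mail <EMAIL> or call <PHONE>.
```

Building the output from slices, rather than editing the string in place, keeps the original offsets valid even when a replacement differs in length from the text it replaces.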
Command-line scripts
The package provides three console scripts:
- pii-transform loads a source document and a collection of already-detected PII, and produces a transformed document following the required policies.
- pii-process is a full end-to-end script that:
  - loads a document, from among the formats supported by pii-preprocess
  - detects PII instances, according to pii-extract and its installed plugins
  - transforms the detected PII instances (according to the indicated policy) and writes out the transformed document
- pii-process-jsonl is also a full end-to-end script; this one reads JSONL files and processes each line as a separate text buffer (possibly in different languages), producing a transformed JSONL document.
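The line-by-line JSONL processing described above can be sketched as follows. This is an illustration only, not the code behind pii-process-jsonl: the process_jsonl function, the "text" and "lang" field names, the "en" default and the transform_text callback are all assumptions made for this example.

```python
import io
import json
from typing import Callable, TextIO


def process_jsonl(
    src: TextIO,
    dst: TextIO,
    transform_text: Callable[[str, str], str],
    text_field: str = "text",  # assumed name of the field holding the text buffer
    lang_field: str = "lang",  # assumed name of the field holding the language code
) -> int:
    """Transform each JSONL record independently; return the number of records written."""
    count = 0
    for line in src:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        lang = record.get(lang_field, "en")  # assumed default language
        record[text_field] = transform_text(record[text_field], lang)
        dst.write(json.dumps(record, ensure_ascii=False) + "\n")
        count += 1
    return count


# Usage with a stand-in transformation that only tags and upper-cases the text
src = io.StringIO('{"text": "hola", "lang": "es"}\n{"text": "hello"}\n')
dst = io.StringIO()
process_jsonl(src, dst, lambda text, lang: f"[{lang}] {text.upper()}")
print(dst.getvalue(), end="")
```

Treating every line as its own buffer means each record can carry its own language, and a malformed record can be located by line number without affecting its neighbours.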
Note that pii-process and pii-process-jsonl will need additional packages to be installed:
- pii-preprocess (only for pii-process)
- pii-extract-base, together with any available detection plugins, e.g. pii-extract-plg-regex and/or pii-extract-plg-presidio
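One way to check up front whether these extra packages are present is to query the installed distributions with the standard library. The sketch below is a convenience helper, not part of pii-transform; the missing_packages function name is hypothetical, while the distribution names are the ones listed above.

```python
from importlib import metadata
from typing import Iterable, List

# Distribution names as listed in the note above
REQUIRED = ["pii-preprocess", "pii-extract-base"]
OPTIONAL_PLUGINS = ["pii-extract-plg-regex", "pii-extract-plg-presidio"]


def missing_packages(names: Iterable[str]) -> List[str]:
    """Return the distribution names that are not installed in this environment."""
    missing = []
    for name in names:
        try:
            metadata.version(name)  # raises PackageNotFoundError if not installed
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    print("missing required packages:", missing_packages(REQUIRED))
    print("missing detection plugins:", missing_packages(OPTIONAL_PLUGINS))
```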
API
The same functionality provided by the command-line scripts can also be accessed via a Python API.
Source Distribution
pii-transform-0.5.0.tar.gz (21.3 kB)