Skip to main content

Data anonymization CLI tool

Project description

Lethe

Data anonymization CLI for structured files. Detect and replace PII in CSV, TSV, and plain text using Presidio and spaCy NER, with Faker-generated replacements that stay consistent across your dataset.

Install

pip install lethe-cli
python -m spacy download en_core_web_trf

For a faster, lighter model instead of the transformer:

python -m spacy download en_core_web_sm

Usage

Anonymize

Replace detected PII with consistent fake values:

lethe anonymize data.csv -o anonymized.csv
lethe anonymize data.csv --model sm --threshold 0.7
lethe anonymize notes.txt -o clean.txt --locale nl_NL

Multiply

Generate synthetic rows from an existing dataset:

lethe multiply data.csv --factor 5 -o expanded.csv
lethe multiply data.csv --factor 10 --sanitize --seed 42

Options

Run lethe anonymize --help or lethe multiply --help for the full list of options.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lethe_cli-0.1.1.tar.gz (58.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lethe_cli-0.1.1-py3-none-any.whl (19.3 kB view details)

Uploaded Python 3

File details

Details for the file lethe_cli-0.1.1.tar.gz.

File metadata

  • Download URL: lethe_cli-0.1.1.tar.gz
  • Upload date:
  • Size: 58.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.2

File hashes

Hashes for lethe_cli-0.1.1.tar.gz
Algorithm Hash digest
SHA256 827b2f00b04df71d48efe79bf66e3ceeb4915e8e7dadc55b1249d5b04ddbcce4
MD5 24afae96677ac457fcdc0884795da63b
BLAKE2b-256 44e57fd983db150faf4435d0ba1c2490c4133d293778475952f20ff3cd29478d

See more details on using hashes here.

File details

Details for the file lethe_cli-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: lethe_cli-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 19.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.2

File hashes

Hashes for lethe_cli-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4ccc2ba84ff9bbc8197c0cab42a2062adbc872a0fae28976397da5980361c96c
MD5 a15e9f59d05b3b03cbd7223931df5e7d
BLAKE2b-256 d0bcb0e73c3d18513dd1d6303a9319c1a98fc138845a8a6006962a33aac5f7b5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page