Data anonymization CLI tool
Project description
Lethe
Data anonymization CLI for structured files. Detect and replace PII in CSV, TSV, and plain text using Presidio and spaCy NER, with Faker-generated replacements that stay consistent across your dataset.
Install
pip install lethe-cli
python -m spacy download en_core_web_trf
For a faster, lighter model instead of the transformer:
python -m spacy download en_core_web_sm
Usage
Anonymize
Replace detected PII with consistent fake values:
lethe anonymize data.csv -o anonymized.csv
lethe anonymize data.csv --model sm --threshold 0.7
lethe anonymize notes.txt -o clean.txt --locale nl_NL
Multiply
Generate synthetic rows from an existing dataset:
lethe multiply data.csv --factor 5 -o expanded.csv
lethe multiply data.csv --factor 10 --sanitize --seed 42
Options
Run lethe anonymize --help or lethe multiply --help for the full list of options.
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file lethe_cli-0.1.1.tar.gz.
File metadata
- Download URL: lethe_cli-0.1.1.tar.gz
- Upload date:
- Size: 58.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
827b2f00b04df71d48efe79bf66e3ceeb4915e8e7dadc55b1249d5b04ddbcce4
|
|
| MD5 |
24afae96677ac457fcdc0884795da63b
|
|
| BLAKE2b-256 |
44e57fd983db150faf4435d0ba1c2490c4133d293778475952f20ff3cd29478d
|
File details
Details for the file lethe_cli-0.1.1-py3-none-any.whl.
File metadata
- Download URL: lethe_cli-0.1.1-py3-none-any.whl
- Upload date:
- Size: 19.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4ccc2ba84ff9bbc8197c0cab42a2062adbc872a0fae28976397da5980361c96c
|
|
| MD5 |
a15e9f59d05b3b03cbd7223931df5e7d
|
|
| BLAKE2b-256 |
d0bcb0e73c3d18513dd1d6303a9319c1a98fc138845a8a6006962a33aac5f7b5
|