Skip to main content

No project description provided

Project description

Install

Requires Python 3.10 or 3.11

platform depentdant but make sure to get a distribution with pip and venv

get code

`git clone

navigate

cd templating_ai/ncd-anon

create venb (optional)

python -m venv venv-ncd-anon

windows

./venv-ncd-anon/Scripts/activate

linux/mac

source ./venv venv-ncd-anon/bin/activate

install poetry

pip install poetry

install package

poetry install

Usage

get help

`ncd_anonimize

NCD Anonymizer Tool
==================

A command-line tool to anonymize PDF and TXT files by removing or masking sensitive information.

Usage
-----
    python ncd-anon.py <file_or_folder_path>

Arguments
---------
    file_or_folder_path : str
        Path to either a single file (.pdf or .txt) or a directory containing multiple files.

Output
------
    For each processed file, creates a new file with '.anon.txt' extension in the same directory.
    Example: 'document.pdf' -> 'document.anon.txt'

Examples
--------
    # Anonymize a single PDF file
    python ncd-anon.py /path/to/document.pdf

    # Anonymize a single text file
    python ncd-anon.py /path/to/document.txt

    # Anonymize all PDF and TXT files in a directory
    python ncd-anon.py /path/to/directory

Notes
-----
    - Only .pdf and .txt files are processed
    - Files already ending in '.anon.txt' are skipped
    - The tool will create anonymized text versions of all processed files

`

proper usage

ncd_anonimize testdata/

output

Device set to use cpu Anonymizing C:\project\novapulse\templating_ai\ncd-anon\testdata\petroleum.txt to C:\project\novapulse\templating_ai\ncd-anon\testdata\petroleum.anon.txt { "Midstates Petroleum Company, Inc.": "ORG", "Stephen C. Pugh": "PER", "Executive": "ORG", "Company": "ORG", "COBRA": "ORG", "Company Party": "ORG", "L": "ORG", "##IABILITY": "ORG", "Equal Employment Opportunity Commission": "ORG", "EEOC": "ORG", "Louisiana": "LOC", "Oklahoma": "LOC", "Woods": "LOC", "Alfalfa": "LOC", "Lincoln": "LOC", "Roger Mills": "LOC", "Dewey": "LOC", "Ellis": "LOC", "Woodward": "LOC", "Texas": "LOC", "Lipscomb": "LOC", "Ochiltree": "LOC", "Hansford": "LOC", "Board of Directors": "ORG", "Midstates Incentive Holdings LLC": "ORG", "John P. Foley": "PER", "Post Oak Parkway": "LOC", "Suite": "LOC", "Houston": "LOC", "Party": "ORG", "State of Texas": "LOC", "MIDSTATES PETROLEUM COMPANY": "ORG", "IN": "LOC", "John A. Crum": "PER", "STEPHEN C.": "PER" }

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ncd_anon-0.1.0.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ncd_anon-0.1.0-py3-none-any.whl (6.2 kB view details)

Uploaded Python 3

File details

Details for the file ncd_anon-0.1.0.tar.gz.

File metadata

  • Download URL: ncd_anon-0.1.0.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.10.11

File hashes

Hashes for ncd_anon-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3cfd3b2603ff3c93ee808ecc2d03f7f52f01c7d1e38289c89eee34e77ce3ce45
MD5 9700798f784933c1aff4f84e80cf1f10
BLAKE2b-256 b78ccd6c554790c105d2e165bdc823e2bc6adb4609c5b98cd869831534879c34

See more details on using hashes here.

File details

Details for the file ncd_anon-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: ncd_anon-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 6.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.10.11

File hashes

Hashes for ncd_anon-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c42543cbd7535918d8c0752048f9c5cf71475b4254d8dba8d194220947514b65
MD5 fd47eaf9cfec9783e0e3b6d1a957f05e
BLAKE2b-256 8ced0c7848ffd85cb2e80a13562834eafcb6b8d5f77aaca41291a8a4df74edfb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page