Skip to main content

Anonymizes personally identifiable information for Large Language Model APIs

Project description

anonLLM: Anonymize Personally Identifiable Information (PII) for Large Language Model APIs

License: MIT

anonLLM is a Python package designed to anonymize personally identifiable information (PII) in text data before it's sent to Language Model APIs like GPT-3. The goal is to protect user privacy by ensuring that sensitive data such as names, email addresses, and phone numbers are anonymized.

Features

Anonymize names Anonymize email addresses Anonymize phone numbers Support for multiple country-specific phone number formats Reversible anonymization (de-anonymization) Installation

To install anonLLM, run:

pip install anonLLM

Quick Start

Here's how to get started with anonLLM:

from anonLLM.llm import OpenaiLanguageModel
from dotenv import load_dotenv

load_dotenv()

# Anonymize a text
text = "Write a CV for me: My name is Alice Johnson, "\
    "email: alice.johnson@example.com, phone: +1 234-567-8910."\
    "I am a machine learning engineer."

# Anonymization is handled under the hood
llm = OpenaiLanguageModel()

response = llm.generate(text)

print(response)

In this example, the response will contain the correct name provided. At the same time, no PII will be sent to OpenAI.

Contributing

We welcome contributions!

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anonLLM-0.1.6.tar.gz (6.8 kB view details)

Uploaded Source

File details

Details for the file anonLLM-0.1.6.tar.gz.

File metadata

  • Download URL: anonLLM-0.1.6.tar.gz
  • Upload date:
  • Size: 6.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for anonLLM-0.1.6.tar.gz
Algorithm Hash digest
SHA256 038a976df2bfeb2cb2574dcfeff53e0fe002f5a244e8ef6b19dfc6759f956ee1
MD5 4e98837e037ca79af1a0eb8a90947e97
BLAKE2b-256 e0dcd05e94925dbb179e1a4d1464bc668e20b50607c08a57220f6ee5ef504f2a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page