Skip to main content

Anonymizes personally identifiable information for Large Language Model APIs

Project description

anonLLM: Anonymize Personally Identifiable Information (PII) for Large Language Model APIs

License: MIT

anonLLM is a Python package designed to anonymize personally identifiable information (PII) in text data before it's sent to Language Model APIs like GPT-3. The goal is to protect user privacy by ensuring that sensitive data such as names, email addresses, and phone numbers are anonymized.

Features

Anonymize names Anonymize email addresses Anonymize phone numbers Support for multiple country-specific phone number formats Reversible anonymization (de-anonymization) Installation

To install anonLLM, run:

pip install anonLLM

Quick Start

Here's how to get started with anonLLM:

from anonLLM.llm import OpenaiLanguageModel
from dotenv import load_dotenv

load_dotenv()

# Anonymize a text
text = "Write a CV for me: My name is Alice Johnson, "\
    "email: alice.johnson@example.com, phone: +1 234-567-8910."\
    "I am a machine learning engineer."

# Anonymization is handled under the hood
llm = OpenaiLanguageModel()

response = llm.generate(text)

print(response)

In this example, the response will contain the correct name provided. At the same time, no PII will be sent to OpenAI.

Contributing

We welcome contributions!

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

anonLLM-0.1.8.tar.gz (7.3 kB view details)

Uploaded Source

File details

Details for the file anonLLM-0.1.8.tar.gz.

File metadata

  • Download URL: anonLLM-0.1.8.tar.gz
  • Upload date:
  • Size: 7.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for anonLLM-0.1.8.tar.gz
Algorithm Hash digest
SHA256 5ea9512ac5b95b6c3f32080f5b4f1b8cac8bdf588fb21835da2c9fcc3230f662
MD5 a240d354362623adf910467273f3c58f
BLAKE2b-256 0c536cf1019aacbf7fb18393fc36d87afbafd1b62a403d0785e1f251fd42ca8d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page