CLI tool to anonymize code using local LLM before sending to Claude Code

These details have not been verified by PyPI

Project links

Project description

LLM Anonymizer

A CLI tool to anonymize code using a local LLM before sending to Claude Code or other AI services.

Prerequisites

Install Ollama

macOS/Linux:

curl -fsSL https://ollama.ai/install.sh | sh

Windows: Download from ollama.ai
Start Ollama service:
```
ollama serve
```

Install a model (in a new terminal):

# Install Llama 3.2 (recommended)
ollama pull llama3.2

# Or install other models
ollama pull codellama
ollama pull llama3.1

Install UV (if not already installed)

# macOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# Windows
powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

Installation

Clone this repository:
```
git clone <repository-url>
cd llm-anon
```
Install dependencies:
```
uv sync
```

Usage

Basic Usage

# Anonymize a single file and print to stdout
uv run python -m llm_anon.cli example.py

# Anonymize and save to file
uv run python -m llm_anon.cli example.py -o anonymized.py

# Process entire directory
uv run python -m llm_anon.cli src/ -r -o anonymized_output/

Options

-o, --output PATH: Output file or directory
-m, --model TEXT: LLM model to use (default: llama3.2)
-t, --temperature FLOAT: Temperature for generation (default: 0.1)
--preserve-comments: Keep original comments
--preserve-strings: Keep string literals unchanged
-r, --recursive: Process directories recursively
-v, --verbose: Show detailed progress
--validation-config PATH: Path to validation config file with banned strings
--max-retries INTEGER: Maximum retries for validation failures (default: 3)

Examples

# Use different model with higher creativity
uv run python -m llm_anon.cli code.py -m codellama -t 0.3

# Preserve important strings and comments
uv run python -m llm_anon.cli api.py --preserve-strings --preserve-comments

# Process entire project with verbose output
uv run python -m llm_anon.cli ./src -r -v -o ./anonymized

# Use validation to ensure sensitive strings are removed
uv run python -m llm_anon.cli code.py --validation-config banned_strings.txt -v

# Process with custom retry limit for stubborn validations
uv run python -m llm_anon.cli sensitive_code.py --validation-config company_secrets.txt --max-retries 5

Supported Languages

Python (.py)
JavaScript (.js, .jsx)
TypeScript (.ts, .tsx)
Java (.java)
C/C++ (.c, .cpp, .cc, .cxx, .h, .hpp)
Rust (.rs)
Go (.go)

Validation System

The tool includes a powerful validation system to ensure sensitive information is completely removed from anonymized code.

Creating a Validation Config

Create a text file with banned strings (one per line):

# company_secrets.txt
# Company names and branding
MyCompany
mycompany.com

# API keys and credentials  
API_KEY_12345
secret_password_2024

# Contact information
support@mycompany.com
1-800-MYCOMPANY

# Product-specific terms
MyCompanyCustomer
MyCompanyLicenseManager

How Validation Works

Initial Anonymization: LLM processes code normally
String Detection: Scans output for banned strings using word boundaries
Re-prompting: If banned strings found, sends explicit removal instructions
Retry Logic: Repeats up to --max-retries times until validation passes
Failure Handling: Reports specific banned strings if validation ultimately fails

Validation Features

Case-sensitive matching by default
Word boundary detection prevents false positives
Progressive prompting gets more explicit with each retry
Detailed error reporting shows exactly which strings were found

How It Works

File Detection: Automatically detects programming language from file extension
LLM Processing: Sends code to local Ollama model with anonymization prompt
Validation (if enabled): Checks output against banned strings and re-prompts if needed
Smart Replacement: Replaces variable names, function names, and identifiers while preserving:
- Code structure and logic
- Control flow
- Data types
- Import statements
- Syntax and formatting

Troubleshooting

Ollama Connection Issues

# Check if Ollama is running
curl http://localhost:11434/api/tags

# Restart Ollama service
pkill ollama && ollama serve

# List available models
ollama list

Model Not Found

# Pull the default model
ollama pull llama3.2

# Or specify a different model
uv run python -m llm_anon.cli code.py -m llama3.1

Performance Tips

Use smaller models for faster processing: llama3.2:1b
Lower temperature (0.1) for more consistent results
Process files individually for large codebases to avoid timeouts

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.1

Jun 11, 2025

This version

0.1.0

Jun 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_anon-0.1.0.tar.gz (11.1 kB view details)

Uploaded Jun 11, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_anon-0.1.0-py3-none-any.whl (10.8 kB view details)

Uploaded Jun 11, 2025 Python 3

File details

Details for the file llm_anon-0.1.0.tar.gz.

File metadata

Download URL: llm_anon-0.1.0.tar.gz
Upload date: Jun 11, 2025
Size: 11.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.12

File hashes

Hashes for llm_anon-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`e274fa510c59e4e6052c77b0c48d680c92dd9f745846008a267d11b24b2d3b59`
MD5	`0cebb9164e006c7882f8c62f3f642cf0`
BLAKE2b-256	`48fd22d1e4196cea269efddb95be558212d9ae9c3f9459673a10f57e6744ca48`

See more details on using hashes here.

File details

Details for the file llm_anon-0.1.0-py3-none-any.whl.

File metadata

Download URL: llm_anon-0.1.0-py3-none-any.whl
Upload date: Jun 11, 2025
Size: 10.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.12

File hashes

Hashes for llm_anon-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1f1f8590de235b037bcc9046b92c960a2b066ec86097cd448eb5f4892050eb13`
MD5	`af1c09666f7f1d09b8c672cdb067ef22`
BLAKE2b-256	`2ba9b16f986323327ea8c5ca3a19df52d45b83463722e63f332279e13766ea8b`

See more details on using hashes here.

llm-anon 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LLM Anonymizer

Prerequisites

Install Ollama

Install UV (if not already installed)

Installation

Usage

Basic Usage

Options

Examples

Supported Languages

Validation System

Creating a Validation Config

How Validation Works

Validation Features

How It Works

Troubleshooting

Ollama Connection Issues

Model Not Found

Performance Tips

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes