Presidio Analyzer package

These details have not been verified by PyPI

Project description

Presidio analyzer

Description

The Presidio analyzer is a Python based service for detecting PII entities in text.

During analysis, it runs a set of different PII Recognizers, each one in charge of detecting one or more PII entities using different mechanisms.

Presidio analyzer comes with a set of predefined recognizers, but can easily be extended with other types of custom recognizers. Predefined and custom recognizers leverage regex, Named Entity Recognition and other types of logic to detect PII in unstructured text.

Language Model-based PII/PHI Detection

Presidio analyzer supports language model-based PII/PHI detection (LLMs, SLMs) for flexible entity recognition. The current implementation uses LangExtract with support for multiple providers:

Ollama - Local model deployment for privacy-sensitive environments
Azure OpenAI - Cloud-based deployment with enterprise features

pip install presidio-analyzer[langextract]

Quick Usage

Ollama (local models):

from presidio_analyzer.predefined_recognizers import BasicLangExtractRecognizer
recognizer = BasicLangExtractRecognizer()  # Uses default config

Azure OpenAI (cloud models):

from presidio_analyzer.predefined_recognizers import AzureOpenAILangExtractRecognizer

# Simple usage - pass everything as parameters
recognizer = AzureOpenAILangExtractRecognizer(
    model_id="gpt-4",  # Your Azure deployment name
    azure_endpoint="https://your-resource.openai.azure.com/",
    api_key="your-api-key"
)

# Or use environment variables (AZURE_OPENAI_ENDPOINT, AZURE_OPENAI_API_KEY):
recognizer = AzureOpenAILangExtractRecognizer(
    model_id="gpt-4"  # Your Azure deployment name
)

# Advanced: Customize entities/prompts with config file
recognizer = AzureOpenAILangExtractRecognizer(
    model_id="gpt-4",
    config_path="./custom_config.yaml",  # Optional: for custom entities/prompts
    azure_endpoint="https://your-resource.openai.azure.com/",
    api_key="your-api-key"
)

Note: LangExtract recognizers do not validate connectivity during initialization. Connection errors or missing models will be reported when analyze() is first called.

See the Language Model-based PII/PHI Detection guide for complete setup and usage instructions.

Deploy Presidio analyzer to Azure

Use the following button to deploy presidio analyzer to your Azure subscription.

Simple usage example

from presidio_analyzer import AnalyzerEngine

# Set up the engine, loads the NLP module (spaCy model by default) and other PII recognizers
analyzer = AnalyzerEngine()

# Call analyzer to get results
results = analyzer.analyze(text="My phone number is 212-555-5555",
                           entities=["PHONE_NUMBER"],
                           language='en')
print(results)

GPU Acceleration

For GPU acceleration, install the appropriate dependencies for your hardware:

Linux with NVIDIA GPU: cupy-cuda12x (or the version matching your CUDA installation)
macOS with Apple Silicon: MPS (Metal Performance Shaders) is currently not supported. The analyzer will use CPU for PyTorch operations.

Documentation

Additional documentation on installation, usage and extending the Analyzer can be found under the Analyzer section of Presidio Documentation

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

2.2.361

Feb 12, 2026

2.2.360

Sep 9, 2025

2.2.359

Jul 13, 2025

2.2.358

Mar 18, 2025

2.2.357

Jan 13, 2025

2.2.356

Dec 15, 2024

2.2.355

Jul 22, 2024

2.2.354

Mar 29, 2024

2.2.353

Feb 12, 2024

2.2.352

Jan 22, 2024

2.2.351

Nov 8, 2023

2.2.350

Nov 2, 2023

2.2.35

Oct 31, 2023

2.2.34

Oct 30, 2023

2.2.33

Jun 4, 2023

2.2.32

Jan 31, 2023

2.2.31

Dec 14, 2022

2.2.30

Oct 25, 2022

2.2.29

Jul 12, 2022

2.2.28

May 8, 2022

2.2.27

Mar 9, 2022

2.2.26

Feb 24, 2022

2.2.25

Feb 22, 2022

2.2.24

Jan 26, 2022

2.2.23

Nov 16, 2021

2.2.22

Oct 3, 2021

2.2.21

Jun 14, 2021

2.2.4

Nov 2, 2023

2.2.2

Jun 9, 2021

2.2.1

May 10, 2021

2.2.0

Apr 12, 2021

2.1.0

Mar 25, 2021

2.0.3

Mar 25, 2021

2.0.2

Mar 25, 2021

2.0.1

Mar 4, 2021

2.0.0

Mar 1, 2021

2.0.0rc1 pre-release

Mar 1, 2021

1.11.0 yanked

Feb 11, 2021

Reason this release was yanked:

pre release

1.10.0 yanked

Feb 8, 2021

Reason this release was yanked:

debug

1.9.0 yanked

Feb 7, 2021

Reason this release was yanked:

debug

0.95

Feb 25, 2021

0.3.10649rc0 pre-release

Feb 9, 2021

0.3.10541.dev0 pre-release

Feb 5, 2021

0.3.9684.dev0 pre-release

Oct 26, 2020

0.3.9671rc0 pre-release

Sep 30, 2020

0.3.9547rc0 pre-release

Aug 10, 2020

0.3.9545rc0 pre-release

Aug 9, 2020

0.3.9542rc0 pre-release

Aug 6, 2020

0.3.9540rc0 pre-release

Aug 5, 2020

0.3.9537.dev0 pre-release

Aug 4, 2020

0.3.9535rc0 pre-release

Aug 1, 2020

0.3.9528rc0 pre-release

Jul 28, 2020

0.3.9525.dev0 pre-release

Jul 27, 2020

0.3.9491.dev0 pre-release

Jul 9, 2020

0.3.9488rc0 pre-release

Jul 7, 2020

0.3.9352rc0 pre-release

Jun 22, 2020

0.3.9237rc0 pre-release

Jun 7, 2020

0.3.9231.dev0 pre-release

Jun 5, 2020

0.3.8917rc0 pre-release

May 26, 2020

0.3.8780.dev0 pre-release

May 6, 2020

0.3.8779.dev0 pre-release

May 6, 2020

0.3.8778.dev0 pre-release

May 6, 2020

0.3.8774rc0 pre-release

May 6, 2020

0.3.8768rc0 pre-release

May 4, 2020

0.3.8767.dev0 pre-release

May 4, 2020

0.3.8763rc0 pre-release

May 3, 2020

0.3.8758.dev0 pre-release

May 2, 2020

0.3.8754rc0 pre-release

Apr 30, 2020

0.3.8750rc0 pre-release

Apr 30, 2020

0.3.8743rc0 pre-release

Apr 30, 2020

0.3.8740.dev0 pre-release

Apr 30, 2020

0.3.8705.dev0 pre-release

Apr 22, 2020

0.3.8697.dev0 pre-release

Apr 22, 2020

0.3.8689rc0 pre-release

Apr 21, 2020

0.3.8666rc0 pre-release

Apr 20, 2020

0.3.8636.dev0 pre-release

Apr 13, 2020

0.3.8635.dev0 pre-release

Apr 13, 2020

0.3.8628rc0 pre-release

Apr 12, 2020

0.3.8622rc0 pre-release

Apr 10, 2020

0.3.8619rc0 pre-release

Apr 10, 2020

0.3.8614rc0 pre-release

Apr 9, 2020

0.3.8602rc0 pre-release

Apr 6, 2020

0.3.8600rc0 pre-release

Apr 5, 2020

0.3.8575rc0 pre-release

Mar 30, 2020

0.3.8573.dev0 pre-release

Mar 30, 2020

0.3.8571.dev0 pre-release

Mar 30, 2020

0.3.8513rc0 pre-release

Mar 24, 2020

0.3.8493.dev0 pre-release

Mar 23, 2020

0.3.8468.dev0 pre-release

Mar 22, 2020

0.3.8467.dev0 pre-release

Mar 22, 2020

0.3.8463.dev0 pre-release

Mar 22, 2020

0.3.8450.dev0 pre-release

Mar 22, 2020

0.3.8443rc0 pre-release

Mar 22, 2020

0.3.8441rc0 pre-release

Mar 22, 2020

0.3.8405rc0 pre-release

Mar 16, 2020

0.3.8400.dev0 pre-release

Mar 15, 2020

0.3.8396.dev0 pre-release

Mar 15, 2020

0.3.8395.dev0 pre-release

Mar 15, 2020

0.3.8359rc0 pre-release

Mar 12, 2020

0.3.8326rc0 pre-release

Mar 12, 2020

0.3.8307rc0 pre-release

Mar 11, 2020

0.3.8303.dev0 pre-release

Mar 11, 2020

0.3.8293.dev0 pre-release

Mar 11, 2020

0.3.8278.dev0 pre-release

Mar 10, 2020

0.3.8273.dev0 pre-release

Mar 10, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

presidio_analyzer-2.2.361-py3-none-any.whl (183.9 kB view details)

Uploaded Feb 12, 2026 Python 3

File details

Details for the file presidio_analyzer-2.2.361-py3-none-any.whl.

File metadata

Download URL: presidio_analyzer-2.2.361-py3-none-any.whl
Upload date: Feb 12, 2026
Size: 183.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for presidio_analyzer-2.2.361-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7054b36303f5f47dd4bb3b00600bc936fb46aa3cc5e6befde3de839f0205f7f2`
MD5	`2744c3e568e02fa3a492a7fd1e98e0d4`
BLAKE2b-256	`f7475f07857a3ae4bea36cb631adc6899ef58081cb37ad1901aab01b6a8b2849`

See more details on using hashes here.

Provenance

The following attestation bundles were made for presidio_analyzer-2.2.361-py3-none-any.whl:

Publisher: release.yml on microsoft/presidio

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: presidio_analyzer-2.2.361-py3-none-any.whl
- Subject digest: 7054b36303f5f47dd4bb3b00600bc936fb46aa3cc5e6befde3de839f0205f7f2
- Sigstore transparency entry: 943379287
- Sigstore integration time: Feb 12, 2026
Source repository:
- Permalink: microsoft/presidio@cae43c1a17ef1d4a221b21f74f9f95f00d646602
- Branch / Tag: refs/heads/main
- Owner: https://github.com/microsoft
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@cae43c1a17ef1d4a221b21f74f9f95f00d646602
- Trigger Event: workflow_dispatch

presidio-analyzer 2.2.361

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Presidio analyzer

Description

Language Model-based PII/PHI Detection

Quick Usage

Deploy Presidio analyzer to Azure

Simple usage example

GPU Acceleration

Documentation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

Provenance