
Static scanning library for detecting malicious code, backdoors, and other security risks in ML model files


ModelAudit

A security scanner for AI models. Quickly check your AI/ML models for potential security risks before deployment.


🔍 What It Does

ModelAudit scans ML model files for:

  • Malicious code execution (e.g., os.system calls in pickled models)
  • Suspicious TensorFlow operations (PyFunc, file I/O operations)
  • Potentially unsafe Keras Lambda layers with arbitrary code execution
  • Dangerous pickle opcodes (REDUCE, INST, OBJ, STACK_GLOBAL)
  • Custom ONNX operators and external data integrity issues
  • Encoded payloads and suspicious string patterns
  • Risky configurations in model architectures
  • Suspicious patterns in model manifests and configuration files
  • Models with blacklisted names or content patterns
  • Malicious content in ZIP archives including nested archives and zip bombs
  • Container-delivered models in OCI/Docker layers and manifest files
  • GGUF/GGML file integrity and tensor alignment validation
  • Anomalous weight patterns that may indicate trojaned models (statistical analysis)
  • Enhanced joblib/dill security (format validation, compression bombs, embedded pickle analysis, bypass prevention)
  • NumPy array integrity issues (malformed headers, dangerous dtypes)
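
The dangerous pickle opcodes above are worth a closer look. As a rough sketch of how such a check can work (illustrative only, not ModelAudit's actual implementation), the stdlib pickletools module can disassemble a pickle's opcode stream without ever executing it:

```python
import pickle
import pickletools

class Evil:
    """An object whose pickle payload would run a shell command when loaded."""
    def __reduce__(self):
        import os
        return (os.system, ("echo pwned",))

payload = pickle.dumps(Evil())

# genops() disassembles the byte stream; nothing is ever unpickled or run.
DANGEROUS = {"REDUCE", "GLOBAL", "STACK_GLOBAL", "INST", "OBJ"}
hits = [op.name for op, _arg, _pos in pickletools.genops(payload)
        if op.name in DANGEROUS]
print(hits)  # contains REDUCE plus a GLOBAL/STACK_GLOBAL import of os.system
```

Building the payload here is safe because `__reduce__` is only consulted at load time; the scan itself never unpickles anything.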

🚀 Quick Start

Installation

ModelAudit is available on PyPI and requires Python 3.9 or higher.

Basic installation:

pip install modelaudit

With optional dependencies for specific model formats:

# For TensorFlow SavedModel scanning
pip install modelaudit[tensorflow]

# For Keras H5 model scanning
pip install modelaudit[h5]

# For PyTorch model scanning
pip install modelaudit[pytorch]

# For ONNX model scanning
pip install modelaudit[onnx]

# For TensorFlow Lite model scanning
pip install modelaudit[tflite]

# For YAML manifest scanning
pip install modelaudit[yaml]

# For SafeTensors model scanning
pip install modelaudit[safetensors]

# For enhanced pickle support (dill serialization with security validation)
pip install modelaudit[dill]

# For Joblib model scanning (includes scikit-learn integration)
pip install modelaudit[joblib]

# For Flax msgpack scanning
pip install modelaudit[flax]

# Install all optional dependencies
pip install modelaudit[all]

Development installation:

git clone https://github.com/promptfoo/modelaudit.git
cd modelaudit

# Using Rye (recommended)
rye sync --features all

# Or using pip
pip install -e .[all]

Docker installation:

# Pull from GitHub Container Registry
docker pull ghcr.io/promptfoo/modelaudit:latest

# Use specific variants
docker pull ghcr.io/promptfoo/modelaudit:latest-full        # All ML frameworks
docker pull ghcr.io/promptfoo/modelaudit:latest-tensorflow  # TensorFlow only

# Run with Docker
docker run --rm -v $(pwd):/data ghcr.io/promptfoo/modelaudit:latest scan /data/model.pkl

Basic Usage

# Scan a single model
modelaudit scan model.pkl

# Scan an ONNX model
modelaudit scan model.onnx

# Scan multiple models (including enhanced dill/joblib support)
modelaudit scan model1.pkl model2.h5 model3.pt llama-model.gguf model4.joblib model5.dill model6.npy flax-checkpoint.msgpack

# Scan a directory
modelaudit scan ./models/

# Export results to JSON
modelaudit scan model.pkl --format json --output results.json

Example output:

$ modelaudit scan suspicious_model.pkl

──────────────────────────────────────────────────────────────────────────────
ModelAudit Security Scanner
Scanning for potential security issues in ML model files
──────────────────────────────────────────────────────────────────────────────
Paths to scan: suspicious_model.pkl
──────────────────────────────────────────────────────────────────────────────

✓ Scanning suspicious_model.pkl

Active Scanner: pickle
Scan completed in 0.02 seconds
Files scanned: 1
Scanned 156 bytes
Issues found: 1 critical, 1 warning

1. suspicious_model.pkl (pos 28): [CRITICAL] Suspicious module reference found: posix.system
2. suspicious_model.pkl (pos 52): [WARNING] Found REDUCE opcode - potential __reduce__ method execution

──────────────────────────────────────────────────────────────────────────────
✗ Scan completed with findings

✨ Features

Core Capabilities

  • Multiple Format Support: PyTorch (.pt, .pth, .bin), TensorFlow (SavedModel, .pb), Keras (.h5, .hdf5, .keras), SafeTensors (.safetensors), GGUF/GGML (.gguf, .ggml), Pickle (.pkl, .pickle, .ckpt), Joblib (.joblib), NumPy (.npy, .npz), PMML (.pmml), ZIP archives (.zip), Manifests (.json, .yaml, .xml, etc.), Flax (.msgpack, .ckpt)
  • Automatic Format Detection: Identifies model formats automatically
  • Deep Security Analysis: Examines model internals, not just metadata
  • Recursive Archive Scanning: Scans contents of ZIP files and nested archives
  • Batch Processing: Scan multiple files and directories efficiently
  • Configurable Scanning: Set timeouts, file size limits, custom blacklists
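
Automatic format detection like this typically keys on a file's magic bytes rather than its extension. The byte signatures below are the real magics for each container; the sniffing function itself is just an illustrative sketch, not ModelAudit's code:

```python
def sniff_format(header: bytes) -> str:
    """Guess a model file's container format from its first bytes (sketch)."""
    if header.startswith(b"PK\x03\x04"):
        return "zip"       # PyTorch .pt/.pth archives, .keras, generic .zip
    if header.startswith(b"GGUF"):
        return "gguf"
    if header.startswith(b"\x93NUMPY"):
        return "numpy"     # .npy array header
    if header.startswith(b"\x89HDF\r\n\x1a\n"):
        return "hdf5"      # Keras .h5/.hdf5
    if header[:1] == b"\x80":
        return "pickle"    # protocol 2+ pickles start with the PROTO opcode
    return "unknown"

print(sniff_format(b"PK\x03\x04\x14\x00"))  # zip
print(sniff_format(b"\x80\x05\x95..."))     # pickle
```

Extension-based dispatch alone is easy to evade (a pickle renamed to .safetensors, say), which is why sniffing the content matters.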

Reporting & Integration

  • Multiple Output Formats: Human-readable text and machine-readable JSON
  • Detailed Reporting: Scan duration, files processed, bytes scanned, issue severity
  • Severity Levels: CRITICAL, WARNING, INFO, DEBUG for flexible filtering
  • CI/CD Integration: Clear exit codes for automated pipeline integration

Security Detection

  • Code Execution: Detects embedded Python code, eval/exec calls, system commands
  • Pickle Security: Analyzes dangerous opcodes, suspicious imports, encoded payloads
  • Enhanced Dill/Joblib Analysis: ML-aware scanning with format validation and bypass prevention
  • Model Integrity: Checks for unexpected files, suspicious configurations
  • Archive Security: Automatic Zip-Slip protection against directory traversal, zip bombs, malicious nested files
  • Pattern Matching: Custom blacklist patterns for organizational policies
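
The Zip-Slip protection mentioned above amounts to rejecting any archive member whose resolved path escapes the extraction directory. A minimal sketch of the check (illustrative, not ModelAudit's implementation):

```python
import os

def is_safe_member(extract_dir: str, member_name: str) -> bool:
    """Reject archive entries like '../../etc/passwd' or absolute paths."""
    base = os.path.realpath(extract_dir)
    target = os.path.realpath(os.path.join(extract_dir, member_name))
    return target == base or target.startswith(base + os.sep)

print(is_safe_member("/tmp/extract", "model/weights.bin"))  # True
print(is_safe_member("/tmp/extract", "../../etc/passwd"))   # False
```

`os.path.join` silently discards the base directory when the member name is absolute, so the absolute-path case is caught by the same prefix check.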

🛡️ Supported Model Formats

ModelAudit provides specialized security scanners for different model formats:

  • Pickle (.pkl, .pickle, .dill, .bin, .pt, .pth, .ckpt): Malicious code execution, dangerous opcodes, suspicious imports
  • PyTorch Zip (.pt, .pth): Embedded pickle analysis, suspicious files, custom patterns
  • PyTorch Binary (.bin): Binary tensor data analysis, embedded content
  • TensorFlow Lite (.tflite): Extreme tensor shapes, custom ops, FlatBuffer integrity
  • TensorFlow SavedModel (directories, .pb): Suspicious operations, file I/O, Python execution
  • Keras (.h5, .hdf5, .keras): Lambda layers, custom objects, dangerous configurations
  • ONNX (.onnx): Custom operators, external data validation, tensor integrity
  • SafeTensors (.safetensors): Metadata integrity, tensor validation
  • Flax (.msgpack): MessagePack integrity, suspicious code pattern detection, decompression bomb prevention, embedded content analysis
  • GGUF/GGML (.gguf, .ggml): Header validation, tensor integrity, metadata security checks
  • Joblib (.joblib): File format validation, compression bomb detection, embedded pickle analysis, ML-aware security filtering
  • NumPy (.npy, .npz): Array integrity, dangerous dtypes, dimension validation
  • PMML (.pmml): XML well-formedness, external entity checks, suspicious extensions
  • ZIP Archives (.zip): Recursive content scanning, zip bombs, directory traversal
  • Manifests (.json, .yaml, .yml, .xml, .toml, .ini, .cfg, .config, .manifest, .model, .metadata): Suspicious keys, credential exposure, blacklisted patterns

Weight Analysis

ModelAudit can detect anomalous weight patterns that may indicate trojaned models using statistical analysis. This feature is disabled by default for large language models to avoid false positives.
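
ModelAudit's exact statistics aren't spelled out here, but the core idea can be sketched with a simple z-score outlier test over a layer's weights. This is a toy illustration under that assumption, not the tool's actual algorithm; real detectors are considerably more careful:

```python
import statistics

def anomalous_weights(weights, z_threshold=6.0):
    """Flag weights that sit far outside the layer's own distribution."""
    mean = statistics.fmean(weights)
    stdev = statistics.pstdev(weights)
    if stdev == 0:
        return []
    return [w for w in weights if abs(w - mean) / stdev > z_threshold]

# A mostly well-behaved layer with one implausibly large weight:
layer = [0.01 * i for i in range((-50), 51)] + [25.0]
print(anomalous_weights(layer))  # [25.0]
```

Note the caveat from the paragraph above in miniature: on very large, heavy-tailed weight distributions a naive threshold like this produces false positives, which is why the feature is off by default for LLMs.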

⚙️ Advanced Usage

Command Line Options

# Set maximum file size to scan (1GB limit)
modelaudit scan model.pkl --max-file-size 1073741824

# Stop scanning after a total of 5GB has been processed
modelaudit scan models/ --max-total-size 5368709120

# Add custom blacklist patterns
modelaudit scan model.pkl --blacklist "unsafe_model" --blacklist "malicious_net"

# Set scan timeout (5 minutes)
modelaudit scan large_model.pkl --timeout 300

# Verbose output for debugging
modelaudit scan model.pkl --verbose

Exit Codes

ModelAudit uses different exit codes to indicate scan results:

  • 0: Success - No security issues found
  • 1: Security issues found (scan completed successfully)
  • 2: Errors occurred during scanning (e.g., file not found, scan failures)
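
In a Python-driven pipeline, the same gating can be done with subprocess. This is a sketch; the final line substitutes a stand-in command for `modelaudit scan ...` so the snippet is self-contained:

```python
import subprocess
import sys

def gate_on_scan(cmd):
    """Run a scan command and map its exit code to a CI decision."""
    code = subprocess.run(cmd).returncode
    if code == 0:
        return "pass"                # no security issues found
    if code == 1:
        return "fail: issues found"  # scan completed, findings reported
    return "fail: scan error"        # e.g. file not found, scan failure

# Stand-in for ["modelaudit", "scan", "models/"] so this runs anywhere:
print(gate_on_scan([sys.executable, "-c", "raise SystemExit(1)"]))  # fail: issues found
```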

📋 JSON Output Format

When using --format json, ModelAudit outputs structured results:

{
  "scanner_names": ["pickle"],
  "start_time": 1750168822.481906,
  "bytes_scanned": 74,
  "issues": [
    {
      "message": "Found REDUCE opcode - potential __reduce__ method execution",
      "severity": "warning",
      "location": "evil.pickle (pos 71)",
      "details": {
        "position": 71,
        "opcode": "REDUCE",
        "ml_context_confidence": 0.0
      },
      "timestamp": 1750168822.482304
    },
    {
      "message": "Suspicious module reference found: posix.system",
      "severity": "critical",
      "location": "evil.pickle (pos 28)",
      "details": {
        "module": "posix",
        "function": "system",
        "position": 28,
        "opcode": "STACK_GLOBAL",
        "ml_context_confidence": 0.0
      },
      "timestamp": 1750168822.482378
    }
  ],
  "has_errors": false,
  "files_scanned": 1,
  "duration": 0.0005328655242919922
}

Each issue includes a message, severity level (critical, warning, info, debug), location, and scanner-specific details.
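
Scripts can filter the report directly on those fields. A small sketch using the stdlib json module against output shaped like the example above (the embedded report is abridged):

```python
import json

# Abridged report in the shape shown above.
report = json.loads("""
{
  "issues": [
    {"message": "Found REDUCE opcode - potential __reduce__ method execution",
     "severity": "warning", "location": "evil.pickle (pos 71)"},
    {"message": "Suspicious module reference found: posix.system",
     "severity": "critical", "location": "evil.pickle (pos 28)"}
  ],
  "has_errors": false,
  "files_scanned": 1
}
""")

critical = [i for i in report["issues"] if i["severity"] == "critical"]
for issue in critical:
    print(f'{issue["location"]}: {issue["message"]}')
```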

🔄 CI/CD Integration

ModelAudit is designed to integrate seamlessly into CI/CD pipelines with clear exit codes:

  • Exit Code 0: No security issues found
  • Exit Code 1: Security issues found (fails the build)
  • Exit Code 2: Scan errors occurred (fails the build)

Basic Integration

# Install ModelAudit
pip install modelaudit[all]

# Scan models and fail build if issues found
modelaudit scan models/ --format json --output scan-results.json

# Optional: Upload scan-results.json as build artifact

Platform Examples

GitHub Actions:

- name: Scan models
  run: |
    pip install modelaudit[all]
    modelaudit scan models/ --format json --output scan-results.json
    # GitHub Actions runs this step with `bash -e`, so a non-zero exit code
    # (issues found or scan error) fails the step automatically.

GitLab CI:

model-security-scan:
  script:
    - pip install modelaudit[all]
    - modelaudit scan models/ --format json --output results.json
  artifacts:
    paths: [results.json]

Jenkins:

sh 'pip install modelaudit[all]'
sh 'modelaudit scan models/ --format json --output results.json'

🔧 Troubleshooting

Common Issues

Installation Problems:

# If you get dependency conflicts
pip install --upgrade pip setuptools wheel
pip install modelaudit[all] --no-cache-dir

# For development installs (from a cloned repo), use Rye
rye sync --features all

# Or an editable pip install
pip install -e .[all]

# If optional dependencies fail, install base package first
pip install modelaudit
pip install tensorflow h5py torch pyyaml safetensors onnx joblib  # Add what you need

Large Models:

# Increase file size limit and timeout for large models
modelaudit scan large_model.pt --max-file-size 5000000000 --timeout 600 --max-total-size 10000000000

Testing:

# Run all tests
rye run pytest

# Run with coverage
rye run pytest --cov=modelaudit

# Run specific test categories
rye run pytest tests/test_pickle_scanner.py -v
rye run pytest tests/test_integration.py -v

# Run tests with all optional dependencies
rye sync --features all
rye run pytest

# Run comprehensive migration test (tests everything including Docker)
./test_migration.sh

Debug Mode:

# Enable verbose output for troubleshooting
modelaudit scan model.pkl --verbose

Development Commands:

# Run linting and formatting with Ruff
rye run ruff check modelaudit/          # Check for linting issues
rye run ruff check --fix modelaudit/    # Fix auto-fixable issues
rye run ruff format modelaudit/         # Format code

# Type checking
rye run mypy modelaudit/

# Build package
rye build

# Publish (maintainers only)
rye publish

Getting Help:

  • Use --verbose for detailed output
  • Use --format json to see all details
  • Check file permissions and format support
  • Report issues on the promptfoo GitHub repository

⚠️ Limitations

ModelAudit is designed to find obvious security risks in model files, including direct code execution attempts, known dangerous patterns, malicious archive structures, and suspicious configurations.

What it cannot detect:

  • Advanced adversarial attacks or subtle weight manipulation
  • Heavily encoded/encrypted malicious payloads
  • Runtime behavior that only triggers under specific conditions
  • Model poisoning through careful data manipulation

Recommendations:

  • Use ModelAudit as one layer of your security strategy
  • Review flagged issues manually - not all warnings indicate malicious intent
  • Combine with other security practices like sandboxed execution and runtime monitoring
  • Implement automated scanning in CI/CD pipelines

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.



Download files

Source Distribution

modelaudit-0.1.4.tar.gz (149.3 kB)

  • Uploaded via: twine/6.1.0 CPython/3.11.4 (Trusted Publishing: no)
  • SHA256: 79cc0090be0e3f37ab33fe5c14c7a15a74cdb5f70c5a5aff9bc213164f437bc5
  • MD5: 15af8562edd3237777c70520403bbe37
  • BLAKE2b-256: 3fc114073b3cd1371c758bbd48419748438a04e5d5f182606b160d8700a568f7

Built Distribution

modelaudit-0.1.4-py3-none-any.whl (88.8 kB)

  • Uploaded via: twine/6.1.0 CPython/3.11.4 (Trusted Publishing: no)
  • SHA256: 077630522c685b484918d00aab47d6477c67406f488e16f03d2c8d4d93290d00
  • MD5: b9eaf0f005acb3f4f5469e7f4e91b70a
  • BLAKE2b-256: 03b376087e697664cd3a4f0b6b83dd49f3a05f1882aa7cd359e1d7f9c4daa309
