Skip to main content

Community-driven AI security audit tool using interpretability techniques

Project description

CI PyPI License

Community AI Security Audit Tool

A community-driven tool for auditing AI models for security vulnerabilities using interpretability techniques. Built for cybersecurity researchers, ML engineers, and AI safety enthusiasts.

Features

  • Vulnerability Scanning — Detect backdoors, adversarial vulnerabilities, data poisoning, model stealing risks, and membership inference.
  • Interpretability Analysis — Apply integrated gradients, LIME, activation clustering, and more to explain model behavior.
  • Risk Scoring & Reporting — Generate actionable reports (Markdown, JSON, HTML, SARIF) with severity scores and mitigation suggestions.
  • Plug & Play — Swap any model provider, SIEM, or scanner without touching the core.
  • Community Sharing — Share audit results, signatures, and detection rules via a lightweight database or git-based sharing.

Installation

git clone https://github.com/your-org/community-ai-audit.git
cd community-ai-audit
pip install -e .

Quick Start

# Discover all components
community-ai-audit discover

# Scan a local PyTorch model
community-ai-audit scan my_model.pt --provider local --scanners adversarial --probe-file examples/data/toy_probe.json

# Full audit with interpretability
community-ai-audit audit my_model.pt --provider local --profile standard --scanners adversarial backdoor --interpreters integrated-gradients --probe-file examples/data/toy_probe.json --input '[0.1,...'

# Generate report in HTML
community-ai-audit audit my_model.pt --provider local --output html --save report.html

Supported Providers

Adapter Type Works Offline Setup
local PyTorch, TF, ONNX ✅ Yes Install torch/tensorflow
huggingface transformers, diffusers ✅ Yes Install transformers
openai GPT-4o, o1, etc. ❌ No Set OPENAI_API_KEY
anthropic Claude-3, etc. ❌ No Set ANTHROPIC_API_KEY
aws_bedrock All Bedrock models ❌ No Set AWS creds
ollama Local LLMs ✅ Yes Install ollama

Supported SIEMs

Connector Platform Auth
splunk Splunk HEC Token
elastic Elastic Security API Key
datadog Datadog Logs API Key
sentinel Microsoft Sentinel Shared Key

Architecture

┌─────────────┐     ┌──────────────┐     ┌─────────────┐
│   Model     │────→│  Scanner(s)  │────→│  Findings   │
│   Adapter   │     └──────────────┘     └─────────────┘
│             │
│             │────→│ Interpreter  │────→│Attributions │
│             │     └──────────────┘     └─────────────┘
└─────────────┘              │                   │
                              ↓                   ↓
                         ┌────────────┐
                         │  Reporter  │
                         │ (md/json/  │
                         │  html/sf)  │
                         └────────────┘

[CLI] → [AuditEngine] → [Plugins] → [SIEM Connector]

See docs/ARCHITECTURE.md for the full architecture.

Documentation

Guide Purpose
ADAPTER_GUIDE.md Add a new model provider (30 min)
SCANNER_GUIDE.md Add a new vulnerability scanner (30 min)
CONNECTOR_GUIDE.md Add a new SIEM connector (30 min)
INTERPRETER_GUIDE.md Add a new interpreter (30 min)
PLUGIN_GUIDE.md Quick reference for all plugins
ARCHITECTURE.md System design and data flow
CONTRIBUTING.md Setup, workflow, PR checklist
PHASE1_BENCHMARK.md Benchmark reproducibility
PHASE2_CONNECTORS.md SIEM integration guide

Project Status

🚧 Pre-release (v0.1.0) — Core infrastructure in progress.

Contributing

See CONTRIBUTING.md for setup instructions and guidelines.

License

MIT License — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

community_ai_audit-0.1.1.tar.gz (59.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

community_ai_audit-0.1.1-py3-none-any.whl (73.4 kB view details)

Uploaded Python 3

File details

Details for the file community_ai_audit-0.1.1.tar.gz.

File metadata

  • Download URL: community_ai_audit-0.1.1.tar.gz
  • Upload date:
  • Size: 59.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for community_ai_audit-0.1.1.tar.gz
Algorithm Hash digest
SHA256 d1327afa7c242b84d9053469cd9bd740999e2a42b1aa9d9ca08e39644009b97b
MD5 685e336676e5b3b68fd4c6cf69b2f918
BLAKE2b-256 1952b3c2e61a79186995f761dacdde577c9559fce62e235d201120330af6b807

See more details on using hashes here.

File details

Details for the file community_ai_audit-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for community_ai_audit-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0cf446907714b821f16f9163b83420a9cbf16ebf28110f95e1b9b1b0a76ca9a9
MD5 1779287c9e6d301d293b61d77eda7c37
BLAKE2b-256 11817dd18a4363bdb89f4aa96a4b7d70b967c1c7cbcc70f4d6f563c19109b2af

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page