Skip to main content

Community-driven AI security audit tool using interpretability techniques

Project description

CI PyPI License

Community AI Security Audit Tool

A community-driven tool for auditing AI models for security vulnerabilities using interpretability techniques. Built for cybersecurity researchers, ML engineers, and AI safety enthusiasts.

Features

  • Vulnerability Scanning — Detect backdoors, adversarial vulnerabilities, data poisoning, model stealing risks, and membership inference.
  • Interpretability Analysis — Apply integrated gradients, LIME, activation clustering, and more to explain model behavior.
  • Risk Scoring & Reporting — Generate actionable reports (Markdown, JSON, HTML, SARIF) with severity scores and mitigation suggestions.
  • Plug & Play — Swap any model provider, SIEM, or scanner without touching the core.
  • Community Sharing — Share audit results, signatures, and detection rules via a lightweight database or git-based sharing.

Installation

git clone https://github.com/your-org/community-ai-audit.git
cd community-ai-audit
pip install -e .

Quick Start

# Discover all components
community-ai-audit discover

# Scan a local PyTorch model
community-ai-audit scan my_model.pt --provider local --scanners adversarial --probe-file examples/data/toy_probe.json

# Full audit with interpretability
community-ai-audit audit my_model.pt --provider local --profile standard --scanners adversarial backdoor --interpreters integrated-gradients --probe-file examples/data/toy_probe.json --input '[0.1,...'

# Generate report in HTML
community-ai-audit audit my_model.pt --provider local --output html --save report.html

Supported Providers

Adapter Type Works Offline Setup
local PyTorch, TF, ONNX ✅ Yes Install torch/tensorflow
huggingface transformers, diffusers ✅ Yes Install transformers
openai GPT-4o, o1, etc. ❌ No Set OPENAI_API_KEY
anthropic Claude-3, etc. ❌ No Set ANTHROPIC_API_KEY
aws_bedrock All Bedrock models ❌ No Set AWS creds
ollama Local LLMs ✅ Yes Install ollama

Supported SIEMs

Connector Platform Auth
splunk Splunk HEC Token
elastic Elastic Security API Key
datadog Datadog Logs API Key
sentinel Microsoft Sentinel Shared Key

Architecture

┌─────────────┐     ┌──────────────┐     ┌─────────────┐
│   Model     │────→│  Scanner(s)  │────→│  Findings   │
│   Adapter   │     └──────────────┘     └─────────────┘
│             │
│             │────→│ Interpreter  │────→│Attributions │
│             │     └──────────────┘     └─────────────┘
└─────────────┘              │                   │
                              ↓                   ↓
                         ┌────────────┐
                         │  Reporter  │
                         │ (md/json/  │
                         │  html/sf)  │
                         └────────────┘

[CLI] → [AuditEngine] → [Plugins] → [SIEM Connector]

See docs/ARCHITECTURE.md for the full architecture.

Documentation

Guide Purpose
ADAPTER_GUIDE.md Add a new model provider (30 min)
SCANNER_GUIDE.md Add a new vulnerability scanner (30 min)
CONNECTOR_GUIDE.md Add a new SIEM connector (30 min)
INTERPRETER_GUIDE.md Add a new interpreter (30 min)
PLUGIN_GUIDE.md Quick reference for all plugins
ARCHITECTURE.md System design and data flow
CONTRIBUTING.md Setup, workflow, PR checklist
PHASE1_BENCHMARK.md Benchmark reproducibility
PHASE2_CONNECTORS.md SIEM integration guide

Project Status

🚧 Pre-release (v0.1.0) — Core infrastructure in progress.

Contributing

See CONTRIBUTING.md for setup instructions and guidelines.

License

MIT License — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

community_ai_audit-0.1.0.tar.gz (59.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

community_ai_audit-0.1.0-py3-none-any.whl (73.4 kB view details)

Uploaded Python 3

File details

Details for the file community_ai_audit-0.1.0.tar.gz.

File metadata

  • Download URL: community_ai_audit-0.1.0.tar.gz
  • Upload date:
  • Size: 59.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for community_ai_audit-0.1.0.tar.gz
Algorithm Hash digest
SHA256 5db0c7e564fecef9a88e466850b3bebf2a2b00a282d2a956ed801a3a7e82d888
MD5 d894051fe5381af4a1634b5251d28085
BLAKE2b-256 cc890fa6d0746da9fccbbda9c5c31489707cf27c5b68b2979ae93d4ecda17242

See more details on using hashes here.

File details

Details for the file community_ai_audit-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for community_ai_audit-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c49c483df0f554991c0d982e9865ba158577ec6c9982f42a097bc34b9f9b259c
MD5 f4ec0cf85500addeda8959abf2faca56
BLAKE2b-256 edd123d179b70dd88bb427c702c7f17801a11fdd738d6ecd2e2aa621484f30c3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page