Skip to main content

Scan LLM agent skill directories for credential leakage

Project description

scankii

Buy Me A Coffee

A simple, local-first security scanner for AI Agents.

What does it do?

When you build or use an AI Agent (like a custom ChatGPT bot or AutoGen agent), you give it "skills." A skill is just a combination of Python code and English instructions.

Standard security scanners only check your Python code. But what if your English instructions accidentally tell the AI to print or expose a secret password?

scankii solves this by reading both your English instructions and your Python code at the same time. It spots dangerous interactions where the prompt tricks the code into giving away your API keys.

Table of Contents

Research vs. scankii

The original paper introduced the problem through an empirical study of thousands of agent skills.

scankii brings those ideas into a developer-friendly static analysis tool that runs locally, integrates with CI/CD, and provides actionable fixes before deployment.

Research → Tool

What does it work with?

scankii is framework-agnostic. It analyzes your raw Python code and Markdown text, which means it works seamlessly with any AI architecture or ecosystem:

  • Agent Frameworks: LangChain, AutoGen, CrewAI, Semantic Kernel, LlamaIndex, OpenAI Tools.
  • AI Coding Assistants: Cursor IDE, Google Antigravity, Claude Code (scan your .cursorrules or custom agent instructions to ensure they don't introduce vulnerabilities).
  • LLMs: OpenAI GPT-4, Claude 3.5, Gemini, Llama 3 (the leaks happen in the framework's execution layer, independent of the model itself).
  • IDEs: Because scankii exports standard SARIF reports, you can view the security warnings natively inside VS Code, Cursor, or GitHub Advanced Security.

The Problem: Cross-Modal Leakage

In modern LLM agent architectures, agents read natural language instructions and execute code. This creates a unique vulnerability:

  1. The Code is "Safe": The source code might securely read an API key from the environment and use it.
  2. The Markdown is "Safe": The SKILL.md might benignly explain how to use the skill.
  3. The Intersection is Vulnerable: If the SKILL.md instructs the agent to pass a credential to a function, and that function prints it for debugging, the agent framework captures that stdout and injects it back into the LLM context window. The secret is now exposed to prompt injection attacks.

scankii is an open-source scanner purpose-built to detect these cross-modal vulnerabilities. It correlates natural language prompts with Abstract Syntax Tree (AST) analysis to catch data leaks before your agent skills hit production.

How scankii works

scankii employs a dual-engine static analysis pipeline to evaluate both the instructional and executable components of an agent skill simultaneously.

graph TD
    subgraph "scankii Pipeline"
        direction TB
        
        subgraph "1. Static Analysis"
            A[SKILL.md] -->|Natural Language| B[NL Semantic Analyzer]
            C[Source Code] -->|AST Parsing| D[AST Syntax Analyzer]
        end
        
        subgraph "2. Cross-Modal Correlation"
            B -->|Extracted Intents| E{Cross-Modal Engine}
            D -->|Variable Sinks| E
        end
        
        subgraph "3. Scoring & Reporting"
            E -->|Unmatched Findings| F[Scorer]
            E -->|Correlated Leaks| F
            F -->|Severity Assessment| G[Reporters]
        end
    end
    
    G --> H((Terminal UI))
    G --> I((JSON))
    G --> J((SARIF))
  1. NL Semantic Analyzer: Uses constrained pattern matching to scan SKILL.md for prompt injections, social engineering, and instructions that mandate the passing of credentials.
  2. AST Syntax Analyzer: Parses the source code to build an Abstract Syntax Tree. It tracks variables and detects if they flow into dangerous sinks like print(), file I/O, or unauthenticated network requests.
  3. Cross-Modal Engine: Correlates findings from both engines. If the SKILL.md instructs passing an API key, and the code prints that parameter to stdout, the engine escalates it as a high-severity cross-modal leak.
  4. Scorer: Applies a multiplicative scoring model based on exploitability, channel risk, and credential type to determine the final severity (LOW to CRITICAL).

Demo

$ scankii scan examples/vulnerable-skill --explain

┏━━━━━━━━┳━━━━━━┳━━━━━━━━━━━━━━━━━━┳━━━━━━━━━┳━━━━━━━━━━┓
┃ File   ┃ Line ┃ Pattern          ┃ Channel ┃ Severity ┃
┡━━━━━━━━┇━━━━━━┇━━━━━━━━━━━━━━━━━━┇━━━━━━━━━┇━━━━━━━━━━┩
│ run.py │    7 │ Cross-Modal Leak │ stdout  │  MEDIUM  │
│ run.py │    8 │ Cross-Modal Leak │ network │ CRITICAL │
└────────┴──────┴──────────────────┴─────────┴──────────┘

  Total: 2  (CRITICAL: 1, MEDIUM: 1)

━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
🚨 CRITICAL — Information Exposure via network
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

Pattern:   Information Exposure
Channel:   network
File:      run.py, line 8
Score:     5.04

  Attack Flow:
    print(f"Using key: {api_key}")  ← sinks to stdout
    ↓
    stdout ← captured by agent framework
    ↓
    LLM context window ← credential queryable via natural language

  Attack Flow:
    requests.get(url, params={"appid": api_key}) ← credential in network call
    ↓
    network ← transmitted to external API
    ↓
    Exposed in transit or server logs

  Suggested Fix:
    Replace:  hardcoded credential in network call
    With:     Read credential from environment variable
              import os
              api_key = os.environ.get('API_KEY')

╭──────────────────────────────╮
│     Scan Summary             │
│  ┏━━━━━━━━━━┳━━━━━━━┓        │
│  ┃ Severity ┃ Count ┃        │
│  ┡━━━━━━━━━━╇━━━━━━━┩        │
│  │ CRITICAL │     1 │        │
│  │ HIGH     │     0 │        │
│  │ MEDIUM   │     1 │        │
│  │ LOW      │     0 │        │
│  │ TOTAL    │     2 │        │
│  └──────────┴───────┘        │
╰──────────────────────────────╯

Install

pip install scankii

Benchmark

Evaluated against the SkillLeakBench dataset (Chen et al., ASE 2026) — the same 520 labeled skills used in the original paper.

Metric scankii (Static)
Precision 100.0%
Recall 68.2%
F1 81.1%
Setup pip install scankii
Skills evaluated 520

scankii is a static-only tool. The paper's pipeline uses dynamic sandbox execution with mock credentials. The remaining recall gap reflects this difference — patterns requiring runtime behavior (interprocedural flows, dynamic credential construction) are on our roadmap. See #42.

scankii is inspired by the methodology of the academic pipeline, but redesigned as a fast, static analysis tool with zero infrastructure overhead. Just install and scan locally.

Usage

scankii runs 100% locally. Your code and proprietary agent skills never leave your machine.

Scan a skill directory (default terminal output)

scankii scan ./my-skill/

Scan with detailed attack flow explanation

scankii scan ./my-skill/ --explain

Export findings as JSON

scankii scan ./my-skill/ --format json

Auto-Fix Vulnerabilities

Automatically rewrites your code to use scankii.runtime.safe_print instead of dangerous standard functions:

scankii scan ./my-skill/ --resolve

Export findings as SARIF (for GitHub Code Scanning)

scankii scan ./my-skill/ --format sarif

What It Detects

# Pattern Description Example
1 Hardcoded API Keys OpenAI, Groq, AWS, GitHub, Google, Slack keys in source API_KEY = "sk-proj-..."
2 Credential-to-Stdout Credentials passed to print(), console.log() print(f"key={api_key}")
3 Credential-to-Network Credentials sent via requests.post(), fetch() requests.post(url, data=token)
4 Cross-Modal Leak SKILL.md instructs agent to pass credential to function that sinks it SKILL.md says "pass api_key" + code has print(api_key)
5 Prompt Injection NL instructions to override safety, ignore prior context "Ignore previous instructions and..."
6 Social Engineering NL patterns soliciting credentials from users "Paste your API key here"
7 Connection String Exposure MongoDB, PostgreSQL, MySQL URIs with embedded passwords mongodb://user:pass@host/db
8 Private Key Exposure RSA/EC private key blocks in source files -----BEGIN RSA PRIVATE KEY-----
9 Reverse Shell / RCE Reverse shells, `curl bash`, base64 obfuscation
10 Credential Theft Reading .env, .aws/credentials, ~/.ssh/id_rsa + exfil open(".aws/credentials").read()

Why Not TruffleHog / GitLeaks / detect-secrets?

Feature TruffleHog GitLeaks detect-secrets scankii
Regex secret scanning
Git history scanning
SKILL.md NL analysis
Cross-modal detection
AST-based sink tracking
stdout→LLM flow detection
Attack flow visualization
Prompt injection detection
Credential redaction runtime
SARIF output

Existing tools scan your code for static secrets. scankii is purpose-built for LLM agent skills, focusing on the intersection of natural language and code execution.

scankii.runtime: The Cure

Finding vulnerabilities is only half the battle. scankii includes a built-in runtime library that acts as a drop-in replacement for print() and Python logging. It automatically redacts credentials before they reach stdout (and therefore the LLM context window).

from scankii.runtime.safe_logger import SafeLogger, safe_print

logger = SafeLogger()
logger.info(f"Using key: {api_key}")
# Output: INFO: Using key: sk-[REDACTED]

safe_print(f"Token: {token}")
# Output: Token: ghp-[REDACTED]

Enterprise Integrations

GitHub Action

Add to your workflow to scan skills on every PR and upload results to GitHub Code Scanning:

name: Skill Guard
on: [push, pull_request]

jobs:
  scan:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: scankii/scankii@v1
        with:
          path: ./skills/
          severity-threshold: high
          sarif-upload: true
          fail-on-findings: true

Pre-commit Hook

Stop secrets from being committed locally. Add to .pre-commit-config.yaml:

repos:
  - repo: https://github.com/ashp15205/scankii
    rev: v1.1.0
    hooks:
      - id: scankii
        name: scankii
        entry: hooks/pre-commit
        language: script
        types: [file]
        files: '\.(md|py|js|ts)$'

Using the Secure Template

Copy our hardened SKILL.md template to start building a new skill securely from day one:

cp templates/SKILL.md.template my-new-skill/SKILL.md

The template includes:

  • Inline security comments explaining what NOT to do
  • Correct credential handling patterns (environment variables only)
  • A security checklist to verify before publishing

Contributing

  1. Fork the repository
  2. Create a feature branch: git checkout -b feature/my-feature
  3. Write tests for your changes
  4. Ensure all tests pass: pytest tests/ -v
  5. Run scankii on the repo: scankii scan .
  6. Submit a pull request

Development Setup

git clone https://github.com/ashp15205/scankii.git
cd scankii
pip install -e ".[dev]"
pytest tests/ -v

Acknowledgments

The 10-pattern leakage taxonomy and benchmark methodology in scankii are based on the empirical research in:

Chen et al., "How Your Credentials Are Leaked by LLM Agent Skills: An Empirical Study" (ASE 2026). Dataset | arXiv:2604.03070

scankii is an independent open-source tool and is not affiliated with the paper's authors or institutions.

Support

If you find scankii useful in your workflow, consider buying me a coffee to support open-source security tools! ☕️ Buy Me A Coffee

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scankii-1.1.0.tar.gz (40.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scankii-1.1.0-py3-none-any.whl (32.0 kB view details)

Uploaded Python 3

File details

Details for the file scankii-1.1.0.tar.gz.

File metadata

  • Download URL: scankii-1.1.0.tar.gz
  • Upload date:
  • Size: 40.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for scankii-1.1.0.tar.gz
Algorithm Hash digest
SHA256 9d2fae11ac7a90dca2455f71ffce04377c3f9356f4333eda704cdc9709f589d6
MD5 d878fbf6e11b4883f80b059ee0311ab2
BLAKE2b-256 88a9c13e6430399d79c6e96d863824d7a9f1a07c6edfa72145c26135221c2ac9

See more details on using hashes here.

File details

Details for the file scankii-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: scankii-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 32.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for scankii-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 85a64a8b83e6c05ddced498d3b883a2f24ccddb8fc2c2144333a6250282a6c75
MD5 f5c4cd0e5b84db65d79a27871fb6d8b7
BLAKE2b-256 5ba177af83376ad16693f54b9349670911e8ccf0c4b411084256ea4f8c23c0de

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page