SkillFortify — Formal verification and supply chain security for agentic AI skills

These details have not been verified by PyPI

Project description

SkillFortify

Formal verification for agent skill supply chains -- mathematical proof that your skills are safe.

The Problem

In January 2026, the ClawHavoc campaign infiltrated 1,200+ malicious skills into agent marketplaces. A month later, researchers catalogued 6,487 malicious agent tools -- and showed that conventional virus scanners miss the vast majority of them. CVE-2026-25253 demonstrated remote code execution through a single compromised skill.

Every existing defense tool relies on heuristic pattern matching: YARA rules, LLM-as-judge scoring, or regex-based scanning. They are better than nothing, but they share a fundamental limitation: absence of findings does not mean absence of risk. A sophisticated attacker can evade every heuristic scanner on the market today.

SkillFortify takes a different approach. Instead of guessing whether a skill is safe, SkillFortify formally analyzes skill safety using sound static analysis -- if SkillFortify reports no violations, the capability bounds in the formal model are assured. Using formal verification techniques adapted from supply chain security research, SkillFortify constructs a mathematical model of what each skill can and cannot do -- and verifies that model against declared capabilities. Zero false positives on the benchmark suite. The same rigor that protects critical infrastructure, applied to the agent skill ecosystem.

Quick Start

pip install skillfortify

Scan your agent project for security issues:

skillfortify scan ./my-agent-project

┌─────────────────────────────────────────────────────────────────┐
│                      SkillFortify Scan Results                        │
├──────────────────────┬────────┬────────┬──────────┬─────────────┤
│ Skill                │ Format │ Status │ Findings │ Max Severity│
├──────────────────────┼────────┼────────┼──────────┼─────────────┤
│ deploy-automation    │ -      │  SAFE  │        0 │ -           │
│ data-export          │ -      │ UNSAFE │        2 │ HIGH        │
│ weather-lookup       │ -      │  SAFE  │        0 │ -           │
└──────────────────────┴────────┴────────┴──────────┴─────────────┘
3 skills scanned | 2 safe | 1 unsafe | 2 total findings

What SkillFortify Does

`skillfortify scan <path>` -- Discover and analyze skills

Auto-detects all agent skills in your project directory across supported formats. Runs formal static analysis and reports security findings ranked by severity.

skillfortify scan . --format json              # Machine-readable output
skillfortify scan . --severity-threshold high  # Only show HIGH and CRITICAL

`skillfortify verify <skill>` -- Formal verification of a single skill

Deep analysis of one skill file with full capability inference, including POLA (Principle of Least Authority) compliance checks.

skillfortify verify .claude/skills/deploy.md

┌───────────────────────────────────────────────────────┐
│ Skill: deploy-automation   Status: SAFE               │
├───────────────────────────────────────────────────────┤
│                  Inferred Capabilities                 │
├───────────────────────┬───────────────────────────────┤
│ Resource              │ Access Level                  │
├───────────────────────┼───────────────────────────────┤
│ filesystem            │ READ                          │
│ network               │ READ                          │
└───────────────────────┴───────────────────────────────┘
No findings. Skill passed all checks.

`skillfortify lock <path>` -- Generate skill-lock.json

Creates a deterministic lockfile pinning every skill to its exact version and content hash. Guarantees reproducible agent configurations across environments -- the same way package lockfiles work for traditional dependencies.

skillfortify lock ./my-agent-project
skillfortify lock ./my-agent-project -o custom-lock.json

`skillfortify trust <skill>` -- Trust score computation

Computes a multi-signal trust score combining provenance, behavioral analysis, community signals, and historical record. Maps to graduated trust levels inspired by the SLSA framework.

skillfortify trust .claude/skills/deploy.md

┌───────────────────────────────────────────────────────┐
│ Skill: deploy-automation   Version: 1.0.0             │
├───────────────────────────────────────────────────────┤
│   Intrinsic Score: 0.750                              │
│   Effective Score: 0.750                              │
│   Trust Level:     FORMALLY_VERIFIED                  │
├───────────────────────────────────────────────────────┤
│                   Signal Breakdown                     │
├───────────────────────┬───────────────────────────────┤
│ Provenance            │ 0.500                         │
│ Behavioral            │ 1.000                         │
│ Community             │ 0.500                         │
│ Historical            │ 0.500                         │
└───────────────────────┴───────────────────────────────┘

`skillfortify sbom <path>` -- CycloneDX ASBOM generation

Generates a CycloneDX 1.6 Agent Skill Bill of Materials (ASBOM) for compliance reporting and audit trails. Includes skill inventory, capability declarations, trust scores, and security findings.

skillfortify sbom ./my-agent-project
skillfortify sbom ./my-agent-project --project-name "prod-agent" --project-version "2.1.0"

How It's Different

Feature	SkillFortify	Heuristic Scanners
Verification approach	Formal static analysis with sound capability model	Pattern matching, YARA rules, LLM judges
False positive rate	0% on benchmark suite	Variable, often high
Guarantee semantics	Formal bounds on skill capabilities	"No findings" != "no risk"
Dependency resolution	Constraint-based resolution	Not available
Lockfile generation	Deterministic `skill-lock.json`	Not available
Trust scoring	Multi-signal algebraic model	Not available
SBOM generation	CycloneDX 1.6 ASBOM	Not available
Capability inference	Formal capability model	Ad-hoc
Reproducible configs	Integrity-verified lockfiles	Not available

Supported Formats

SkillFortify auto-detects and analyzes skills across the three major agent skill ecosystems:

Format	Detected From	Skill Location
Claude Code Skills	`.claude/` directory	`.claude/skills/*.md`
MCP Servers	`mcp.json` or `mcp_config.json`	Server configurations
OpenClaw Skills	`.claw/` directory	`.claw/*/`

All formats are parsed into a unified representation for consistent analysis, trust scoring, and SBOM generation.

Benchmark Results

Evaluated on SkillFortifyBench -- a curated dataset of 540 agent skills (clean and malicious samples sourced from documented real-world incidents):

Metric	Value
Precision	100% (zero false positives)
Recall	94.12%
F1 Score	96.95%
Average scan time	2.55 ms per skill

Zero false positives on the benchmark suite means SkillFortify did not flag any safe skill as malicious across 540 test cases. When it reports a skill as unsafe, that finding is backed by formal analysis of the skill's capability bounds, not a heuristic guess.

CI/CD Integration

GitHub Actions

name: Skill Security Scan
on: [push, pull_request]

jobs:
  skillfortify-scan:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - name: Set up Python
        uses: actions/setup-python@v5
        with:
          python-version: "3.11"

      - name: Install SkillFortify
        run: pip install skillfortify

      - name: Scan agent skills
        run: skillfortify scan . --format json

      - name: Verify lockfile integrity
        run: skillfortify lock . --output /tmp/fresh-lock.json

Exit Codes

All SkillFortify commands use consistent exit codes for CI/CD integration:

Code	Meaning
`0`	Success -- all checks passed
`1`	Findings detected -- one or more skills have security issues
`2`	No skills found or parse error

Trust Levels

SkillFortify assigns graduated trust levels to every skill, inspired by the SLSA framework for software supply chain integrity:

Level	Threshold	Meaning
FORMALLY_VERIFIED	>= 0.75	Highest assurance. Formal analysis passed, strong provenance, active community trust
COMMUNITY_VERIFIED	>= 0.50	Multiple positive signals. Community reviewed, usage history, basic behavioral checks
SIGNED	>= 0.25	Basic provenance established. Author signed, but limited community verification
UNSIGNED	< 0.25	No verification. Treat with extreme caution

Documentation

Document	Description
Getting Started	Installation, first scan, and walkthrough
CLI Reference	Complete command documentation
Lockfile Format	`skill-lock.json` specification
ASBOM Output	CycloneDX ASBOM format and compliance

Requirements

Python 3.11 or later
No external services required -- SkillFortify runs entirely offline
Works on Linux, macOS, and Windows

Academic Paper

"Formal Analysis and Supply Chain Security for Agentic AI Skills"

SkillFortify is backed by peer-reviewed research with five formal theorems and full proofs, formalizing the agent skill supply chain threat model, capability verification, trust algebra, and dependency resolution.

Read the paper on Zenodo → | DOI: 10.5281/zenodo.18787663

Part of the AgentAssert suite — building the formal foundations for trustworthy AI agents.

Author

Varun Pratap Bhardwaj — Solution Architect with 15+ years in enterprise technology. Dual qualifications in technology and law (LL.B.), with a focus on formal methods for AI safety and regulatory compliance for autonomous systems.

Research: Formal methods for AI agent safety, behavioral contracts, supply chain security
Prior work: AgentAssert (design-by-contract for AI agents), SuperLocalMemory (privacy-preserving agent memory)
Contact: varun.pratap.bhardwaj@gmail.com
ORCID: 0009-0002-8726-4289

License

MIT License. See LICENSE for details.

Citation

If you use SkillFortify in your research, please cite:

@software{bhardwaj2026skillfortify,
  author    = {Bhardwaj, Varun Pratap},
  title     = {SkillFortify: Formal Analysis and Supply Chain Security for Agentic AI Skills},
  year      = {2026},
  doi       = {10.5281/zenodo.18787663},
  publisher = {Zenodo},
  url       = {https://doi.org/10.5281/zenodo.18787663}
}

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.0

Apr 10, 2026

0.3.3

Mar 6, 2026

0.3.2

Mar 3, 2026

0.3.1

Feb 28, 2026

0.3.0

Feb 28, 2026

0.2.0

Feb 26, 2026

This version

0.1.0

Feb 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skillfortify-0.1.0.tar.gz (134.9 kB view details)

Uploaded Feb 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

skillfortify-0.1.0-py3-none-any.whl (91.1 kB view details)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file skillfortify-0.1.0.tar.gz.

File metadata

Download URL: skillfortify-0.1.0.tar.gz
Upload date: Feb 26, 2026
Size: 134.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for skillfortify-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`9d3d8785b08f168e4f47a418036ba1e34e6ccd6f640d7d6e04ce346c65d0fad5`
MD5	`2865eaf430adecf10371945f6e842be7`
BLAKE2b-256	`a63b4ec74e0fe67e284957205e51c5fb93d9c84d506af22eab0ac41e4c7e5f2b`

See more details on using hashes here.

File details

Details for the file skillfortify-0.1.0-py3-none-any.whl.

File metadata

Download URL: skillfortify-0.1.0-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 91.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for skillfortify-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`936e70314b44d00c7eb78164c2306ccbcb8ed027256ea8674c052670920f3df9`
MD5	`4246e4f306a4db9857619d964ed07928`
BLAKE2b-256	`f49109bfef5ac62aba17b5f3f7a33a6793f77cecf046921e49869ce77b66929c`

See more details on using hashes here.

skillfortify 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

SkillFortify

The Problem

Quick Start

What SkillFortify Does

skillfortify scan <path> -- Discover and analyze skills

skillfortify verify <skill> -- Formal verification of a single skill

skillfortify lock <path> -- Generate skill-lock.json

skillfortify trust <skill> -- Trust score computation

skillfortify sbom <path> -- CycloneDX ASBOM generation

How It's Different

Supported Formats

Benchmark Results

CI/CD Integration

GitHub Actions

Exit Codes

Trust Levels

Documentation

Requirements

Academic Paper

Author

License

Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`skillfortify scan <path>` -- Discover and analyze skills

`skillfortify verify <skill>` -- Formal verification of a single skill

`skillfortify lock <path>` -- Generate skill-lock.json

`skillfortify trust <skill>` -- Trust score computation

`skillfortify sbom <path>` -- CycloneDX ASBOM generation