skillsops

SkillsOps — governance CLI for agent skills. Validate, evaluate, optimize, publish, and distribute skills across any IDE.

These details have not been verified by PyPI

Project description

skillsops logo

What kubectl does for Kubernetes, SkillsOps does for agent skills.

Python Coverage Type Checked License Security

The problem

Agent skills are spreading fast — code review skills, deployment skills, IaC security skills — but there's no quality gate between "someone wrote a SKILL.md" and "it's running in production." Teams end up with:

Skills with hardcoded secrets or prompt injection patterns that nobody catches
No way to answer "does this skill actually help?" with data
The same skill copy-pasted across Claude Code, Cursor, Windsurf, and Kiro
Breaking changes that ship without warning or versioning

SkillsOps is the governance layer. One CLI to validate, evaluate, optimize, publish, and distribute agent skills across any IDE and any runtime.

Get started in 60 seconds

pip install skillsops

# Create your first skill
skillctl create skill my-org/code-reviewer
# Edit SKILL.md with your instructions, then:

skillctl validate                         # schema + semver check
skillctl eval audit .                     # security scan → A-F grade
skillctl apply                            # push to governed store
skillctl install my-org/code-reviewer@0.1.0 --target all   # deploy to every IDE

That's the full lifecycle: create → validate → audit → publish → distribute.

Already have skills? Start here

If you already have skills in Claude Code, Cursor, or any IDE — no skill.yaml needed:

# Validate and audit an existing Claude Code skill
skillctl validate ~/.claude/skills/my-skill/SKILL.md
skillctl eval audit ~/.claude/skills/my-skill/

# Install it to other IDEs
skillctl install ~/.claude/skills/my-skill/ --target cursor,windsurf,kiro

# Or install from a URL
skillctl install --from-url https://raw.githubusercontent.com/.../SKILL.md --target all

Add a skillctl: block to your SKILL.md frontmatter for full governance metadata — IDEs ignore it, skillctl reads it:

---
name: code-reviewer
description: Reviews code for security issues
allowed-tools: Read Grep
skillctl:
  namespace: my-org
  version: 1.2.0
  category: security
  tags: [security, code-review]
---

What SkillsOps does

Validate and scan

skillctl validate ./my-skill          # schema, semver, capabilities
skillctl eval audit ./my-skill        # security scan → A-F grade

The security scanner checks 9 threat categories (~50 pattern detectors): hardcoded secrets, prompt injection, data exfiltration URLs, unsafe deserialization, encoded payloads, and more. Skills with critical findings are blocked from publishing. Customize with .skilleval.yaml — run skillctl eval init ./my-skill to generate one.

Evaluate with data

skillctl eval init ./my-skill         # generate eval scaffolds + .skilleval.yaml
skillctl eval functional ./my-skill   # runs agent with/without skill, measures difference
skillctl eval trigger ./my-skill      # does the skill activate when it should?
skillctl eval report ./my-skill       # unified score: 40% audit + 40% functional + 20% trigger

Optimize automatically

skillctl optimize ./my-skill --budget 5.0

Iterative loop: evaluate → identify weaknesses via LLM → generate variants → re-evaluate → promote the best. Works with any LLM via LiteLLM.

Publish with governance

skillctl apply ./my-skill             # validate + security scan + push to store

Every mutation is versioned, diffable, and auditable:

skillctl bump --minor                 # 1.0.0 → 1.1.0
skillctl diff my-org/code-reviewer@1.0.0 my-org/code-reviewer@1.1.0
skillctl get skills                   # list everything in the store
skillctl describe skill my-org/code-reviewer@1.1.0

Install to every IDE

skillctl install my-org/code-reviewer@1.1.0 --target all      # auto-detect IDEs
skillctl install my-org/code-reviewer@1.1.0 --target cursor    # specific IDE
skillctl install my-org/code-reviewer@1.1.0 --target kiro --global  # user-level
skillctl get installations                                     # what's installed where
skillctl uninstall my-org/code-reviewer@1.1.0 --target all     # clean up

Supported targets: Claude Code, Cursor, Windsurf, GitHub Copilot, Kiro. Frontmatter is automatically translated to each IDE's native format.

Export, import, share

skillctl export --namespace my-org    # tar.gz archive of your skills
skillctl import skills-backup.tar.gz  # restore on another machine

Key features

Feature	What it does
Security scanning	9 threat categories, ~50 pattern detectors, A-F grading
Functional evaluation	With/without-skill baseline comparison via LLM-as-judge
Trigger evaluation	Activation recall and specificity measurement
Automated optimization	LLM-driven iterative improvement loop with budget control
Multi-IDE install	Install governed skills to Claude Code, Cursor, Windsurf, Copilot, Kiro
SKILL.md first-class	Works with bare SKILL.md files — no skill.yaml required for local ops
Category taxonomy	12 built-in categories with validation
Content-addressed storage	SHA-256 hashing, integrity verification, structural diffing
Version management	`skillctl bump`, `skillctl diff`, breaking change detection
Self-hosted registry	FastAPI + SQLite + FTS5 search, HMAC-signed audit logs
AWS Agent Registry	Native integration via `bedrock-agentcore-control` API
Claude Code plugin	14 MCP tools + 3 skills for governance inside agentic IDEs
Export/import	Portable skill archives for sharing and backup

How it fits in

Author writes skill
    → skillctl validate        (schema check)
    → skillctl eval audit      (security scan, A-F grade)
    → skillctl eval functional (behavioral testing)
    → skillctl optimize        (automated improvement)
    → skillctl apply           (push to governed store)
    → skillctl install         (distribute to IDEs)
    → Enterprise discovery     (self-hosted registry or AWS Agent Registry)

Claude Code plugin

SkillsOps ships a Claude Code plugin in the plugin/ directory. It gives Claude direct access to all skillctl operations via MCP tools.

claude --plugin-dir ./plugin

# 14 MCP tools: validate, apply, list, describe, delete, diff, create,
#   eval_audit, eval_functional, eval_trigger, eval_report,
#   optimize, optimize_history, install
#
# 3 skills: /skillctl:skill-lifecycle, /skillctl:create-skill, /skillctl:diagnose-skill

Installation

pip install skillsops                  # core CLI (Python 3.10+)
pip install "skillsops[optimize]"      # + optimizer (LiteLLM)
pip install "skillsops[plugin]"        # + MCP server for Claude Code plugin
pip install "skillsops[server]"        # + registry server (FastAPI)
pip install "skillsops[all]"           # everything

Verify your setup:

skillctl doctor                       # checks Python, deps, store, registry, IDE targets

Documentation

Document	Purpose
docs/0-architecture.md	System overview, module map, data flow diagrams
docs/1-skill-format.md	Full CLI reference, skill format, registry server, eval suite, optimizer flags, API endpoints
CHANGELOG.md	Version history and release notes

Development

python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev,optimize,plugin]"
pytest -m "not integration"           # 545 unit tests
pytest -m integration                 # 10 real Bedrock tests (needs AWS creds)

License

MPL-2.0

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.0b3 pre-release

May 4, 2026

This version

0.1.0b2 pre-release

May 1, 2026

0.1.0b1 pre-release

Apr 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skillsops-0.1.0b2.tar.gz (190.1 kB view details)

Uploaded May 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

skillsops-0.1.0b2-py3-none-any.whl (150.8 kB view details)

Uploaded May 1, 2026 Python 3

File details

Details for the file skillsops-0.1.0b2.tar.gz.

File metadata

Download URL: skillsops-0.1.0b2.tar.gz
Upload date: May 1, 2026
Size: 190.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for skillsops-0.1.0b2.tar.gz
Algorithm	Hash digest
SHA256	`ba8fd131ee466d9035e5449216dbcb6ac0b346ca05396937bdb834321ab3a1f2`
MD5	`42aeb19a40ec05eca2c5f9325fb9dc41`
BLAKE2b-256	`0da75f240e0b70005d013ae4910493b126f0638571ab0e39b97b9ef662d3a348`

See more details on using hashes here.

File details

Details for the file skillsops-0.1.0b2-py3-none-any.whl.

File metadata

Download URL: skillsops-0.1.0b2-py3-none-any.whl
Upload date: May 1, 2026
Size: 150.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for skillsops-0.1.0b2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`09f1c855c236f235f0f11d6ef13b8ab0349ff53e50e7a18ed8eca45af0ef1d57`
MD5	`26250448b2a718f918505ae234673218`
BLAKE2b-256	`180847c35647fcf8ca7d94c85cff4fc9adfe2073d25ba625bc3a68e8b5f55d30`

See more details on using hashes here.

skillsops 0.1.0b2

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

The problem

Get started in 60 seconds

Already have skills? Start here

What SkillsOps does

Validate and scan

Evaluate with data

Optimize automatically

Publish with governance

Install to every IDE

Export, import, share

Key features

How it fits in

Claude Code plugin

Installation

Documentation

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes