
Agent Skill infrastructure: quality check, behavior test runner, version awareness


Agent Skill Infrastructure

From dev to ops: quality check, behavior test runner, and version awareness for Agent Skills.

Python 3.12+ | License: MIT

Install

# PyPI (recommended)
pip install agent-skill-infra

# With LLM judge support (requires Anthropic API key)
# Quote the extra so shells such as zsh do not expand the brackets
pip install "agent-skill-infra[llm]"

# From source
git clone https://github.com/Liber1917/agent-skill-infra.git
cd agent-skill-infra
uv sync

CLI Commands

skill-test — Behavior Test Runner

Run evals.json test suites against skills with multi-dimension judgment.

# Run tests
skill-test run tests/fixtures/evals.json

# Output as JSON
skill-test run tests/fixtures/evals.json --output json

# Use mock adapter (no Agent runtime needed)
skill-test run tests/fixtures/evals.json --adapter mock

Judge types: keyword, schema, llm, flow, snapshot
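
To illustrate the simplest of the judge types listed above, here is a minimal sketch of how a keyword-style judge could work. It is not the package's implementation: `KeywordVerdict` and `keyword_judge` are hypothetical names, and case-insensitive substring matching is an assumption about what a keyword check means.

```python
from dataclasses import dataclass


@dataclass
class KeywordVerdict:
    """Hypothetical result type: pass/fail plus the keywords that were not found."""
    passed: bool
    missing: list[str]


def keyword_judge(output: str, required: list[str], case_sensitive: bool = False) -> KeywordVerdict:
    """Pass only if every required keyword appears somewhere in the output."""
    haystack = output if case_sensitive else output.lower()
    missing = [
        kw for kw in required
        if (kw if case_sensitive else kw.lower()) not in haystack
    ]
    return KeywordVerdict(passed=not missing, missing=missing)


verdict = keyword_judge("Created branch feature/login", ["branch", "commit"])
print(verdict)  # KeywordVerdict(passed=False, missing=['commit'])
```

Reporting the missing keywords, rather than a bare boolean, makes a failing case easier to diagnose from the test report.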

skill-quality — Quality Assessment

Score your SKILL.md against the helloandy 8-dimension framework.

# Quick quality check
skill-quality /path/to/skill/SKILL.md

# JSON output
skill-quality /path/to/skill/SKILL.md --output json

# With agent-skill-linter integration
skill-quality /path/to/skill/SKILL.md --lint

# With security scan (requires cisco-scanner)
skill-quality /path/to/skill/SKILL.md --security
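
For scripted use, such as a CI step, the commands above can be driven from Python. The sketch below uses only the flags shown in this section; `build_quality_command` and `run_quality` are hypothetical helper names. It assumes that the flags can be combined in one invocation and that the JSON report is written to stdout. The report's schema is not documented here, so the sketch returns the parsed value without inspecting any keys.

```python
import json
import subprocess
from typing import Any


def build_quality_command(skill_md: str, lint: bool = False, security: bool = False) -> list[str]:
    """Assemble a skill-quality invocation that requests JSON output."""
    cmd = ["skill-quality", skill_md, "--output", "json"]
    if lint:
        cmd.append("--lint")
    if security:
        cmd.append("--security")
    return cmd


def run_quality(skill_md: str, lint: bool = False, security: bool = False) -> Any:
    """Run the CLI and parse stdout as JSON (assumption: the report goes to stdout)."""
    result = subprocess.run(
        build_quality_command(skill_md, lint=lint, security=security),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return json.loads(result.stdout)


# Only the command is built here, so this runs without the CLI installed.
print(build_quality_command("/path/to/skill/SKILL.md", lint=True))
```

Passing the command as a list rather than a shell string avoids quoting problems when the skill path contains spaces.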

skill-version — Version Awareness

Track changes, detect regressions, and roll back safely.

# See what changed between versions
skill-version diff /path/to/skill --old-ref HEAD~3 --new-ref HEAD

# Diff + security analysis
skill-version check /path/to/skill --security

# Roll back to previous version
skill-version rollback /path/to/skill --target-ref HEAD~1 --yes

# Store and compare baselines
skill-version baseline store /path/to/skill case-1 output.txt
skill-version baseline detect /path/to/skill case-1 output.txt
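
The store/detect pair above follows a common baseline pattern: record a known-good output for a case, then compare later outputs against it. The sketch below shows that idea in isolation. It is an illustration under stated assumptions, not how `skill-version` works internally: the function names, the `baselines.json` file, and the exact-match SHA-256 comparison are all hypothetical.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def _digest(text: str) -> str:
    """Fingerprint an output so only a short hash needs to be stored."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def store_baseline(store: Path, case_id: str, output: str) -> None:
    """Record (or overwrite) the baseline fingerprint for one test case."""
    baselines = json.loads(store.read_text()) if store.exists() else {}
    baselines[case_id] = _digest(output)
    store.write_text(json.dumps(baselines, indent=2))


def detect_regression(store: Path, case_id: str, output: str) -> bool:
    """Return True when the output differs from the stored baseline."""
    baselines = json.loads(store.read_text())
    return baselines[case_id] != _digest(output)


with tempfile.TemporaryDirectory() as tmp:
    store = Path(tmp) / "baselines.json"
    store_baseline(store, "case-1", "hello world\n")
    print(detect_regression(store, "case-1", "hello world\n"))  # False
    print(detect_regression(store, "case-1", "hello there\n"))  # True
```

Exact-match hashing flags any change at all, including whitespace; a real regression check may compare more loosely.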

Modules

Module         CLI            Description
quality_check  skill-quality  8-dimension scoring, linter integration, security
test_runner    skill-test     evals.json runner, 5 judge types, snapshot testing
version_aware  skill-version  diff, rollback, regression, security diff

Development

# Install dev dependencies
uv sync

# Run tests (162+)
uv run pytest

# Lint & format
uv run ruff check .
uv run ruff format .

# Type check
uv run pyright

# Build
uv build

License

MIT
