Skip to main content

OpenAPI scoring, sandbox, and CLI tool scoring for AI-ready agents — specscore

Project description

specscore

A CLI tool that scores OpenAPI specs for AI-readiness across 6 dimensions — giving APIs a letter grade and actionable recommendations so they work well with AI agents and LLM tooling.

Installation

# With uv (recommended)
uv pip install -e .

# Or with pip
pip install -e .

Usage

Scorecard

# Score an OpenAPI spec
specscore score path/to/openapi.yaml

# Output as JSON
specscore score path/to/openapi.yaml --json

# Run against the built-in sample spec to see how scoring works
specscore demo
specscore demo --json

Sandbox

The sandbox spins up a local mock server from your spec and auto-probes every endpoint, reporting a feasibility score — how well the spec can actually be exercised by an agent.

# Start a persistent mock server (press Ctrl+C to stop)
specscore sandbox start path/to/openapi.yaml
specscore sandbox start path/to/openapi.yaml --port 9000

# Probe all endpoints and get a report
specscore sandbox probe path/to/openapi.yaml
specscore sandbox probe path/to/openapi.yaml --json

# Run against the built-in sample spec
specscore sandbox demo
specscore sandbox demo --json

The probe command:

  1. Starts an internal mock server
  2. Generates synthetic request payloads and path parameters from the spec's schemas
  3. Fires requests at every operation
  4. Reports per-endpoint status codes, response times, and any issues found
  5. Exits with code 2 if the feasibility score is below 50%

Exit codes

Code Meaning
0 Success / score ≥ threshold
1 Error reading or parsing the spec
2 Score below threshold (< 60 for scorecard, < 50% for sandbox)

What gets scored

Each spec is evaluated across 6 weighted dimensions:

Dimension Weight What it checks
Foundational Compliance 20% OpenAPI version, info object, paths defined, spec validity
Developer Experience 15% Operation IDs, summaries, request/response examples, parameter descriptions
AI-Readiness & Agent Experience 20% Meaningful descriptions, error responses (4xx/5xx), response schemas, schema property descriptions
Agent Usability 20% Semantic operation IDs, tag usage, parameter schemas, request body documentation
Security & Governance 15% Security schemes defined, operations secured, auth documentation
AI Discoverability 10% API-level description, tags defined, external docs

Grading

Score Grade
90–100 A
80–89 B
70–79 C
60–69 D
< 60 F

Example output

╭─ specscore · OpenAPI AI readiness ───────────────────────────╮
│  Task Manager API  v1.0.0                                      │
│  sample_spec.yaml                                              │
│                                                                │
│  Overall Score: 42.3/100  Grade: F                             │
╰────────────────────────────────────────────────────────────────╯

╭──────────────────────────────────┬───────────┬───────┬──────────────────────────┬────────╮
│ Dimension                        │     Score │ Grade │ Progress                 │ Issues │
├──────────────────────────────────┼───────────┼───────┼──────────────────────────┼────────┤
│ Foundational Compliance          │  75.0/100 │     C │ ███████████████░░░░░     │      1 │
│ Developer Experience             │  38.5/100 │     F │ ███████░░░░░░░░░░░░░░    │      3 │
│ AI-Readiness & Agent Experience  │  31.2/100 │     F │ ██████░░░░░░░░░░░░░░░    │      4 │
│ ...                              │       ... │   ... │ ...                      │    ... │
╰──────────────────────────────────┴───────────┴───────┴──────────────────────────┴────────╯

Issues & Recommendations

  AI-Readiness & Agent Experience
    x 4/6 operations lack descriptive intent (need >30 char description)
       paths.*.*.description
    ! 5/6 operations have no documented error responses (4xx/5xx) — agents cannot handle failure gracefully
       paths.*.*.responses

JSON output

Pass --json to get a machine-readable report:

specscore score openapi.yaml --json | jq '.overall_score'
{
  "api_name": "Task Manager API",
  "api_version": "1.0.0",
  "spec_path": "openapi.yaml",
  "overall_score": 42.3,
  "grade": "F",
  "dimensions": [
    {
      "name": "Foundational Compliance",
      "score": 75.0,
      "grade": "C",
      "issues": [...]
    }
  ]
}

Development

# Install with dev dependencies
uv pip install -e .

# Run directly without installing
python main.py score path/to/spec.yaml

License

The scorecard scoring model in this project (scorecard/ package) is derived from the Jentic API AI-Readiness Framework (JAIRF), which is licensed under the Apache License, Version 2.0. See the NOTICE file for the required upstream attribution.

Original contributions (sandbox, clitic, GitHub Pages site) are copyright (c) 2025-2026 SheepSeb. The project as a whole is distributed under the Apache License, Version 2.0 — see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

specscore-0.1.4.tar.gz (93.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

specscore-0.1.4-py3-none-any.whl (45.7 kB view details)

Uploaded Python 3

File details

Details for the file specscore-0.1.4.tar.gz.

File metadata

  • Download URL: specscore-0.1.4.tar.gz
  • Upload date:
  • Size: 93.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for specscore-0.1.4.tar.gz
Algorithm Hash digest
SHA256 1626709cabeca88bf4de9f1f44754055ead05cea80c9fbf56c273538b895220c
MD5 134c4170989e4052a4502c3492daa884
BLAKE2b-256 b9d1de6c639b304c5fb3e7b0afa8de61ea2dc11386647fb3e7254917c22dab82

See more details on using hashes here.

File details

Details for the file specscore-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: specscore-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 45.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for specscore-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 b94f5ccb04659972358c7f50f7109870112d40c9d7c4cd67913057c494571196
MD5 99fe351d991c4bc55a4aefd2fbf977e9
BLAKE2b-256 0ccbf278495d063e9ae877000ee8b9fffd5b9b9b14774c83faa4e340d122534f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page