Skip to main content

CLI for measuring verbosity and erosion metrics in Python codebases

Project description

scb-check

Python CLI that reports SCBench verbosity and erosion composites for a Python codebase.

  • Paper

  • Source

  • SlopCodeBench (main repo)

  • Verbosity: fraction of SLOC flagged by clone detection or ast-grep slop rules.

  • Erosion: share of function "mass" (complexity * sqrt(sloc)) concentrated in high-complexity functions (cyclomatic complexity > 10).

  • Cognitive erosion: same mass-share calculation using cognitive complexity > 10.

Install

Requires Python 3.12+.

Run without installing (recommended):

uvx scb-check check PATH
uvx --from git+https://github.com/gabeorlanski/scb-check scb-check check PATH

Or install into the current project:

uv sync              # for development in this repo
uv add scb-check     # as a dependency elsewhere

For hash-checked dependency installs from this repository, use the exported lock files:

python -m pip install --require-hashes -r requirements.lock
python -m pip install --require-hashes -r requirements-dev.lock

Regenerate them after dependency changes with:

uv export --format requirements.txt --no-dev --no-emit-project --frozen --output-file requirements.lock
uv export --format requirements.txt --all-groups --no-emit-project --frozen --output-file requirements-dev.lock

Usage

scb-check check PATH                    # human-readable flags
scb-check check PATH --report           # JSON report with verbosity/erosion scores
scb-check check PATH -v / --verbosity   # add info logging
scb-check check PATH -vv                # add debug logging
scb-check check PATH --config FILE      # explicit config path
scb-check check PATH --include-all      # include ignored and boundary-suppressed ast-grep findings
scb-check rule RULE_ID                  # print YAML for a specific ast-grep rule

PATH may be a file or directory. Directories are walked for *.py files.

JSON report fields

verbosity, erosion, cog_erosion, files_scanned, total_loc, verbosity_flagged_loc, clone_loc, ast_grep_flagged_loc, total_functions, high_cc_functions, high_cog_functions, total_mass, high_cc_mass, total_cog_mass, high_cog_mass.

Configuration

scb-check looks for scb-check.toml or a pyproject.toml containing [tool.scb-check], [tool.ruff], or [tool.ty.src], walking upward from the current directory until it hits a .git root.

# scb-check.toml
exclude = ["tests/fixtures/*", "vendor/**"]
context = 1
# pyproject.toml
[tool.scb-check]
exclude = ["tests/fixtures/*"]
context = 2
  • exclude: list of glob patterns to skip while discovering Python files.
  • context: number of surrounding source lines to show around human-readable ast-grep and erosion findings.

When using pyproject.toml, scb-check also includes excludes from:

  • [tool.ruff].exclude
  • [tool.ruff].extend-exclude
  • [tool.ty.src].exclude

Source directives

You can suppress specific ast-grep findings at the source line level with:

# scbc ignore[rule-id]

Same-line form:

value = cfg.get("a", {}).get("b", {})  # scbc ignore[chained-dict-get] Boundary normalization for legacy webhook payloads.

Standalone block form:

# scbc ignore[chained-dict-get]
# Boundary normalization for legacy webhook payloads.
value = cfg.get("a", {}).get("b", {})

Multiple rule IDs:

# scbc ignore[chained-dict-get,dict-get-empty-dict-default]
# Legacy webhook payloads are partially populated and normalized downstream.
value = cfg.get("a", {}).get("b", {})

Function-level boundary suppression is available for code that intentionally validates or normalizes external input:

def _load_toml(path: Path) -> dict[str, Any]:
    # scbc boundary: reads and validates user config
    ...

Boundary directives must be inside the function body, after the def line. By default, ast-grep findings inside that function are hidden. Use --include-all to show ignored and boundary-suppressed ast-grep findings.

Rules:

  • Rule IDs inside ignore[...] are required.
  • Reason text is optional.
  • Same-line ignore directives apply to that same physical line.
  • Standalone ignore directives apply to the next non-blank, non-comment code line.
  • Boundary directives apply to the containing function body.
  • Only ast-grep findings are suppressible; clone and erosion findings are not.
  • Invalid directives fail the run with exit code 2 unless --include-all is used.

How it works

  • Parsing: tree-sitter-python.
  • Clone detection: hashed AST blocks across the scanned set; two or more matching instances become a CloneBlock.
  • Slop patterns: ast-grep rules in src/scb_check/resources/slop_rules/ split by category (e.g. range(len(x)), dict.get(k, None), isinstance ladders, manual min/max, defensive guards).
  • Extra local slop patterns: set SCB_CHECK_EXTRA_SLOP_RULES to a :-separated list of YAML paths to layer additional rules on top of the bundled set.
  • Complexity: per-function cyclomatic and cognitive complexity plus SLOC, combined into mass scores for erosion metrics.

Development

uv run pytest
uv run ruff check
uv run ty check src/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scb_check-0.1.0.tar.gz (107.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scb_check-0.1.0-py3-none-any.whl (59.0 kB view details)

Uploaded Python 3

File details

Details for the file scb_check-0.1.0.tar.gz.

File metadata

  • Download URL: scb_check-0.1.0.tar.gz
  • Upload date:
  • Size: 107.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.8.4

File hashes

Hashes for scb_check-0.1.0.tar.gz
Algorithm Hash digest
SHA256 ae79a1388a0857ecc5c0249016aa635f6b602f15f3fb5e543d65318d8e49823a
MD5 d6551fac9d55e3d99fde612d0084a6c4
BLAKE2b-256 d06a248cc399e3602cbf8085ee4aa7c15e5da81eadfae1a8fbd268c2ac63914a

See more details on using hashes here.

File details

Details for the file scb_check-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: scb_check-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 59.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.8.4

File hashes

Hashes for scb_check-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0f442c87fe1cc4c90787e554c3a98acd7b7ffd09f0410fc3460cac4e414b4200
MD5 75aad7b6a0eb63da2a224d1c067042ea
BLAKE2b-256 e529a06ad02ce1a2c0d76091051d393b032f3e8897ab9813ec56174c079cd18a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page