
lintlang


Static linter for AI agent tool descriptions, system prompts, and configs.

Most AI agent bugs aren't code bugs — they're language bugs. Vague tool descriptions make agents pick the wrong tool. Missing constraints cause infinite loops. Schema mismatches break structured output. lintlang catches these at authoring time, in CI, with zero LLM calls.

Install

pip install lintlang

Requires Python 3.10+. One dependency (pyyaml). No API keys, no network access, no LLM calls.

Quick Start

# Scan a single file
lintlang scan agent_config.yaml

# Scan a directory (finds .yaml, .json, .txt, .md, .prompt)
lintlang scan configs/

# JSON output for CI
lintlang scan config.yaml --format json

# Fail CI on CRITICAL/HIGH findings
lintlang scan config.yaml --fail-on fail

# Fail CI on any MEDIUM+ findings
lintlang scan config.yaml --fail-on review

Example Output

  LINTLANG v0.2.0
  bad_tool_descriptions.yaml
  ──────────────────────────────────────────────────

  ❌ FAIL — 1 CRITICAL, 2 HIGH, 6 MEDIUM, 3 LOW

  H1: Tool Description Ambiguity

    !! [CRITICAL] tool:process_ticket
      Tool 'process_ticket' has no description.
      → Add a specific description explaining WHEN to use this tool.

    ! [HIGH] tool:get_user_info
      Tool 'get_user_info' has a very short description (13 chars)
      → Expand to include purpose, when to use, expected input/output.

    ~ [MEDIUM] tool:handle_request
      Tool 'handle_request' starts with vague verb 'handle'.
      → Replace with a specific action verb.

  H2: Missing Constraint Scaffolding

    ! [HIGH] system_prompt
      System prompt defines tools but has no termination conditions.
      → Add: 'Maximum 5 tool calls per task. Stop and report after 2 failures.'

  ──────────────────────────────────────────────────
  lintlang v0.2.0 | H1-H7 structural analysis | Zero LLM calls

How It Works

lintlang gives you a verdict, not a score:

  Verdict      Meaning                    When
  ✅ PASS      Ship it                    Only LOW/INFO findings, or none
  ⚠️ REVIEW    Has blind spots            MEDIUM findings present
  ❌ FAIL      Will break in production   CRITICAL or HIGH findings

Each finding includes the pattern (H1-H7), severity, location, and a concrete fix suggestion. No vague "improve your prompt" — specific rewrites you can apply immediately.
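The severity-to-verdict mapping can be pictured as a small pure function. This is an illustrative sketch of the semantics in the table above, not lintlang's actual source — the real package exposes compute_verdict directly, and the names below are hypothetical:

```python
# Illustrative sketch of the verdict semantics described above.
# Not lintlang's implementation; SEVERITY_RANK and verdict() are hypothetical.
SEVERITY_RANK = {"INFO": 0, "LOW": 1, "MEDIUM": 2, "HIGH": 3, "CRITICAL": 4}

def verdict(severities):
    """Map a list of finding severities to PASS / REVIEW / FAIL."""
    worst = max((SEVERITY_RANK[s] for s in severities), default=0)
    if worst >= SEVERITY_RANK["HIGH"]:
        return "FAIL"
    if worst >= SEVERITY_RANK["MEDIUM"]:
        return "REVIEW"
    return "PASS"

print(verdict(["LOW", "INFO"]))         # → PASS
print(verdict(["MEDIUM", "LOW"]))       # → REVIEW
print(verdict(["CRITICAL", "MEDIUM"]))  # → FAIL
```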

Why These 7 Detectors?

These aren't arbitrary rules — they're the 7 structural failure modes that cause real agent breakdowns in production. We identified them through audits of 8 major AI frameworks (LangChain, Semantic Kernel, AutoGen, smolagents, LiteLLM, Anthropic SDK, OpenAI SDK, Agno) and 12 filed PRs. Each detector maps to a specific class of bug that no other linter catches, because these are language problems, not code problems.

No existing tool covers this: yamllint checks syntax, semgrep checks code patterns, ruff checks Python style. None of them can tell you that your tool description is ambiguous enough to cause wrong-tool selection, or that your system prompt lacks termination conditions and will loop forever.

Structural Detectors (H1-H7)

  Pattern  Name                                 What Users Report                     Severity
  H1       Tool Description Ambiguity           "Agent picks wrong tool"              CRITICAL-MEDIUM
  H2       Missing Constraint Scaffolding       "Agent loops infinitely"              CRITICAL-HIGH
  H3       Schema-Intent Mismatch               "Structured output broken"            CRITICAL-LOW
  H4       Context Boundary Erosion             "Agent leaks state across tasks"      HIGH-MEDIUM
  H5       Implicit Instruction Failure         "Model doesn't follow instructions"   MEDIUM-LOW
  H6       Template Format Contract Violation   "Agent broke after prompt change"     MEDIUM-INFO
  H7       Role Confusion                       "Chat history is messed up"           CRITICAL-MEDIUM

H5: Context-Aware Negatives

H5 distinguishes between safety constraints and style negatives. Security rules like "Never expose API keys" are correctly exempted. Style issues like "Don't be verbose" are flagged with positive rewrites.
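A hypothetical prompt fragment makes the distinction concrete (the field name and wording are illustrative, not a required lintlang schema):

```yaml
# Illustrative config fragment for H5; field names are assumptions.
system_prompt: |
  Never expose API keys or customer PII.
  Don't be verbose.
# H5 would exempt the first line (safety constraint) and flag the second
# (style negative), suggesting a positive rewrite such as
# "Answer in at most 3 sentences."
```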

Validated on 26 real-world configs (OpenHands, RAG agents, HIPAA compliance, financial advisors, content moderation, DevOps safety) — see samples/ for examples.

Why not just use GPT-4?

Zero cost, zero latency, zero data exposure. Runs in CI where LLM calls can't. Catches structural patterns (missing termination, schema mismatches, role ordering) that LLMs are blind to because they process content, not structure.

CI Integration

GitHub Actions

- name: Lint agent configs
  run: |
    pip install lintlang
    lintlang scan configs/ --fail-on fail
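Embedded in a standalone workflow file, the step above might look like this. The trigger, job name, and action versions are assumptions; only the pip install and lintlang scan lines come from the snippet above:

```yaml
# .github/workflows/lint-agents.yml — illustrative workflow sketch.
name: Lint agent configs
on: [push, pull_request]
jobs:
  lintlang:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"
      - name: Lint agent configs
        run: |
          pip install lintlang
          lintlang scan configs/ --fail-on fail
```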

Verdict-Based Gating

  Flag               Exits 1 when                Use case
  --fail-on fail     Any CRITICAL/HIGH finding   Blocking deploy gate
  --fail-on review   Any MEDIUM+ finding         Strict quality gate
  --fail-under 80    Quality score < threshold   Legacy score-based gate

Filter by Severity

# Only show CRITICAL and HIGH
lintlang scan config.yaml --min-severity high

# Only check specific patterns
lintlang scan config.yaml --patterns H1 H3

Programmatic API

from lintlang import scan_file, compute_verdict

result = scan_file("config.yaml")
verdict = compute_verdict(result.structural_findings)
print(f"Verdict: {verdict}")  # PASS, REVIEW, or FAIL

for finding in result.structural_findings:
    print(f"  [{finding.severity.value}] {finding.description}")
    print(f"  → {finding.suggestion}")

# Scan a directory
from lintlang import scan_directory, compute_verdict

results = scan_directory("configs/")
for path, result in results.items():
    verdict = compute_verdict(result.structural_findings)
    print(f"{path}: {verdict}")

Supported Formats

lintlang auto-detects file format:

  • YAML (.yaml, .yml) — OpenAI function-calling format, tool definitions
  • JSON (.json) — OpenAI and Anthropic tool schemas, message arrays
  • Plain text (.txt, .md, .prompt) — System prompts, instruction docs

Unknown extensions are tried as JSON → YAML → plain text.
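The fallback can be sketched roughly as follows. This is a simplified illustration, not lintlang's detector: the YAML step is elided to keep the sketch stdlib-only (the real tool tries yaml via pyyaml between the JSON and plain-text steps), and detect_format is a hypothetical name:

```python
import json

def detect_format(text: str) -> str:
    """Simplified sketch of the JSON → (YAML) → plain-text fallback.
    The YAML attempt is omitted here; see the comment below."""
    try:
        json.loads(text)
        return "json"
    except json.JSONDecodeError:
        # The real tool would try yaml.safe_load() before giving up.
        return "text"

print(detect_format('{"tools": []}'))             # → json
print(detect_format("You are a helpful agent."))  # → text
```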

How Is lintlang Different?

  Tool              What It Does                               How lintlang Differs
  promptfoo         Tests prompts via eval suites at runtime   lintlang is static — no LLM calls, catches issues at authoring time
  guardrails-ai     Validates LLM outputs at runtime           lintlang catches root causes (bad instructions), not symptoms
  NeMo Guardrails   Runtime dialogue rails                     lintlang operates on config files, not live conversations
  eslint / ruff     Lints source code                          lintlang lints natural language in agent configs

lintlang treats tool descriptions, system prompts, and agent configs as lintable artifacts — static analysis for prose, like eslint for JavaScript.

Development

git clone https://github.com/roli-lpci/lintlang.git
cd lintlang
pip install -e ".[dev]"
pytest

Hermes Labs Ecosystem

lintlang is part of the Hermes Labs open-source suite.


If lintlang saves you time, please star the repo — it helps others find it.

License

Apache 2.0

