Guards LLM agents from accidental large-context file reads, web fetches, and tool output.

Project description

GOT

GOT hero image

Guardians of the Token is a safety layer for agentic coding sessions. It helps Claude Code, Codex, and other LLM clients avoid accidental context explosions from huge files, oversized web fetches, and noisy tool output.

LLM agents are powerful because they can read, run, fetch, and reason. GOT keeps that power controlled.

Prompt guard demo

Why GOT

Agentic coding tools can burn through context in one bad move:

reading a giant log or transcript into chat
dumping command output that should have gone to a file
fetching a large page directly into model context
carrying stale session history until compaction degrades continuity

GOT watches the risky paths and turns them into a clear pause:

🛡️ Guardians of the Token blocked this command.
Target: /tmp/guardians_test_compact
Estimate: ~200,000 tokens (50% of the 400,000-token window)
Current context: ~0 tokens (0%)
Next options:
- Inspect the beginning
- Inspect the end
- Search for a term
- Summarize a bounded section
- Bypass once for the full file

The goal is simple: prevent accidental context damage while keeping the agent moving.

What It Guards

Client	Guarded paths
Claude Code	`Read`, `Bash`, `WebFetch`, oversized tool output, off-topic prompts in large sessions
Codex	risky `Bash` file dumps, URL fetches, oversized Bash output
MCP clients	guarded source indexing and bounded file tools through `guardians-mcp`
API workflows	experimental request-size proxy through `guardians-proxy`

Product Principles

Warn before context is wasted. Large reads and fetches are blocked before they enter the conversation.
Suggest intent, not shell incantations. GOT offers high-level next steps; the agent handles the mechanics.
Prefer bounded work. Inspect, search, summarize, and only bypass when the user explicitly wants the full payload.
Use one language across clients. Claude and Codex have different hook surfaces, but users see the same warning style.
Stay local-first. Hooks run locally and use lightweight token estimates.

Install

Install the PyPI package:

python3 -m pip install guardians-of-the-token

Then run the installer:

guardians-install

The package name is guardians-of-the-token. The Python import package is guardians_of_the_token.

guardians-install shows a terminal banner, detects supported local clients, and presents a checkbox selector for individual integrations:

Codex CLI hooks
Codex app MCP
Claude Code hooks
Claude Desktop MCP

Use Up/Down to move, Space to toggle, and Enter to install the selected integrations. Use guardians-install --yes to install into all detected integrations without prompting.

If your Python environment blocks global installs, use a virtual environment or pipx:

pipx install guardians-of-the-token
guardians-install

From a checkout during development:

python3 -m pip install -e .
guardians-install

Manual install commands:

Codex workspace hooks:

guardians-codex-install /path/to/workspace

Claude Code workspace hooks:

guardians-claude-install /path/to/workspace

Codex global hooks + MCP registration:

guardians-codex-install --global

Claude Code global hooks:

guardians-claude-install --global

Run the experimental MCP demo server:

guardians-mcp

Run the experimental HTTP proxy:

guardians-proxy

How It Works

GOT estimates token impact with a conservative heuristic:

stage 1: size + type hint
stage 2: small text sample near threshold (files only)

Before a risky operation runs, the relevant hook checks file size, URL HEAD metadata, or command shape. File preflight stays cheap by sampling only when an estimate lands near the warning threshold. After a tool runs, post-hooks suppress output above the soft cap before it enters model context.

When a request is too large, GOT blocks it and returns a shared warning template with:

the target
estimated token impact
current context estimate when available
compaction risk when relevant
high-level next options

Prompt Guard (Claude Code only)

Once a session crosses ~30% of the model's context window, GOT also checks each new prompt for topical drift. If the prompt looks unrelated to the ongoing conversation it blocks the submission before Claude processes it, saving you a round-trip's worth of input tokens.

You'll see a block message like:

🛡️ Guardians blocked this prompt before Claude processed it.

Reason: this looks unrelated to the current large Claude session and would
send a lot of unrelated context.
Similarity: 0.07 (block threshold 0.10)
Context: 168.9k / 200.0k tokens (84%)
Estimated cost if sent: $0.5068

To continue anyway, resend the same prompt prefixed with GOT_UNBLOCK.

To override a block, resend the prompt with the configured prefix:

GOT_UNBLOCK <your prompt>

A small ONNX embedding model (~22 MB, all-MiniLM-L6-v2) is downloaded automatically the first time you run guardians-install. You can also fetch it manually:

guardians-download-models

Configure the guard in ~/.guardians.json:

{
  "prompt_guard": {
    "enabled": true,
    "block_context_pct": 0.30,
    "very_low_similarity": 0.10,
    "unblock_prefix": "GOT_UNBLOCK"
  }
}

Set "enabled": false to disable it entirely. All decisions (allow and block) are logged to .got/events.jsonl with the gate that fired.

Experimental Surfaces

The production-quality integrations today are the Claude Code and Codex hooks.

guardians-mcp exposes lightweight preflight and bounded file-access tools for Claude Desktop Projects and other MCP clients. It cannot intercept native file uploads, but it can make projects guard-aware by checking file size before Claude reads a source and by giving Claude safe head, tail, search, and chunk tools.

Recommended Claude Project workflow:

Run guardians-project-init /path/to/project.
Review the GOT policy block appended to that project's CLAUDE.md.
Use got_file_size before analyzing a new or potentially large local file.
Use bounded tools instead of raw full-file ingestion when risk is warning or critical.

GOT MCP is intended as a preflight for unknown, new, or potentially large sources. If a source is classified as safe, Claude should proceed normally with native tools instead of routing ordinary small-file work through GOT.

MCP project state is stored under ./.got/ by default:

.got/
  GUARDIANS_PROJECT_POLICY.md
  index/

Set GUARDIANS_INBOX=/path/to/dir before starting guardians-mcp to use a different storage directory.

Key MCP tools:

got_project_init
got_project_policy
got_file_size
got_url_size
got_file_head
got_file_tail
got_file_search
got_file_chunk_summary

guardians-proxy is currently a minimal FastAPI proxy for Anthropic/OpenAI-style message payloads. It estimates request size before forwarding, but it is not yet a full production gateway.

Configuration

Optional user config lives at ~/.guardians.json:

{
  "warn_threshold_pct": 20,
  "default_input_price_per_million": 3.0,
  "max_output_tokens": 8000,
  "telemetry_enabled": false,
  "telemetry_host": "https://us.i.posthog.com",
  "telemetry_api_key": "phc_..."
}

Project config can live in .guardians.toml:

warn_threshold_pct = 10
max_output_tokens = 8000
default_input_price_per_million = 3.0
telemetry_enabled = false

whitelist = ["README.md", "docs/**"]
ignore = ["node_modules/**", ".git/**", "dist/**", "build/**"]

Whitelisted and ignored files are allowed through without blocking. Use this for known-safe paths that agents often inspect.

Bypass is single-use:

touch /tmp/guardians_bypass

Local Reports

GOT records blocked and suppressed operations locally under .got/events.jsonl. Telemetry is off by default. If you opt in, GOT sends one anonymous install event to a PostHog-compatible capture endpoint. It only sends anonymous installation ID, GOT version, Python version, and OS. GOT does not send IP as event data and disables PostHog GeoIP enrichment. It never sends paths, URLs, prompts, file contents, command text, tool usage, risk levels, actions, geolocation, or token counts. The installer shows telemetry as a default-on selectable option and writes your choice to ~/.guardians.json. You can override it with GUARDIANS_TELEMETRY=0 or GUARDIANS_TELEMETRY=1.

Print a local savings report:

guardians-report

Run the local dashboard:

guardians-dashboard

The dashboard binds to 127.0.0.1:8766 by default and shows estimated tokens and dollars saved, top risky files/URLs, and activity by client.

Test Fixtures

GOT includes deterministic test stubs for hook development:

Fixture	Meaning
`/tmp/guardians_test_small`	small file estimate
`/tmp/guardians_test_large`	large file estimate
`/tmp/guardians_test_compact`	compaction-scale file estimate
`https://guardians-test/large`	large URL estimate
`https://guardians-test/compact`	compaction-scale URL estimate

The install commands refresh the /tmp/guardians_test_* files with short, readable fixture text. The hooks still use deterministic fake sizes for these exact paths, so the files do not need to be physically large. To test blocking, use a full-read command such as:

cat /tmp/guardians_test_compact

For a more realistic URL test, run the local fixture server:

guardians-test-server

If the client cannot reach your host 127.0.0.1, bind all interfaces instead:

guardians-test-server --host 0.0.0.0

The server will print candidate LAN URLs you can use from a sandboxed app.

Then fetch one of these URLs from Claude Code or Codex:

http://127.0.0.1:8765/large
http://127.0.0.1:8765/compact

The server returns large Content-Length values to HEAD requests but only small readable bodies for GET, so hooks can test real pre-fetch size checks without downloading a large payload.

Project Layout

guardians_of_the_token/
  claude/      Claude Code hooks
  codex/       Codex hooks
  messages.py  shared warning templates
  guard.py     core token guard logic
  mcp_server.py
  proxy.py

Status

GOT is early, practical infrastructure for local LLM workflows. The current focus is preventing accidental context loss in Claude Code and Codex, then expanding toward richer byte-size tracking, model-aware thresholds, saved-output workflows, and session-health warnings.

Project details

Release history Release notifications | RSS feed

1.3.0

Jun 8, 2026

1.2.0

May 16, 2026

This version

1.1.1

May 16, 2026

1.0.1

May 9, 2026

1.0.0

May 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

guardians_of_the_token-1.1.1.tar.gz (62.9 kB view details)

Uploaded May 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

guardians_of_the_token-1.1.1-py3-none-any.whl (57.4 kB view details)

Uploaded May 16, 2026 Python 3

File details

Details for the file guardians_of_the_token-1.1.1.tar.gz.

File metadata

Download URL: guardians_of_the_token-1.1.1.tar.gz
Upload date: May 16, 2026
Size: 62.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for guardians_of_the_token-1.1.1.tar.gz
Algorithm	Hash digest
SHA256	`60659e6091377d49d5ca42c9184d8b6846a38c7c9615a86bc3775d8661830f9c`
MD5	`72bdbc46737d95ddc52dffd33a495aa3`
BLAKE2b-256	`b02639d5b59aaec53571e443a87144571569905a371c004848eab4dc64391d24`

See more details on using hashes here.

File details

Details for the file guardians_of_the_token-1.1.1-py3-none-any.whl.

File metadata

Download URL: guardians_of_the_token-1.1.1-py3-none-any.whl
Upload date: May 16, 2026
Size: 57.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for guardians_of_the_token-1.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9915b563a65ec960735143436d9fc5c979e35fb20609c2851d18883207fd8b8c`
MD5	`ffbc8d04d12ba9abbb80c4bbf153e1e2`
BLAKE2b-256	`8a5a03cc317de6a5039f68b800b5d71311797b79ebbd44e7f364b8881d67d9d7`

See more details on using hashes here.

guardians-of-the-token 1.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

GOT

Why GOT

What It Guards

Product Principles

Install

How It Works

Prompt Guard (Claude Code only)

Experimental Surfaces

Configuration

Local Reports

Test Fixtures

Project Layout

Status

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes