LLM proxy that analyses token usage in context windows

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

ContextSpy

A local proxy that intercepts traffic between coding agents (GitHub Copilot, Claude, opencode, OpenAI SDK scripts, etc.) and LLM APIs — either cloud provider APIs or local LLM servers — then analyses context composition and token usage.

Features

Two proxy modes: forward proxy (cloud APIs) and reverse proxy (local LLM servers)
Provider support: OpenAI, Anthropic (Claude), Ollama, llama.cpp, vLLM
Agent detection: Copilot, Claude Desktop, opencode, Cursor, and generic clients
Context analysis: breaks input tokens into 8 categories:
- System prompt, Tool definitions, Tool results, File contents, Conversation history, Current user message, Assistant prefill, Uncategorised
Token estimation via tiktoken (cl100k_base)
Live dashboard — real-time WebSocket updates, charts, session grouping
Session tracking — manually start/end named sessions to group requests
SQLite storage — all data stored locally in ~/.contextspy/

Mode 1: Cloud API Mode (forward proxy)

Use this mode when you want to intercept requests going to cloud LLM APIs such as OpenAI, Anthropic, GitHub Copilot, or Azure OpenAI.

ContextSpy acts as an HTTPS man-in-the-middle proxy. It terminates TLS, inspects the request, and re-encrypts it before forwarding to the provider. This requires installing a local CA certificate once so your OS and tools trust ContextSpy's dynamically-signed server certificates.

Prerequisites

Python 3.11+
uv (pip install uv) — or plain pip
Administrator / sudo access (for CA cert installation)
Node.js 18+ and npm — only needed if you want to modify the frontend

Install

From PyPI (recommended):

pip install contextspy
# or with uv:
uv tool install contextspy

From source:

git clone https://github.com/RimantasZ/contextspy.git
cd contextspy
uv venv
uv pip install -e .

Build the UI (optional — only needed if you change the frontend)

The built UI is bundled with the package. Only rebuild if you modify ui/src/:

cd ui
npm install
npm run build   # outputs to contextspy/_web/
cd ..

Step 1 — Start ContextSpy

# Windows (PowerShell)
uv run contextspy start

# macOS / Linux
uv run contextspy start

This starts:

mitmproxy HTTPS forward proxy on port 8888
FastAPI web dashboard on port 5173
Opens http://127.0.0.1:5173 in your browser automatically

Step 2 — Install the CA certificate

mitmproxy generates a local CA certificate on first run. You must install it into your OS trust store so HTTPS connections through the proxy are trusted.

Automatic install (run once):

uv run contextspy install-cert

Windows: runs certutil -addstore Root ~/.mitmproxy/mitmproxy-ca.pem (requires elevated prompt or UAC prompt)
macOS: runs security add-trusted-cert -d -r trustRoot -k /Library/Keychains/System.keychain ... (requires sudo)
Linux: copies cert to /usr/local/share/ca-certificates/ and runs update-ca-certificates

You can also install from the dashboard → Settings → CA Certificate tab.

One-time operation. The certificate persists in your trust store across restarts. You only need to re-run this if you delete ~/.mitmproxy/ or reinstall the OS.

Step 3 — Configure your agent to use the proxy

Choose the agent you want to intercept and follow the corresponding instructions. You can also run uv run contextspy setup-<agent> for a printed reminder.

GitHub Copilot (VS Code)

Option A — VS Code settings.json (Ctrl+Shift+P → "Open User Settings JSON"):

{
  "http.proxy": "http://127.0.0.1:8888",
  "http.proxyStrictSSL": false
}

Option B — environment variables (set before launching VS Code):

# PowerShell
$env:HTTPS_PROXY = "http://127.0.0.1:8888"
$env:NODE_EXTRA_CA_CERTS = "$env:USERPROFILE\.mitmproxy\mitmproxy-ca-cert.pem"

# Bash / zsh
export HTTPS_PROXY=http://127.0.0.1:8888
export NODE_EXTRA_CA_CERTS=~/.mitmproxy/mitmproxy-ca-cert.pem

Run for the exact snippet:

uv run contextspy setup-copilot

Claude CLI / Claude Code

# PowerShell
$env:HTTPS_PROXY = "http://127.0.0.1:8888"
$env:NODE_EXTRA_CA_CERTS = "$env:USERPROFILE\.mitmproxy\mitmproxy-ca-cert.pem"

# Bash / zsh
export HTTPS_PROXY=http://127.0.0.1:8888
export NODE_EXTRA_CA_CERTS=~/.mitmproxy/mitmproxy-ca-cert.pem

NODE_EXTRA_CA_CERTS is required because Claude CLI is an Electron/Node app and has its own bundled certificate store that ignores the OS trust store. Use mitmproxy-ca-cert.pem (cert-only), not mitmproxy-ca.pem (key+cert bundle).

Run for the exact snippet:

uv run contextspy setup-claude

opencode

# PowerShell
$env:HTTPS_PROXY = "http://127.0.0.1:8888"
$env:SSL_CERT_FILE = "$env:USERPROFILE\.mitmproxy\mitmproxy-ca-cert.pem"
$env:NODE_EXTRA_CA_CERTS = "$env:USERPROFILE\.mitmproxy\mitmproxy-ca-cert.pem"

# Bash / zsh
export HTTPS_PROXY=http://127.0.0.1:8888
export SSL_CERT_FILE=~/.mitmproxy/mitmproxy-ca-cert.pem
export NODE_EXTRA_CA_CERTS=~/.mitmproxy/mitmproxy-ca-cert.pem

opencode uses both the Go TLS stack (SSL_CERT_FILE) and Node.js components (NODE_EXTRA_CA_CERTS), so both variables are needed.

Run for the exact snippet:

uv run contextspy setup-opencode

Python / OpenAI SDK / httpx scripts

import os
os.environ["HTTPS_PROXY"] = "http://127.0.0.1:8888"

Or set env vars before running your script:

# PowerShell
$env:HTTPS_PROXY = "http://127.0.0.1:8888"
python your_script.py

# Bash
HTTPS_PROXY=http://127.0.0.1:8888 python your_script.py

Generic (curl, httpx CLI, etc.)

export HTTPS_PROXY=http://127.0.0.1:8888
export HTTP_PROXY=http://127.0.0.1:8888

Step 4 — Use the dashboard

Open http://127.0.0.1:5173. Requests appear in real-time as your agent makes LLM calls.

Dashboard — token usage totals, category breakdown chart, model distribution
Requests — table of all captured requests with token counts and category bars
Sessions — group requests by task; click Start Session and give it a name

Mode 2: Local LLM Mode (reverse proxy)

Use this mode when you want to intercept requests going to local LLM servers such as llama.cpp / llama-server, Ollama, or vLLM.

Why a different mode? When client and server are both on 127.0.0.1, operating systems route loopback traffic directly — they bypass HTTPS_PROXY entirely. A forward proxy cannot intercept this traffic. Instead, ContextSpy acts as a reverse proxy: the client connects to ContextSpy's listen port; ContextSpy forwards to the real server. No TLS, no certificate installation needed.

Client (opencode / script)
  base_url = http://127.0.0.1:8889/v1   ← ContextSpy listen port
      │
      ▼
ContextSpy reverse proxy (port 8889)
      │
      ▼
llama-server                            ← actual server
  127.0.0.1:8080

Prerequisites

Python 3.11+ and uv (same as cloud mode)
A running local LLM server (llama-server, Ollama, or vLLM) — see setup sections below
No CA certificate needed

Step 1 — Configure `~/.contextspy/config.toml`

Add one [[reverse_targets]] block per local server you want to intercept. The file is auto-created at ~/.contextspy/config.toml on first run (or after running any contextspy command).

# Example: intercept llama-server running on port 8080
[[reverse_targets]]
name        = "llama-server"
listen_port = 8889
target_url  = "http://127.0.0.1:8080"
provider    = "openai"

# Example: intercept Ollama /v1 endpoint on port 11434
[[reverse_targets]]
name        = "ollama"
listen_port = 8890
target_url  = "http://127.0.0.1:11434"
provider    = "openai"

# Example: intercept vLLM on port 8000
[[reverse_targets]]
name        = "vllm"
listen_port = 8891
target_url  = "http://127.0.0.1:8000"
provider    = "openai"

Fields:

Field	Required	Description
`name`	yes	Human-readable label shown in the CLI and logs
`listen_port`	yes	Port ContextSpy binds on `127.0.0.1`
`target_url`	yes	Full base URL of the local LLM server
`provider`	yes	Response parser: `"openai"` for all three servers above

All three servers implement the OpenAI-compatible /v1/chat/completions API, so provider = "openai" is correct in all cases.

You can run uv run contextspy setup-llamaserver (or -ollama, -vllm) for a ready-to-paste config snippet and client configuration instructions.

Step 2 — Start ContextSpy in local mode

uv run contextspy start-local

This starts:

One mitmproxy reverse-proxy listener per [[reverse_targets]] entry
FastAPI web dashboard on port 5173
Opens http://127.0.0.1:5173 in your browser automatically

If [[reverse_targets]] is empty or missing, the command prints a config example and exits without starting anything.

Step 3 — Point your client at ContextSpy

Instead of connecting to the LLM server directly, configure your client to use the ContextSpy listen port.

llama-server (llama.cpp)

Default setup: llama-server on port 8080, ContextSpy on port 8889.

# Print the full setup reminder
uv run contextspy setup-llamaserver

# In your Python / openai SDK client:
from openai import OpenAI
client = OpenAI(
    base_url="http://127.0.0.1:8889/v1",   # ContextSpy port
    api_key="not-needed",
)

# For opencode: set the model's base URL to ContextSpy
# In your opencode config (providers section):
# base_url: http://127.0.0.1:8889/v1

Ollama

Default setup: Ollama on port 11434, ContextSpy on port 8890.

Ollama ≥ 0.1.24 is required for the /v1/chat/completions OpenAI-compatible endpoint. Older versions only have /api/chat.

# Print the full setup reminder
uv run contextspy setup-ollama

# In your Python / openai SDK client:
from openai import OpenAI
client = OpenAI(
    base_url="http://127.0.0.1:8890/v1",   # ContextSpy port
    api_key="ollama",                        # Ollama ignores the key
)

Alternative for cloud mode: If your Ollama client respects HTTPS_PROXY, you can use contextspy start (forward proxy mode) instead — Ollama is in the built-in hostname filter for localhost:11434. Run uv run contextspy setup-ollama for both options.

vLLM

Default setup: vLLM on port 8000, ContextSpy on port 8891.

# Print the full setup reminder
uv run contextspy setup-vllm

# In your Python / openai SDK client:
from openai import OpenAI
client = OpenAI(
    base_url="http://127.0.0.1:8891/v1",   # ContextSpy port
    api_key="not-needed",
)

Step 4 — Use the dashboard

Same as cloud mode — open http://127.0.0.1:5173. All captured requests appear in real-time regardless of which local server they came from.

CLI reference

contextspy start [--proxy-port 8888] [--web-port 5173] [--no-browser]
    Start in cloud/forward-proxy mode. Opens browser on startup.

contextspy start-local [--web-port 5173] [--no-browser]
    Start in local/reverse-proxy mode. Reads [[reverse_targets]] from config.toml.

contextspy install-cert
    Install mitmproxy CA certificate into OS trust store (cloud mode only).

contextspy status
    Show proxy running state, active session, DB path.

contextspy reset-db [--yes]
    Delete ALL requests and sessions (prompts for confirmation).

contextspy db-stats
    Print database row counts (works offline).

contextspy report
    Print aggregate token stats and category breakdown table (works offline).

contextspy setup-claude        Print proxy env-var commands for Claude CLI
contextspy setup-copilot       Print proxy settings for VS Code / GitHub Copilot
contextspy setup-opencode      Print proxy env-var commands for opencode
contextspy setup-llamaserver   Print config.toml snippet and client URL for llama-server
contextspy setup-ollama        Print config.toml snippet and client URL for Ollama
contextspy setup-vllm          Print config.toml snippet and client URL for vLLM

contextspy session start <name>   Start a named capture session
contextspy session end            End the active session
contextspy session list           List all sessions

Architecture

Cloud mode

coding agent → HTTPS_PROXY → mitmproxy (port 8888)
                                  │ TLS terminate + forward
                              cloud LLM API
                                  │
                            ContextSpyAddon
                              → parse, classify, count tokens
                              → write to SQLite
                              → broadcast WebSocket

Local mode

client (base_url=:8889) → mitmproxy reverse proxy (port 8889)
                                  │ plain HTTP forward
                            llama-server (port 8080)
                                  │
                            ContextSpyAddon (provider_override="openai")
                              → parse, classify, count tokens
                              → write to SQLite
                              → broadcast WebSocket

Both modes share the same FastAPI web server (port 5173), SQLite database, and dashboard.

Data storage

All data is stored in ~/.contextspy/:

Path	Description
`~/.contextspy/contextspy.db`	SQLite database
`~/.contextspy/config.toml`	Configuration (auto-created)

Raw request/response bodies are stored per-request and purged automatically 7 days after capture to save disk space (on next server startup).

Token estimation accuracy

Token counts are estimates using tiktoken cl100k_base encoding. Accuracy varies by provider:

Provider	Expected error
OpenAI (GPT-4, GPT-4o)	~2–5%
Anthropic (Claude)	~5–15%
Ollama / llama.cpp / vLLM	~10–20%

When the provider reports exact token counts in the API response, those are stored alongside the estimate for comparison on the request detail page.

Development

Backend

uv venv
uv pip install -e ".[dev]"
uvicorn contextspy.api.main:create_app --factory --reload --port 5173

Frontend

cd ui
npm install
npm run dev   # Vite on :5174, proxies /api to :5173

License

Apache 2.0 — see LICENSE and NOTICE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

RimantasZ

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.3

May 20, 2026

This version

0.1.1

May 19, 2026

0.1.0

May 17, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contextspy-0.1.1.tar.gz (502.4 kB view details)

Uploaded May 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

contextspy-0.1.1-py3-none-any.whl (510.1 kB view details)

Uploaded May 19, 2026 Python 3

File details

Details for the file contextspy-0.1.1.tar.gz.

File metadata

Download URL: contextspy-0.1.1.tar.gz
Upload date: May 19, 2026
Size: 502.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for contextspy-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`db0fd391a0ed210018d28cc4791598e93f714eae377b6490f72761a91c8e27f6`
MD5	`a4e4bd8f77f43988c801441245e0d2ae`
BLAKE2b-256	`a4a995bd01bf07160fba901a1683dcefb8b212c373dcc2328a506760068d64df`

See more details on using hashes here.

Provenance

The following attestation bundles were made for contextspy-0.1.1.tar.gz:

Publisher: publish.yml on RimantasZ/contextspy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: contextspy-0.1.1.tar.gz
- Subject digest: db0fd391a0ed210018d28cc4791598e93f714eae377b6490f72761a91c8e27f6
- Sigstore transparency entry: 1575589002
- Sigstore integration time: May 19, 2026
Source repository:
- Permalink: RimantasZ/contextspy@b2c80528df35741c2644dfc4934d3bbf62901820
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/RimantasZ
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b2c80528df35741c2644dfc4934d3bbf62901820
- Trigger Event: push

File details

Details for the file contextspy-0.1.1-py3-none-any.whl.

File metadata

Download URL: contextspy-0.1.1-py3-none-any.whl
Upload date: May 19, 2026
Size: 510.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for contextspy-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0009684d1a1c57654a6620018bb1eba51b43528381f87471a0316a332f2059f5`
MD5	`14bcf0e04f449db4a4d544b29e28ed85`
BLAKE2b-256	`81fe95b0eef9bf7cf2230d41c644927f44a43f639ccff8d2570eb696965d67d3`

See more details on using hashes here.

Provenance

The following attestation bundles were made for contextspy-0.1.1-py3-none-any.whl:

Publisher: publish.yml on RimantasZ/contextspy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: contextspy-0.1.1-py3-none-any.whl
- Subject digest: 0009684d1a1c57654a6620018bb1eba51b43528381f87471a0316a332f2059f5
- Sigstore transparency entry: 1575589014
- Sigstore integration time: May 19, 2026
Source repository:
- Permalink: RimantasZ/contextspy@b2c80528df35741c2644dfc4934d3bbf62901820
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/RimantasZ
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b2c80528df35741c2644dfc4934d3bbf62901820
- Trigger Event: push

contextspy 0.1.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ContextSpy

Features

Mode 1: Cloud API Mode (forward proxy)

Prerequisites

Install

Build the UI (optional — only needed if you change the frontend)

Step 1 — Start ContextSpy

Step 2 — Install the CA certificate

Step 3 — Configure your agent to use the proxy

GitHub Copilot (VS Code)

Claude CLI / Claude Code

opencode

Python / OpenAI SDK / httpx scripts

Generic (curl, httpx CLI, etc.)

Step 4 — Use the dashboard

Mode 2: Local LLM Mode (reverse proxy)

Prerequisites

Step 1 — Configure ~/.contextspy/config.toml

Step 2 — Start ContextSpy in local mode

Step 3 — Point your client at ContextSpy

llama-server (llama.cpp)

Ollama

vLLM

Step 4 — Use the dashboard

CLI reference

Architecture

Cloud mode

Local mode

Data storage

Token estimation accuracy

Development

Backend

Frontend

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Step 1 — Configure `~/.contextspy/config.toml`