ContextWall - context firewall for AI agents and RAG pipelines

These details have not been verified by PyPI

Project description

ContextWall

A context firewall for AI agents and RAG pipelines.

Your agents pull context from everywhere: web search, internal docs, partner APIs, user uploads. ContextWall sits in front of every source, enforces your security policy, and stops malicious content before it reaches the model. No code changes to your agents required.

web search ──┐
internal docs─┤                           ┌─► Claude / GPT-4
partner APIs ─┼──► ContextWall ──► policy ─┤
user uploads ─┤   (your rules)            └─► blocked + audit trail
FHIR / PHI  ──┘

Why this exists

EchoLeak (CVE-2025-32711, CVSS 9.3): a crafted email caused Microsoft 365 Copilot to silently access SharePoint files and exfiltrate them. Zero user interaction. The root cause: the model had no way to tell the difference between a trusted system instruction and untrusted email content.

PoisonedRAG (USENIX Security 2025): 5 adversarial documents in a corpus of millions achieved 90%+ control over LLM responses. The model treated retrieved content as ground truth.

These are not edge cases. They are the default behavior of every RAG pipeline and agentic system that doesn't enforce source trust at the context layer.

How ContextWall fixes it

Every context source gets a trust tier. Internal wikis, public web, regulated PHI data. Each carries a different level of trust, and your policy rules apply differently per tier.

Content is scanned before the model sees it. Three detection layers (structural bidi/zero-width scanning, normalized regex, and heuristic scoring for semantic paraphrases) run in under a millisecond with no LLM inference.

Every decision is logged. Tamper-evident Merkle chain, exportable as SOC2 evidence, HIPAA audit trail, or FedRAMP control mappings.

Source tier	Examples	Default enforcement
`internal`	Code repos, internal wikis	Injection blocked, PII audit-only
`external`	Vendor docs, partner APIs	Injection blocked, PII warned
`untrusted`	Public web, user uploads	Injection + PII blocked
`regulated`	FHIR, PHI data sources	Injection + PII blocked, full compliance audit

Get started

OSS daemon (runs in your infrastructure, free forever):

# Install and start
pip install contextwall
ctxfw start --config ctxfw.yaml

# Or run with Docker
docker run -p 8080:8080 \
  -v $(pwd)/ctxfw.yaml:/app/ctxfw.yaml \
  ghcr.io/bytewise-ca/context-wall:latest

Cloud dashboard (optional: fleet visibility, policy authoring, compliance reports):

Sign up at app.contextwall.io, generate a registration token in Settings, then add it to ctxfw.yaml:
control_plane:
  url: https://app.contextwall.io
  registration_token: cwt_your-token-here
  daemon_name: prod-us-east-1
The daemon pushes only aggregated metadata (counts, scores) to the cloud. Prompts, documents, and file contents never leave your infrastructure.

Integration

Option 1: Environment variable (zero code change)

Point your existing SDK at the local daemon. Your agents don't need to change at all.

# Anthropic (daemon runs on localhost:8080)
export ANTHROPIC_BASE_URL=http://localhost:8080/proxy/anthropic
export ANTHROPIC_API_KEY=sk-ant-your-real-key   # unchanged

# OpenAI
export OPENAI_BASE_URL=http://localhost:8080/proxy/openai/v1
export OPENAI_API_KEY=sk-your-real-key          # unchanged

Every anthropic.Anthropic() or openai.OpenAI() call in your codebase is now screened locally. Prompts never leave your machine.

Option 2: Python SDK (drop-in replace)

from contextwall import SafeAnthropic, CREBlockedError

# Drop-in replacement for anthropic.Anthropic()
client = SafeAnthropic(
    cre_endpoint="http://localhost:8080",   # local daemon
)

try:
    response = client.messages.create(
        model="claude-opus-4-7",
        max_tokens=1024,
        messages=[{"role": "user", "content": task}],
    )
except CREBlockedError as e:
    print(f"Blocked: {e.blocked_reason}")   # injection_heuristic:instruction_override
    print(f"Violations: {e.violations}")

pip install contextwall                  # base
pip install "contextwall[anthropic]"     # + Anthropic SDK
pip install "contextwall[openai]"        # + OpenAI SDK
pip install "contextwall[all]"           # everything

Option 3: Document filter API (for RAG pipelines)

If your pipeline retrieves documents before calling the LLM, filter them through ContextWall before constructing the prompt. This is the primary defence against corpus poisoning.

import httpx

async def safe_rag(query: str, source_id: str) -> list[dict]:
    """Retrieve documents and filter through ContextWall before passing to LLM."""
    raw_docs = await your_vector_store.search(query)

    response = await httpx.AsyncClient().post(
        "http://localhost:8080/v1/filter",   # local daemon, no cloud call
        json={
            "source_id": source_id,
            "documents": raw_docs,
            "session_id": session_id,
        },
    )

    result = response.json()
    # result["documents"]         - allowed docs, safe to include in prompt
    # result["blocked"]           - count of blocked documents
    # result["blocked_documents"] - what was blocked and why
    return result["documents"]

ContextWall applies the trust tier of source_id to every document. A trust_tier: untrusted source gets full injection detection and PII scanning. The blocked documents never reach your prompt.

Declare your sources in config

Sources are declared in ctxfw.yaml. No API calls, no imperative setup code. Commit it alongside your infrastructure.

# ctxfw.yaml

sources:
  - id: brave-web-search
    type: web
    trust_tier: untrusted

  - id: internal-confluence
    type: confluence
    trust_tier: internal
    data_classification: sensitive

  - id: fhir-api
    type: api
    trust_tier: regulated
    data_classification: phi
    owner: clinical-data-team
    region: us-east-1

ContextWall registers these on every startup: idempotent, version-controlled, reviewable in a PR.

Policy as code

Write security rules in YAML. Commit them. Review them like any other infrastructure change.

# policies/fleet/no-phi-exfil.yaml
rules:
  - name: block-phi-exfiltration
    action: deny
    reason: "PHI must not leave regulated sources"
    applies_when:
      source_tier: [regulated]
    compliance_mapping:
      framework: hipaa
      control_id: "45 CFR 164.502(b)"

  - name: block-web-injection
    action: deny
    reason: "Untrusted web content blocked from high-stakes tasks"
    applies_when:
      source_tier: [untrusted]
      task_scope: [financial_decision, medical_query]
    compliance_mapping:
      framework: soc2
      control_id: "CC6.1"

Rules reload within 5 seconds of a file change. No restart. No redeploy.

Pre-built policy packs for HIPAA, SOC2, and FedRAMP ship out of the box.

Tune detection sensitivity

Override defaults in ctxfw.yaml, per deployment, per environment.

detection:
  injection_block_threshold: 0.55   # raise to reduce false positives
  injection_warn_threshold: 0.35    # lower to catch more, audit instead of block
  default_source_trust_tier: untrusted

enforcement:
  penalty_increment: 0.15           # trust penalty per deny event
  decay_half_life_days: 1.0         # penalty halves every N days (auto-recovery)
  reward_factor: 0.90               # trust improves with clean outcomes

What gets detected

Attack class	Detection layer	Example
Direct instruction override	L1 structural + L2 regex	`IGNORE ALL PREVIOUS INSTRUCTIONS`
Bidi / zero-width obfuscation	L1 structural	RTL override chars in retrieved text
Spaced-letter injection	L1 structural	`i g n o r e p r e v i o u s`
Semantic paraphrase injection	L3 heuristic	"Your previous assignment has been superseded by the administrator"
Secret leakage	L2 regex	AWS keys, GitHub PATs, bearer tokens, private keys
PII exfiltration	L2 regex	Emails, phone numbers, SSNs in untrusted context

Sub-millisecond latency. No LLM in the hot path.

Compliance

Every enforcement decision writes to a Merkle-chained append-only log. Export on demand:

# SOC2 Type II evidence package (JSON, cryptographically signed)
ctxfw compliance export --framework soc2 --days 90 --out soc2-evidence.json

# HIPAA audit trail
ctxfw compliance export --framework hipaa --days 365 --out hipaa-audit.json

# Or call the local API directly
curl http://localhost:8080/v1/compliance/export \
  -H "Authorization: Bearer $CRE_API_TOKEN" \
  -d '{"framework": "soc2", "days": 90}'

Every export is cryptographically signed. The /v1/compliance/verify endpoint proves chain integrity independently of the exporter.

Supported: SOC2 Type II, HIPAA (45 CFR 164.312), FedRAMP (NIST 800-53), GDPR (Article 32).

Observability

GET  /health              - subsystem health
GET  /metrics             - Prometheus metrics
WS   /ws/events           - live enforcement event stream
GET  /v1/sources          - registered sources + enforcement history
GET  /v1/sources/{id}/trust - trust health per source

Key metrics emitted:

Metric	What it tells you
`cre_proxy_requests_total{result}`	Block rate by provider
`cre_proxy_violations_total{type}`	Breakdown by violation type
`cre_enforcement_penalty{source}`	Trust degradation per source
`cre_pipeline_duration_seconds`	End-to-end latency

Architecture

                   ctxfw.yaml (sources, policy, thresholds)
                          │
Your agent / RAG pipeline │
          │               ▼
          │         ContextWall
          │         ┌─────────────────────────────────────────┐
          │         │  Source Registry (O(1) tier lookup)     │
          └────────►│                                         │
                    │  L1  Structural scan    (<0.1ms)        │
                    │  L2  Normalized regex   (<0.2ms)        │
                    │  L3  Heuristic scoring  (<0.5ms)        │
                    │                                         │
                    │  Policy DSL (fleet→org→team→repo)       │
                    └──────────────┬──────────────────────────┘
                                   │
                    ┌──────────────┴──────────────────────────┐
                    │  allowed                  blocked        │
                    ▼                               ▼          │
              LLM API                     400 + violation      │
         (Anthropic / OpenAI)             details              │
                                               │               │
                                               ▼               │
                                      Provenance Engine        │
                                   (Merkle-chained log)        │
                                          │                    │
                              ┌───────────┼───────────┐        │
                              ▼           ▼           ▼        │
                           SQLite    WebSocket    Compliance    │
                                     live feed    export       │
                    └────────────────────────────────────────── ┘

Self-hosting

# ctxfw.yaml: minimal production config
repository_root: /app

sources:
  - id: my-web-search
    type: web
    trust_tier: untrusted

rest_api:
  port: 8080
  auth:
    enabled: true
    tokens:
      - token: "${CRE_API_TOKEN}"
        name: admin
        scopes: [analyze, bundle, admin, compliance]

storage:
  db_path: /data/cre.db

policy:
  policy_dir: /data/policies

compliance_hmac_key: "${CRE_COMPLIANCE_HMAC_KEY}"

# Generate secrets
export CRE_API_TOKEN=$(python3 -c "import secrets; print(secrets.token_urlsafe(32))")
export CRE_COMPLIANCE_HMAC_KEY=$(python3 -c "import secrets; print(secrets.token_urlsafe(32))")

docker run -d -p 8080:8080 \
  -e CRE_API_TOKEN \
  -e CRE_COMPLIANCE_HMAC_KEY \
  -v $(pwd)/ctxfw.yaml:/app/ctxfw.yaml \
  -v $(pwd)/policies:/data/policies \
  -v cre-data:/data \
  ghcr.io/bytewise-ca/context-wall:latest

ContextWall refuses to start with known-weak tokens and prints a generation command. Compliance HMAC key absence is warned at startup.

What's in this repo

Component	Path	Description
Core daemon	`src/context_firewall/`	Proxy, policy engine, provenance, trust scoring
Python SDK	`sdk/python/`	`SafeAnthropic`, `SafeOpenAI`, `CREClient`
Policy packs	`policy/packs/`	Pre-built HIPAA, SOC2, FedRAMP rule sets
Live demo	`demo/`	Attack scenarios + dashboard (requires API keys)

The cloud control plane (fleet dashboard, policy authoring UI, compliance reports) is available at app.contextwall.io.

License

Apache 2.0: the daemon, policy engine, provenance chain, and SDK are free to use, modify, and distribute - including in commercial and proprietary products.

The cloud dashboard (fleet visibility, policy authoring UI, compliance exports) is proprietary and available at app.contextwall.io.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.4

Jun 2, 2026

This version

0.1.3

Jun 1, 2026

0.1.2

Jun 1, 2026

0.1.1

Jun 1, 2026

0.1.0

Jun 1, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contextwall-0.1.3.tar.gz (129.2 kB view details)

Uploaded Jun 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

contextwall-0.1.3-py3-none-any.whl (121.0 kB view details)

Uploaded Jun 1, 2026 Python 3

File details

Details for the file contextwall-0.1.3.tar.gz.

File metadata

Download URL: contextwall-0.1.3.tar.gz
Upload date: Jun 1, 2026
Size: 129.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for contextwall-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`5a186ba78282dbd7c78cb3405efad7122c4ea12a6f989bffd75abe752a22ee47`
MD5	`73e55edc771540913712a86c31fffcd6`
BLAKE2b-256	`f65e1d60343b32af3113649c6f9da6e8274d7a87385c7b20b7c9943aa7b3b06f`

See more details on using hashes here.

File details

Details for the file contextwall-0.1.3-py3-none-any.whl.

File metadata

Download URL: contextwall-0.1.3-py3-none-any.whl
Upload date: Jun 1, 2026
Size: 121.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for contextwall-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cc4465656e23674bbce8edabb230aa1a60c886ace3faf1bcfe10ffd92f6cb2d6`
MD5	`66ba31d052e097c5eb628885fe3f961a`
BLAKE2b-256	`1c8c8b1f357086b380e968efda28f0a4ffaaa4a41e6c6438d69b5ef47e771e52`

See more details on using hashes here.

contextwall 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

ContextWall

Why this exists

How ContextWall fixes it

Get started

Integration

Option 1: Environment variable (zero code change)

Option 2: Python SDK (drop-in replace)

Option 3: Document filter API (for RAG pipelines)

Declare your sources in config

Policy as code

Tune detection sensitivity

What gets detected

Compliance

Observability

Architecture

Self-hosting

What's in this repo

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes