Thin Python SDK for Latence TRACE.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

ddickmann

These details have not been verified by PyPI

Project links

Project description

Latence

Latence TRACE Python SDK

Real-time protection for AI agents. Verify answers, redact private data, reduce wasted context, and log decisions without replacing your stack.

PyPI · Quickstart · Sessions · Integrations · Examples · Website

pip install latence

from latence import Latence

trace = Latence(api_key="lat_...")

score = trace.grounding.rag(
    query="Can I promise this customer a refund?",
    response_text="Yes, the refund will arrive within 48 hours.",
    raw_context="Refunds require manual finance approval before timelines are promised.",
)

print(score.risk_band)
print(score.runtime_decision)

Why This Exists

Agents are moving from demos into real workflows. That means private data, unsupported answers, prompt attacks, tool drift, and wasted memory are no longer abstract research problems. They become support escalations, broken automations, audit gaps, token waste, and user trust issues.

TRACE is the protection layer that sits next to your RAG pipeline, coding agent, or tool-using workflow. Your agent keeps running. TRACE checks the turn and returns evidence plus a decision your application can route.

What TRACE Does

TRACE is intentionally small at the SDK layer. The heavy work lives in your TRACE runtime deployment; this package is the thin Python interface.

Verify RAG answers against retrieved context.
Score coding-agent output against codebase context.
Redact private data before it spreads through tools, logs, or prompts.
Compress and repair long-running context with InfiniMem.
Roll up an agent session into review-ready signals.
Persist caller-carried state without forcing sticky server sessions.

Proof Points

These are runtime proof points from the TRACE freeze evidence, not SDK-only microbenchmarks. See the linked artifacts for full context.

Grounding: local managed-runtime 360 reported 1.00 AUROC for grounded vs. ungrounded RAG cases.
Coding agents: local 360 reported 1.00 AUROC for code phantom detection.
Wasted context: held-out unused-context classification reported 1.00 precision and 1.00 recall.
Latency: local concurrency burst reported about 368 ms RAG p95 and 334 ms code p95 in the managed-runtime proof.
Privacy: redaction returns labels, offsets, scores, redacted output, entity counts, and timings for logging-ready GDPR workflows.
Memory: InfiniMem is designed for up to 90% context reduction while keeping hot context available to the agent.
Guard checks: Prompt Guard warmup proved torch.compile enabled on CUDA in the runtime proof.

Evidence:

How It Works

Deploy or access a TRACE runtime: RunPod, FastAPI, VPC, or on-prem.
Install latence.
Send the agent turn to the product path that matches the workflow.
Route on risk_band, runtime_decision, scores, spans, and evidence.
Store only the audit evidence your policy allows.

from latence import Latence

trace = Latence(
    api_key="lat_...",
    base_url="https://your-trace-endpoint.example.com",
)

Environment variables are supported:

export LATENCE_TRACE_API_KEY="lat_..."
export LATENCE_TRACE_URL="https://your-trace-endpoint.example.com"

The SDK Surface

The SDK mirrors the TRACE product API directly:

Privacy: client.privacy.redact(...)
RAG grounding: client.grounding.rag(...)
Code grounding: client.grounding.code(...)
Text compression: client.compression.text(...)
Message compression: client.compression.messages(...)
Memory update: client.memory.step(...)
Stateless rollup: client.rollup(...)
Caller-carried sessions: client.session(...)

Latence is synchronous. AsyncLatence exposes the same surface for asyncio services.

from latence import AsyncLatence, Latence

Base dependencies are only httpx and pydantic. Runtime and model packages such as torch, transformers, triton, FastAPI, and vLLM are not SDK dependencies.

Start With The Path You Need

RAG Agents

Use TRACE when your answer must be grounded in retrieved context.

score = trace.grounding.rag(
    query="What is the refund policy?",
    response_text=agent_answer,
    raw_context=retrieved_context,
)

if score.risk_band.value != "green":
    send_to_review(score)

Example: RAG grounding with guard checks

Coding Agents

Use TRACE when an agent explains, edits, or reasons over code.

score = trace.grounding.code(
    query="Does this patch add retry handling?",
    response_text=agent_answer,
    raw_context=code_context,
    extra={"response_language_hint": "python"},
)

Example: Code grounding

Privacy

Use TRACE before customer data enters prompts, tools, traces, or logs.

redacted = trace.privacy.redact(
    text="Email jane@example.com and charge IBAN DE89370400440532013000.",
)

print(redacted.redacted_text)
print(redacted.unique_labels)

Example: Privacy redaction

Compression And Memory

Use TRACE when long-running workflows start dragging dead context forward.

compressed = trace.compression.text(
    "Long retrieved context...",
    compression_rate=0.4,
)

memory = trace.memory.step(
    turn_text="User asked for manual refund approval.",
    prior_memory_state=current_state,
)
current_state = memory.next_memory_state

Examples: Compression, Memory step

Sessions

TRACE runtimes can stay stateless while the SDK carries state for your agent.

from latence import FileSessionStorage, Latence

trace = Latence()
session = trace.session(
    session_id="support-run-42",
    storage=FileSessionStorage(".trace-sessions"),
)

session.event("tool", "loaded refund policy")
session.memory_step(turn_text="Keep finance approval as required context.")
score = session.rag(
    query="Can I promise the refund?",
    response_text="Yes, the refund is guaranteed in 48 hours.",
    raw_context="Refunds require manual finance approval.",
)
session.save()

Docs: Sessions
Example: Session facade

For Whom

TRACE is for teams building or operating:

RAG products where unsupported answers are expensive.
Coding agents that need codebase-grounded reasoning over many steps.
Support agents that touch customer records and policies.
Legal, finance, healthcare, or regulated workflows that need evidence.
Internal agent platforms where observability, retries, and human review matter.

It is also useful for framework authors and platform teams that need one consistent protection API across LangGraph, LangChain, LlamaIndex, n8n, Cursor, Claude Code, Codex, and custom agent runners.

Integrations

Direct calls are the recommended path. Optional helpers live under latence.integrations.

pip install "latence[langchain]"
pip install "latence[llama_index]"
pip install "latence[openai]"
pip install "latence[haystack]"

Docs: Integrations
Example: Async batch

Phase 5 demos: LibreChat/OpenRouter, native SDK, LangChain, LlamaIndex, LangGraph, and n8n

Async

from latence import AsyncLatence

async with AsyncLatence() as trace:
    score = await trace.grounding.rag(
        query="What changed?",
        response_text="The policy now allows refunds.",
        raw_context="The policy still requires manual approval.",
    )

Now What

If you are integrating TRACE into an agent:

Run the quickstart.
Pick one product path: RAG, code, privacy, compression, memory, or session.
Add one route in your app for green, amber, and red.
Log request ID, risk band, runtime decision, and redacted evidence.
Replay a few real failures and tune your thresholds.

If you are publishing or validating this SDK:

python -m pip install -e ".[dev]"
python -m pytest
python -m ruff check .
python scripts/check_contract.py --manifest ../latence-trace/docs/core_freeze/api_surface_manifest.json
python -m build
python -m twine check dist/*

Clean-wheel smoke testing should run outside the repo root:

python -m venv /tmp/latence-sdk-smoke
/tmp/latence-sdk-smoke/bin/pip install dist/*.whl
cd /tmp && /tmp/latence-sdk-smoke/bin/python - <<'PY'
from importlib.metadata import distribution
from latence import Latence

requires = distribution("latence").requires or []
for forbidden in ("torch", "transformers", "triton", "fastapi", "vllm"):
    assert not any(req.lower().startswith(forbidden) for req in requires), requires

assert Latence(base_url="http://localhost:8090")
print("latence SDK smoke passed")
PY

Migration

Primary imports:

from latence import Latence, AsyncLatence

Preview aliases remain available so existing TRACE preview code can move first and clean up names later:

from latence import LatenceTraceClient, AsyncLatenceTraceClient

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

ddickmann

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.0

May 9, 2026

This version

0.1.6

May 7, 2026

0.1.5

May 7, 2026

0.1.4

May 5, 2026

0.1.3

Apr 28, 2026

0.1.2

Apr 24, 2026

0.1.1

Mar 26, 2026

0.1.0

Mar 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

latence-0.1.6.tar.gz (53.4 kB view details)

Uploaded May 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

latence-0.1.6-py3-none-any.whl (47.3 kB view details)

Uploaded May 7, 2026 Python 3

File details

Details for the file latence-0.1.6.tar.gz.

File metadata

Download URL: latence-0.1.6.tar.gz
Upload date: May 7, 2026
Size: 53.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for latence-0.1.6.tar.gz
Algorithm	Hash digest
SHA256	`84c2111b9be5f643f38b379c5964860bb5c1d32e6da11816a6ec869510eeb3c1`
MD5	`f7124541e2a8e96d6a5404b45c0eab5c`
BLAKE2b-256	`e2e7769527d893303cd776438f0cebbe5537db5338e926c0ba19cc5864937b93`

See more details on using hashes here.

Provenance

The following attestation bundles were made for latence-0.1.6.tar.gz:

Publisher: publish.yml on latenceainew/latence-trace-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: latence-0.1.6.tar.gz
- Subject digest: 84c2111b9be5f643f38b379c5964860bb5c1d32e6da11816a6ec869510eeb3c1
- Sigstore transparency entry: 1460031026
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: latenceainew/latence-trace-python@23aa5eb5054c71e065c3312f5ef4c9fd62aab82e
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/latenceainew
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@23aa5eb5054c71e065c3312f5ef4c9fd62aab82e
- Trigger Event: release

File details

Details for the file latence-0.1.6-py3-none-any.whl.

File metadata

Download URL: latence-0.1.6-py3-none-any.whl
Upload date: May 7, 2026
Size: 47.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for latence-0.1.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9a747ca20f8e4e3e90da090391b58c92c3f233624599456d8fcc0864d707903d`
MD5	`e7443a9bd365f96c19f9a9c7551097db`
BLAKE2b-256	`292d011ab105aa39d9a47edb2cc6bd356ef17b1798a484a12c3798fa3a219b78`

See more details on using hashes here.

Provenance

The following attestation bundles were made for latence-0.1.6-py3-none-any.whl:

Publisher: publish.yml on latenceainew/latence-trace-python

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: latence-0.1.6-py3-none-any.whl
- Subject digest: 9a747ca20f8e4e3e90da090391b58c92c3f233624599456d8fcc0864d707903d
- Sigstore transparency entry: 1460031603
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: latenceainew/latence-trace-python@23aa5eb5054c71e065c3312f5ef4c9fd62aab82e
- Branch / Tag: refs/tags/v0.1.6
- Owner: https://github.com/latenceainew
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@23aa5eb5054c71e065c3312f5ef4c9fd62aab82e
- Trigger Event: release

latence 0.1.6

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Latence TRACE Python SDK

Why This Exists

What TRACE Does

Proof Points

How It Works

The SDK Surface

Start With The Path You Need

RAG Agents

Coding Agents

Privacy

Compression And Memory

Sessions

For Whom

Integrations

Async

Now What

Migration

More

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance