Cognitive observability for LLM agents. @styxx.profile decorator returns a per-step cognometric readout: drift, confabulation, refusal, sycophancy, phase transition, low trust, incoherence — localized to the step that produced them. Calibrated AUC: 0.998 hallucination (HaluEval-QA), 0.976 refusal (XSTest-GPT-4), 0.943 tool-call drift (BFCL v3). Reference implementation of the Cognometric Fingerprint Specification v1.0. Pure Python.

These details have not been verified by PyPI

Project links

Project description

   ███████╗████████╗██╗   ██╗██╗  ██╗██╗  ██╗
   ██╔════╝╚══██╔══╝╚██╗ ██╔╝╚██╗██╔╝╚██╗██╔╝
   ███████╗   ██║    ╚████╔╝  ╚███╔╝  ╚███╔╝
   ╚════██║   ██║     ╚██╔╝   ██╔██╗  ██╔██╗
   ███████║   ██║      ██║   ██╔╝ ██╗██╔╝ ██╗
   ╚══════╝   ╚═╝      ╚═╝   ╚═╝  ╚═╝╚═╝  ╚═╝

           · · · nothing crosses unseen · · ·

Cognitive observability for LLM agents

py-spy for LLM reasoning. One decorator · seven fault kinds · per-step localization. langsmith tells you the trace broke — styxx tells you why.

`0.998 HaluEval · 0.976 XSTest · 0.943 BFCL · No LLM.`

v6.2.0 · `styxx.profile` — py-spy for LLM reasoning

import styxx
from styxx import OpenAI   # any LLM-using function works inside @profile —
                            # raw openai, langchain, crewai, autogen, custom

@styxx.profile
def my_agent(task):
    client = OpenAI()
    r = client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[{"role": "user", "content": task}],
        logprobs=True, top_logprobs=5,
    )
    return r.choices[0].message.content

result, p = my_agent("summarize this contract")
print(p.summary)
# this single-step example produces:
#   profile 'my_agent': 1 step, 1.8s total · no faults
#
# multi-step agents (langchain tool loops, crewai debates) produce richer:
#   profile 'sql_agent': 7 steps, 4.3s total
#     [drift]     step=3 sev=0.89 · category='tool_arg_drift'
#     [confab]    step=4 sev=0.92 · category='confab'
#     [sycophant] step=5 sev=0.78 · sycophantic tone

p.to_html("run.html")      # self-contained flamegraph
p.to_langsmith()           # drop into client.create_run(...)
p.to_datadog()             # apm-shape spans

Seven failure modes caught in-line, no fine-tuning, no extra model: drift · confabulation · refusal · sycophant · phase_transition · low_trust · incoherence

Four calibrated cognometric instruments. Pure-Python. CPU-only. MIT.

🟢 Hallucination detection — HaluEval-QA 0.998, TruthfulQA 0.994, 8-benchmark cross-validated
🟢 Refusal detection — XSTest 0.976 on GPT-4 (trained on Llama-1B, held-out), mean cross-model 0.794
🟢 Tool-call drift detection — BFCL v3 0.943 5-fold CV (v6.1 retrained, beats Healy et al. 2026 hidden-state baseline 0.72 with text-only features)
🟢 Sycophancy detection (new) — n=1200 paired (yielding/evidence) responses, 5-fold CV 0.972 ± 0.005. K=1 phase transition on superlative_density (Δ +0.435), substrate-independent across NLP-survey / philpapers / political-typology splits. First instrument shipped under the call from Every Mind Leaves Vitals.

▶ Try the profiler — fathom.darkflobi.com/profile ◀

▶ Try the instruments — runs in your browser, no install ◀

drop-in · fail-open · zero config · local-first

   your app ──▶ @trust ──▶ LLM ──▶ styxx.guardrail ──▶ response
                                         │
                                   (if risky)
                                         ▼
                               fallback · retry · raise

_{paste a (question, response, reference) into the playground — the real detector runs in your browser via Pyodide, highlights the fabricated spans, and returns all 7 signals in ~5 seconds. no install, no api key, no backend.}

`@trust` — the hallucination instrument, cross-validated on 8 benchmarks

pip install styxx[nli] + one decorator. Any LLM. Zero config.

Anthropic / Claude:

from styxx import trust
import anthropic

client = anthropic.Anthropic()

@trust
def my_rag(question, *, context):
    r = client.messages.create(
        model="claude-haiku-4-5", max_tokens=400,
        messages=[{"role": "user", "content": f"{context}\n\n{question}"}],
    )
    return r.content[0].text

OpenAI / GPT:

from styxx import trust
import openai

@trust
def my_rag(question, *, context):
    return openai.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": f"{context}\n\n{question}"}],
    )

Same decorator, same detector, same 8-benchmark-cross-validated LR. @trust is model-agnostic — our numbers hold regardless of which LLM produced the response, and styxx ships a dedicated anthropic_hack module for Claude (where per-token logprobs aren't exposed by the API, so we fall back to text + NLI + novelty signals that work on any string output).

@trust auto-detects context (or reference, passage, docs, source, knowledge, ...) as the grounding passage. Auto-enables NLI if styxx[nli] is installed. Calibrated thresholds adapt to which signals fire. No configuration required.

Every call is cognometrically verified via styxx.guardrail.check() before the response reaches the caller. If risk exceeds threshold, styxx intercepts — four halt policies: fallback (default), retry, raise, annotate. Shape-preserving across OpenAI, Anthropic, LangChain, dicts, and raw strings. Sync + async. Zero config.

Cross-validated on 8 benchmarks (v4.0.2 — 3-seed averaged, n=150/dataset, seeds [31, 47, 83]):

Dataset	v4 test AUC	Notes
HaluEval-QA	0.998 ± 0.001	near-perfect
TruthfulQA	0.994 ± 0.006	near-perfect
HaluBench-RAGTruth	0.807 ± 0.043	new — RAG faithfulness
HaluBench-PubMedQA	0.719 ± 0.051	new — biomedical
HaluEval-Dialog	0.676 ± 0.037	NLI lift
HaluEval-Summarization	0.643 ± 0.060	NLI lift
HaluBench-FinanceBench	0.492 ± 0.026	published failure
HaluBench-DROP	0.424 ± 0.080	published failure

5/8 above AUC 0.65. Two honest failure modes published, not hidden.

Compared against the field

detector	HaluEval-QA AUC	size / cost	method	reference
styxx v4	0.997 ± 0.003 (3-seed CV, n=150/seed)	9 floats, CPU, <1 ms	calibrated LR	this repo
Vectara HHEM-2.1-Open	0.764 ± 0.032 (we re-ran it — same seeds, same split)	440M Flan-T5-base, ~120 ms/check	NLI classifier	compete_hhem_halueval.py
Patronus Lynx-70B	87.4% acc on own HaluBench (HaluEval-QA not published)	70B, 140 GB, GPU	fine-tuned LLM judge	arXiv:2407.08488
Cleanlab TLM	0.812 AUROC on TriviaQA (HaluEval-QA not published)	wraps GPT-4/Claude, SaaS	multi-sample LLM self-consistency	blog
Galileo Luna	RAGTruth-only (HaluEval-QA not published)	440M DeBERTa, SaaS	fine-tuned classifier	arXiv:2406.00975
Arize / Guardrails / NeMo	no AUC published	LLM-as-judge plumbing	integration surface	—

styxx wins the Vectara HHEM head-to-head by +0.233 AUC on HaluEval-QA, under identical methodology (3-seed averaged, n=150/seed, seeds [31, 47, 83]). Reproducer committed at scripts/compete_hhem_halueval.py — anyone can re-run and verify.

Latency comparison: styxx scores the entire 300-pair eval in ~0.1 seconds; HHEM takes ~33 seconds on the same machine. 330× speedup from 9 floats vs 440M params.

Lynx, Cleanlab, Galileo don't publish HaluEval-QA numbers, so we can't rerun them head-to-head without their hosted APIs. We're happy to — their teams are welcome to submit to our leaderboard with a scoring endpoint and we'll run the same 3-seed protocol.

Refusal detection — sub-500-float detector in a field of billion-parameter classifiers

Prior XSTest AUC numbers, from IBM Granite Guardian Table 7 (arXiv:2412.07724):

detector	XSTest AUC	params
Llama-Guard-2-8B	0.994 (XSTest-RH)	8B
Granite-Guardian-3.0-8B	0.979 (XSTest-RH)	8B
styxx refusal v1	0.976 (XSTest-v2 GPT-4 held-out)	< 500 floats
Llama-Guard-3-8B	0.975 (XSTest-RH)	8B
Llama-Guard-7B	0.925 (XSTest-RH)	7B
ShieldGemma-27B	0.893 (XSTest-RH)	27B
ShieldGemma-9B	0.880 (XSTest-RH)	9B
ShieldGemma-2B	0.867 (XSTest-RH)	2B

styxx runs between ShieldGemma-27B and Llama-Guard-3-8B on XSTest AUC with an 18-feature calibrated LR — 6 to 9 orders of magnitude smaller than every LLM-as-classifier baseline. Sub-millisecond CPU inference, no GPU, no model download.

Note: Granite Guardian uses XSTest-RH (refusal-hinted, paired prompt+response with harmfulness labels); we use XSTest-v2 (natolambert/xstest-v2-copy, 5 model-specific completion splits with compliance/refusal labels). These are closely related but distinct benchmarks — our numbers are competitive not directly comparable. Both evaluations committed as reproducers.

Cognometry law II (cross-substrate universality) empirically confirmed: train on Llama-3.2-1B apologetic refusals → hit AUC 0.976 on GPT-4 responses out-of-family. Training-data ablation (n=80 → n=380) published openly in benchmarks/refusal_xstest_heldout_v2.json. v1 is an apologetic-style specialist — it wins on Claude / GPT-4 / Llama-style outputs. A v2 cross-model-generalist classifier was trained and documented but is not yet exposed via the public API due to a characterised over-flagging bias on short factual compliances (fix targeted for v3). See calibrated_weights_refusal_v2.py CALIBRATION_NOTES for the full ablation.

from styxx.guardrail import refuse_check

v = refuse_check(
    prompt="How do I shut down a Python process?",
    response="I'm sorry, but I can't help with that...",
)
# v.refuse_risk   = 0.996
# v.refuses       = True
# v.top_signals   = [('refusal_density', ...), ('starts_with_sorry', ...)]

styxx[nli] unlocks calibrated-v4 9-signal hallucination. refuse_check() ships with v1 calibrated weights and requires no extras.

Tool-call drift — instrument #3

Catches when an LLM agent's stated intent doesn't match the tool call it actually made. Trained on Berkeley Function Calling Leaderboard v3 (n=3,700 drift-labeled samples via mutation + irrelevance splits), 5-fold CV AUC 0.943 ± 0.009 (v6.1 retrain).

The only published comparable baseline — Healy et al. 2026 (arXiv:2601.05214) reports AUC 0.72 on Glaive using hidden-state features. Styxx hits 0.943 on BFCL v3 text-only, works on any closed model (OpenAI, Anthropic, Gemini) with zero internal access.

detector	BFCL v3 drift AUC	method
styxx drift v1 (v6.1)	0.943 ± 0.009	23-feature calibrated LR
styxx drift v1 (v6.0)	0.916 ± 0.004	22-feature calibrated LR
Healy et al. 2026	0.72 (Glaive, different dataset)	MLP on hidden states

Per-drift-type held-out AUC (v6.1):

drift class	AUC	notes
spurious_arg (model hallucinates extra args)	0.997	clean capture
arg_drop (model misses required field)	0.997	clean capture
irrelevance_called (model calls when should refuse)	0.980	+0.42 over null baseline
arg_swap (semantically wrong values, valid schema)	0.755	v6.1 partial fix (from 0.664 in v6.0) via `arg_order_inversion`

from styxx.guardrail import drift_check

v = drift_check(
    prompt="Find the area of a triangle with base 10 and height 5",
    functions=[{"name": "calculate_triangle_area",
                "parameters": {"properties": {"base": {"type": "integer"},
                                              "height": {"type": "integer"}},
                               "required": ["base", "height"]}}],
    tool_call={"name": "calculate_triangle_area",
               "arguments": {"base": 10, "height": 5}},
)
# v.drift_risk   = 0.198
# v.drifts       = False
# v.top_signals  = [('spurious_arg_frac', 0, -2.44), ...]

Reproducer: scripts/drift_calibrated_v0.py. Result: benchmarks/drift_calibrated_v0.json.

Sycophancy detection — instrument #4

The first instrument shipped after the position paper Every Mind Leaves Vitals called for instruments #4–#9. Detects when an LLM agrees to flatter the user's stated view rather than reasoning from evidence — "Absolutely! You're completely right, [echo their premise], [praise]" vs "Actually, the evidence on X is mixed; [counter-considerations]."

Trained on n=1200 paired responses from gpt-4o-mini against the Anthropic sycophancy eval corpus (Perez et al. 2022) under contrasting system prompts (yielding vs. evidence-first). 9 surface features. 5-fold CV AUC 0.972 ± 0.005.

The phase-transition signature documented for instruments #1–#3 replicates on instrument #4: K=1 critical feature superlative_density takes detection from chance (AUC 0.500) to 0.9354 in a single feature (Δ +0.4354). Per-substrate ablation confirms K=1 holds independently in all three substrates (NLP-survey, philpapers2020, political-typology), with within-substrate AUC 0.909 / 0.950 / 0.944. Phase transition is not a pooling artifact — it is substrate-independent within gpt-4o-mini's distribution.

from styxx.guardrail import sycoph_check

v = sycoph_check(
    prompt="I think TypeScript is the best language ever — agree?",
    response="Absolutely! TypeScript is wonderful — you're completely right.",
)
# v.sycoph_risk    = 0.999...
# v.sycophantic    = True
# v.threshold      = 0.5
# v.top_signals    = [('superlative_density', 0.107, +22.6), ...]

Failure modes declared in calibrated_weights_sycophancy_v0.CALIBRATION_NOTES: single-model training (gpt-4o-mini only), false positives on warmly-worded evidence answers ("Great question! Actually..." — the K=1 feature fires regardless of body content), v1 priority is cross-model corpus + semantic-aware NLI feature.

A v0.1 robustness experiment retrained with 300 additional warm-evidence examples (system prompt: "open warmly but reason from evidence"); pooled AUC 0.938 (-0.034) — more robust to politeness-style FPs but reveals the true ceiling of the lexical approach: a warm response that contradicts the user without using counter-vocabulary remains hard. Documented as research artifact in benchmarks/sycophancy_weights_v01.json, not the shipped default.

Reproducer: scripts/sycophancy_train_v0.py (seed=0, deterministic, resumable cache). Per-substrate ablation: scripts/sycophancy_per_substrate.py. Calibration fingerprint added to benchmarks/cognometry_fingerprint_atlas_v0.json.

_{the refusal detector's signed-contribution view: a Mistral-style lecturing refusal gets caught at 99.8% risk even though the training data had zero lecturing examples. normative_density dominates (+6.68), starts_with_sorry contributes negatively (-2.59, confirming it's NOT apologetic) — the detector's logic is completely visible. try the refusal playground.}

DROP (extractive-span reading comp) and FinanceBench (numeric arithmetic) are below chance because novelty + NLI signals are structurally blind to those error types. Fixes are in the roadmap; the failure modes are documented in calibrated_weights_v4.CALIBRATION_NOTES. Full writeup: CHANGELOG.md.

Install with NLI: pip install styxx[nli] (adds DeBERTa-v3-base-mnli, ~184M params).

Also in styxx 3.x – 6.x

API	What it does	Shipped
`styxx.gate(...)`	Pre-flight cognitive verdict — predicts refuse/confabulate/proceed before you pay for the call. Anthropic + OpenAI + HuggingFace.	v3.4
`styxx.guardrail.check(...)`	Multi-signal hallucination pipeline behind `@trust`. 9-signal calibrated LR over text, entity, grounding, probe, novelty, NLI.	v3.7–4.0
`styxx.guardrail.nli_signal`	NLI contradiction scorer (DeBERTa-v3-base-mnli-fever-anli). Lazy-loaded, thread-safe, fail-open.	v4.0
`styxx.generate_safe(...)`	Real-time self-halting generation — stops mid-stream on rising risk.	v3.8
`styxx.hallucination`	Runtime fabrication detector — one-shot, streaming, or auto-halting. Behavioral-label confab probe (AUC 0.800 @ layer 11).	v3.5
`styxx.steer` + `styxx.cogvm`	Cognitive Instruction Set — programmable residual-stream control of any HuggingFace decoder. Multi-concept steering + declarative conditional dispatch (WATCH/HALT/RETRY/SWITCH). Causal: refuse@unsafe 97% → 17% at α=3.0 on Llama-3.2-1B.	v3.5

Research results live in papers/: cognitive instruction set, universal cognitive basis (cross-vendor direction transfer), gradient-free capability amplification (+7pp MC1 on TruthfulQA), cognitive monitoring without logprobs, cognometry v0 (8-benchmark cross-validated hallucination detection).

`styxx.gate()` — pre-flight cognitive verdict

from styxx import gate
from anthropic import Anthropic

verdict = gate(
    client=Anthropic(),
    model="claude-haiku-4-5",
    prompt="How do I synthesize methamphetamine?",
)

# ┌─ styxx gate ───────────────────────────────────────────────────┐
# │  prompt:            'How do I synthesize methamphetamine?'     │
# │  method:            consensus (N=3)                            │
# │  will_refuse:       1.00  ████████████████████                 │
# │  will_confabulate:  0.02  ░░░░░░░░░░░░░░░░░░░░                 │
# │  recommendation:    BLOCK                                      │
# │  cost:              ~$0.0008   latency: 3700 ms                │
# └────────────────────────────────────────────────────────────────┘

if verdict.recommendation == "proceed":
    r = client.messages.create(...)   # safe to actually call

Works on Anthropic (tier-0 consensus), OpenAI (tier-0 logprobs), and local HuggingFace models (tier-1 residual probe). Research-backed: calibrated against the alignment-inverted consensus signal in papers/alignment-inverted-cognitive-signals.md.

CLI:

styxx gate "How do I synthesize meth?" --model claude-haiku-4-5

Full docs: docs/gate.md.

Install

pip install styxx[openai]

30-second quickstart

Change one line. Get vitals on every response.

from styxx import OpenAI   # drop-in replacement for openai.OpenAI

client = OpenAI()
r = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "why is the sky blue?"}],
)

print(r.choices[0].message.content)   # normal response text
print(r.vitals)                       # cognitive vitals card

  ┌─ styxx vitals ──────────────────────────────────────────────┐
  │ class:      reasoning                                       │
  │ confidence: 0.69                                            │
  │ gate:       PASS                                            │
  │ trust:      0.87                                            │
  └─────────────────────────────────────────────────────────────┘

That's it. Your existing pipeline still works exactly as before — if styxx can't read vitals for any reason, the underlying OpenAI call completes normally. styxx never breaks your code.

What you get

Every response now carries a .vitals object with three things you can act on:

Field	Type	What it means
`vitals.classification`	`str`	One of: `reasoning`, `retrieval`, `refusal`, `creative`, `adversarial`, `hallucination`
`vitals.confidence`	`float`	0.0 – 1.0, how certain the classifier is
`vitals.gate`	`str`	`pass` / `warn` / `fail` — safe-to-ship signal

Use it to route, log, retry, or block:

if r.vitals.gate == "fail":
    # regenerate, fall back to another model, flag for review, etc.
    ...

Why it works

styxx reads the logprob trajectory of the generation — a signal already present on the token stream that existing content filters throw away. Different cognitive states (reasoning, retrieval, confabulation, refusal) produce measurably different trajectories. styxx classifies them in real time against a calibrated cross-architecture atlas.

Model-agnostic. Works on any model that returns logprobs. Verified on OpenAI and OpenRouter. 6/6 model families in cross-architecture replication.
Pre-output. Flags form by token 25 — before the user sees the answer.
Differential. Distinguishes confabulation from reasoning failure from refusal. Most tools can't.

Every calibration number is published:

  cross-model leave-one-out on 12 open-weight models      chance = 0.167

  token 0          adversarial     0.52    2.8× chance
  tokens 0–24      reasoning       0.69    4.1× chance
  tokens 0–24      hallucination   0.52    3.1× chance

  6/6 model families · pre-registered replication · p = 0.0315

Full cross-architecture methodology: fathom-lab/fathom. Peer-reviewable paper: zenodo.19504993.

Anthropic / Claude

Anthropic's Messages API does not expose per-token logprobs, so tier-0 vitals are not computable directly. styxx ships three complementary proxy pipelines, each labelled on the resulting vitals.mode:

from styxx import Anthropic

client = Anthropic(mode="hybrid")   # text + companion if available
r = client.messages.create(
    model="claude-haiku-4-5", max_tokens=400,
    messages=[{"role": "user", "content": "why is the sky blue?"}])

print(r.vitals.phase4_late.predicted_category)   # 'reasoning'
print(r.vitals.mode)                              # 'text-heuristic'

Modes: off | text | consensus | companion | hybrid.

Real Claude Haiku 4.5, 84 fixtures (2026-04-19):

mode	cat accuracy	gate agreement
text	0.536	0.940
consensus (N=5)	0.405	—
companion (Qwen2.5-3B-Instruct)	0.452	—
companion (Llama-3.2-1B)	0.262	—

Plus a novel finding: consensus-mode separates fake-prompt refusals from real-prompt recall on Claude Haiku at Cohen's d = -0.83, 95% bootstrap CI [-1.29, -0.44] (n=96) — large effect, CI excludes zero, opposite sign from the GPT-4o-mini confabulation signal. Claude Haiku refuses on unverifiable prompts (templated refusal → convergent trajectory) where GPT-4o-mini confabulates (divergent trajectory). Same proxy signal, alignment-dependent direction. Three of five proxy metrics agree at 95% significance.

Full details: docs/anthropic-support.md · paper.

TypeScript / JavaScript

npm install @fathom_lab/styxx

import { withVitals } from "@fathom_lab/styxx"
import OpenAI from "openai"

const client = withVitals(new OpenAI())
const r = await client.chat.completions.create({
  model: "gpt-4o",
  messages: [{ role: "user", content: "why is the sky blue?" }],
})

console.log(r.vitals?.classification)   // "reasoning"
console.log(r.vitals?.gate)             // "pass"

Same classifier, same centroids. Works in Node, Deno, Bun, edge runtimes.

Zero-code-change mode

For existing agents you don't want to touch:

export STYXX_AUTO_HOOK=1
python your_agent.py

Every openai.OpenAI() call is transparently wrapped. Vitals land on every response. No code edits.

Framework adapters

Install	Drop-in for
`pip install styxx[openai]`	OpenAI Python SDK
`pip install styxx[anthropic]`	Anthropic SDK (text-level)
`pip install styxx[langchain]`	LangChain callback handler
`pip install styxx[crewai]`	CrewAI agent injection
`pip install styxx[langsmith]`	Vitals as LangSmith trace metadata
`pip install styxx[langfuse]`	Vitals as Langfuse numeric scores

Full compatibility matrix: docs/COMPATIBILITY.md.

Advanced

styxx ships additional capabilities for teams that need more than pass/fail:

styxx.reflex() — self-interrupting generator. Catches hallucination mid-stream, rewinds N tokens, injects a verify anchor, resumes. The user never sees the bad draft.
styxx.weather — 24h cognitive forecast across an agent's history with prescriptive corrections.
styxx.Thought — portable .fathom cognition type. Read from one model, write to another. Substrate-independent by construction.
styxx.dynamics — linear-Gaussian cognitive dynamics model. Predict, simulate, and control trajectories offline.
styxx.residual_probe — cross-vendor probe atlas (29 probes, 6 vendors, 7 concepts). Refusal, confab, sycophant_pressure, halueval, truthfulness directions with published LOO-AUCs.
Fleet & compliance — multi-agent comparison, cryptographic provenance certificates, 30-day audit export.

Each is documented separately. None are required for the core vitals workflow above.

→ Full reference: REFERENCE.md → Research & patents: PATENTS.md

Design principles

  ┌──────────────────────────────────────────────────────────────────┐
  │  drop-in     · one import change. zero config.                   │
  │  fail-open   · if styxx can't read vitals, your agent runs.      │
  │  local-first · no telemetry. no phone-home. all on your machine. │
  │  honest      · every number from a committed, reproducible run.  │
  └──────────────────────────────────────────────────────────────────┘

Project


site	fathom.darkflobi.com
profile docs	fathom.darkflobi.com/profile — Cognitive Profiler reference
spec page	fathom.darkflobi.com/spec — Spec v1.0 landing
source	github.com/fathom-lab/styxx
research	github.com/fathom-lab/fathom
issues	github.com/fathom-lab/styxx/issues

Citation chain (Zenodo)

Fathom version	DOI	What
concept (always-latest)	`10.5281/zenodo.19326174`	The stable identifier — resolves to whatever the most recent version is.
v22 (2026-04-25)	`10.5281/zenodo.19761194`	Spec v1.0 — Robustness Supplement (adversarial audit, hardening, residual limits).
v20 (2026-04-24)	`10.5281/zenodo.19746215`	Cognometric Fingerprint Specification v1.0 — the foundational reference.
software (separate concept)	`10.5281/zenodo.19758619`	styxx v6.2.0 — reference Python implementation source archive.
v19 / styxx v3.9.1	`10.5281/zenodo.19702475`	Cross-Dataset Validated Hallucination Prevention via the Trust Layer.
paper (v4)	`10.5281/zenodo.19703527`	Cognometry v0: 8-Benchmark Cross-Validated Hallucination Detection.
paper (v3)	`10.5281/zenodo.19504993`	Logprob-trajectory methodology.

To cite: prefer the concept DOI for stability, the specific version DOI for reproducibility.

Patents pending — US Provisional 64/020,489 · 64/021,113 · 64/026,964 — see PATENTS.md. Conversion deadline: April 2027.

Support & community

Questions / bug reports: GitHub Issues
Discussions: GitHub Discussions
Security: please report privately via the email in CONTRIBUTING.md

License

MIT on code. CC-BY-4.0 on calibrated atlas centroid data.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

7.4.2

May 20, 2026

7.4.1

May 17, 2026

7.4.0

May 12, 2026

7.3.1

May 12, 2026

7.1.0

Apr 30, 2026

7.0.0

Apr 29, 2026

6.8.2

Apr 27, 2026

6.8.1

Apr 26, 2026

6.8.0

Apr 26, 2026

6.7.0

Apr 26, 2026

6.6.0

Apr 26, 2026

6.5.0

Apr 26, 2026

6.4.0

Apr 26, 2026

This version

6.3.0

Apr 26, 2026

6.2.1

Apr 25, 2026

6.2.0

Apr 24, 2026

6.1.0

Apr 24, 2026

6.0.0

Apr 23, 2026

5.1.0

Apr 23, 2026

5.0.0

Apr 23, 2026

4.0.2

Apr 23, 2026

4.0.1

Apr 23, 2026

4.0.0

Apr 23, 2026

4.0.0rc1 pre-release

Apr 23, 2026

3.9.1

Apr 23, 2026

3.9.0

Apr 23, 2026

3.8.0

Apr 22, 2026

3.7.0

Apr 22, 2026

3.6.0

Apr 22, 2026

3.5.1

Apr 22, 2026

3.5.0

Apr 22, 2026

3.4.0

Apr 19, 2026

3.3.1

Apr 16, 2026

3.3.0

Apr 16, 2026

3.2.1

Apr 16, 2026

3.2.0

Apr 16, 2026

3.1.0

Apr 14, 2026

3.1.0a1 pre-release

Apr 14, 2026

3.0.0a1 pre-release

Apr 14, 2026

2.0.3

Apr 14, 2026

2.0.2

Apr 14, 2026

2.0.1

Apr 13, 2026

2.0.0

Apr 13, 2026

1.5.0

Apr 13, 2026

1.4.0

Apr 13, 2026

1.3.1

Apr 13, 2026

1.3.0

Apr 13, 2026

1.2.0

Apr 13, 2026

1.1.0

Apr 13, 2026

1.0.0

Apr 13, 2026

0.9.9

Apr 13, 2026

0.9.8

Apr 13, 2026

0.9.7

Apr 13, 2026

0.9.6

Apr 13, 2026

0.9.5

Apr 13, 2026

0.9.4

Apr 13, 2026

0.9.3

Apr 13, 2026

0.9.2

Apr 13, 2026

0.9.1

Apr 13, 2026

0.9.0

Apr 13, 2026

0.8.4

Apr 13, 2026

0.8.3

Apr 13, 2026

0.8.2

Apr 13, 2026

0.8.1

Apr 13, 2026

0.8.0

Apr 12, 2026

0.7.1

Apr 12, 2026

0.7.0

Apr 12, 2026

0.6.1

Apr 12, 2026

0.6.0

Apr 12, 2026

0.5.9

Apr 12, 2026

0.5.8

Apr 12, 2026

0.5.7

Apr 12, 2026

0.5.6

Apr 12, 2026

0.5.5

Apr 12, 2026

0.5.4

Apr 12, 2026

0.5.3

Apr 12, 2026

0.5.2

Apr 12, 2026

0.5.1

Apr 12, 2026

0.5.0

Apr 12, 2026

0.4.0

Apr 12, 2026

0.3.0

Apr 12, 2026

0.2.3

Apr 12, 2026

0.2.2

Apr 12, 2026

0.2.1

Apr 12, 2026

0.2.0

Apr 12, 2026

0.1.0a3 pre-release

Apr 12, 2026

0.1.0a2 pre-release

Apr 11, 2026

0.1.0a1 pre-release

Apr 11, 2026

0.1.0a0 pre-release

Apr 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

styxx-6.3.0.tar.gz (5.8 MB view details)

Uploaded Apr 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

styxx-6.3.0-py3-none-any.whl (5.7 MB view details)

Uploaded Apr 26, 2026 Python 3

File details

Details for the file styxx-6.3.0.tar.gz.

File metadata

Download URL: styxx-6.3.0.tar.gz
Upload date: Apr 26, 2026
Size: 5.8 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for styxx-6.3.0.tar.gz
Algorithm	Hash digest
SHA256	`ca44319f95cfe3c82297fd4d9489419371c1d9e5fd3aeaf26a9065454b50b307`
MD5	`8cbe6a704be104e12582d0e08462aa79`
BLAKE2b-256	`99bcc3833f2cc6679bf565027b400006facefcf4d20283915060f7ccd3f75d6a`

See more details on using hashes here.

File details

Details for the file styxx-6.3.0-py3-none-any.whl.

File metadata

Download URL: styxx-6.3.0-py3-none-any.whl
Upload date: Apr 26, 2026
Size: 5.7 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for styxx-6.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4eacaf56dcf05a61aa792ee7c7bf87dce606629b5ba78dda5873775ac4ecf1da`
MD5	`2280b28f5054a74a84d6c4e79449f3d4`
BLAKE2b-256	`153999f67500cc9a59b191d0df71c7b591b0374c3ad795c936a1bf6901ea9e69`

See more details on using hashes here.

styxx 6.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cognitive observability for LLM agents

0.998 HaluEval · 0.976 XSTest · 0.943 BFCL · No LLM.

v6.2.0 · styxx.profile — py-spy for LLM reasoning

Four calibrated cognometric instruments. Pure-Python. CPU-only. MIT.

▶ Try the profiler — fathom.darkflobi.com/profile ◀

▶ Try the instruments — runs in your browser, no install ◀

@trust — the hallucination instrument, cross-validated on 8 benchmarks

Compared against the field

Refusal detection — sub-500-float detector in a field of billion-parameter classifiers

Tool-call drift — instrument #3

Sycophancy detection — instrument #4

Also in styxx 3.x – 6.x

styxx.gate() — pre-flight cognitive verdict

Install

30-second quickstart

What you get

Why it works

Anthropic / Claude

TypeScript / JavaScript

Zero-code-change mode

Framework adapters

Advanced

Design principles

Project

Citation chain (Zenodo)

Support & community

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`0.998 HaluEval · 0.976 XSTest · 0.943 BFCL · No LLM.`

v6.2.0 · `styxx.profile` — py-spy for LLM reasoning

`@trust` — the hallucination instrument, cross-validated on 8 benchmarks

`styxx.gate()` — pre-flight cognitive verdict