VERDICT WEIGHT™ — Context-Adaptive Multi-Source Confidence Synthesis. 57.8% adversarial suppression. N=10,000 validated. 12 domain profiles.
Project description
VERDICT WEIGHT™
A Context-Adaptive Multi-Source Confidence Synthesis Framework for Autonomous AI Intelligence Systems
"Calibrated multi-source confidence scoring is not an optional feature of autonomous AI systems — it is a foundational architectural requirement."
The Problem
Autonomous AI systems treat all intelligence sources as equal. A rumor on a threat forum and a Mandiant primary incident report receive the same weight. A 30-day-old signal and a real-time alert are scored identically. A spoofed high-credibility source sails straight through.
That is not a UI problem. That is a systematic architectural vulnerability.
Validated Results
Validated across N=10,000 synthetic scenarios (seed=42, fully reproducible).
Dataset SHA-256: 40bc6e227e30f5292796b3c8df60c68a8339180eea4e2379f1ab9d1e5ac8bd63
| Method | Brier ↓ | 95% CI | AUC-ROC | McNemar |
|---|---|---|---|---|
| VERDICT WEIGHT™ | 0.2079 | [0.2036, 0.2122] | 0.7499 | — |
| Equal Weight | 0.2170 | [0.2130, 0.2210] | 0.7450 | p<0.001 *** |
| Single Source | 0.2499 | [0.2447, 0.2553] | 0.6537 | p<0.001 *** |
| Two Stream | 0.2298 | [0.2251, 0.2346] | 0.7258 | p<0.001 *** |
Key results:
- 57.8% suppression of adversarial spoofed intelligence vs single-source baselines
- +16.8% Brier Score improvement vs single source (p<0.001 ***)
- +14.7% AUC-ROC improvement vs single source (p<0.001 ***)
- 5-fold CV stability: Brier 0.2079 ± 0.0046 — results do not overfit
- All significance tests: McNemar p=0.000000 against all baselines
Cross-Vertical Performance (N=1,000 per vertical)
| Vertical | Brier Δ% | AUC Δ% | Adv. Suppression |
|---|---|---|---|
| Cybersecurity | +17.7% | +17.0% | 57.4% |
| Healthcare | +9.6% | +20.9% | — |
| Financial | +10.3% | +15.1% | — |
| Manufacturing | +9.2% | +8.5% | 40.5% |
| Legal | +4.4% | +3.8% | 30.7% |
| Defense | -2.5% | +0.3% | 16.6% |
| Enterprise RAG | -6.7% | -1.1% | 25.7% |
Defense and RAG show negative Brier improvement due to synthetic data characteristics. See audit report for full failure mode analysis.
What VERDICT WEIGHT™ Does
Four evidence streams → Three outputs → One decision.
Four Evidence Streams
| Stream | Symbol | Description |
|---|---|---|
| Source Reliability | SR | Credibility of the originating source (0.01–0.99) |
| Cross-Feed Corroboration | CC | Independent confirmation across feeds |
| Temporal Decay | TD | Recency of the intelligence signal |
| Historical Source Accuracy | HA | Empirical track record of the source |
Three Output Components
| Output | Symbol | Range | Description |
|---|---|---|---|
| Signal Strength | SS | 0–1 | Confidence the signal is real |
| Doubt Index | DI | 0–1 | Inter-stream disagreement |
| Consequence Weight | CW | 0–1 | Actionability after doubt adjustment |
Twelve Context Profiles
| Domain | Profile Type |
|---|---|
| Cybersecurity (General) | Corroboration-dominant |
| Cybersecurity (APT) | Source reliability elevated, slow decay |
| Cybersecurity (Zero-Day) | Temporal decay dominant |
| Cybersecurity (Disinformation) | Maximum corroboration + doubt penalty |
| Healthcare (Diagnostic) | High doubt penalty, surfaces uncertainty |
| Healthcare (Drug Safety) | Highest doubt penalty in registry |
| Financial (Fraud) | Corroboration + recency co-dominant |
| Financial (Market) | Temporal decay dominant, fast decay |
| Defense Intelligence | Multi-source fusion, slow strategic decay |
| Autonomous Vehicle | Sub-second decay, highest doubt penalty |
| Legal Evidence | Minimal decay, chain of custody weighted |
| Enterprise RAG | LLM retrieval confidence scoring |
Quick Start
pip install verdict-weight
from verdict_weight import VerdictWeight, ContextType
vw = VerdictWeight()
# Score a cybersecurity threat intelligence signal
result = vw.score(
source_reliability=0.92, # How credible is this source?
n_corroborating_sources=3, # How many independent sources confirm?
age_value=2.5, # How old is this intelligence (days)?
correct_predictions=45, # Source's historical correct calls
total_predictions=50, # Source's total historical calls
context=ContextType.CYBERSECURITY_APT
)
print(result.action_tier) # CRITICAL
print(result.consequence_weight) # 0.8380
print(result.doubt_index) # 0.0691
print(result.interpretation) # "Act immediately. High-confidence..."
print(result.to_json()) # Full JSON output
Adversarial signal — watch the suppression
# High-credibility source but zero corroboration — spoofed intel
result = vw.score(
source_reliability=0.95, # Looks credible
n_corroborating_sources=0, # Nobody else is confirming this
age_value=1.0,
context=ContextType.CYBERSECURITY_DISINFO
)
print(result.action_tier) # NOISE
print(result.consequence_weight) # 0.147 — suppressed by 84%
Score pre-computed streams directly
result = vw.score_streams(
SR=0.92, CC=0.78, TD=0.94, HA=0.88,
context=ContextType.FINANCIAL_FRAUD
)
Repository Structure
verdict-weight/
├── verdict_weight/
│ ├── __init__.py # Public API
│ └── core.py # VerdictWeight engine — all 12 profiles
├── validation/
│ ├── synthetic_validation.py # N=10,000 validation (seed=42)
│ └── ablation_study.py # 324-config weight ablation
├── examples/
│ ├── cybersecurity.py
│ └── healthcare.py
├── docs/
│ └── VERDICT_WEIGHT_Paper.pdf # SSRN 6532658
├── pyproject.toml
├── requirements.txt
└── README.md
Mathematical Foundation
Signal Strength (weighted geometric mean):
SS = ∏(S_i + ε)^w_i where Σw_i = 1.0
Doubt Index (normalized coefficient of variation):
DI = clip(σ(SR,CC,TD,HA) / μ(SR,CC,TD,HA), 0, 1)
Consequence Weight:
CW = clip(SS × (1 - δ × DI), 0, 1)
The geometric mean is chosen because it penalizes weak streams multiplicatively. A single stream near zero collapses the score — preventing one strong source from masking fundamental evidence gaps. This is the structural guarantee behind the 57.8% adversarial suppression result.
Citation
@misc{byrd2026verdictweight,
title={VERDICT WEIGHT: A Context-Adaptive Multi-Source Confidence Synthesis
Framework for Autonomous AI Intelligence Systems},
author={Byrd, Andre},
year={2026},
howpublished={SSRN Preprint},
note={SSRN Abstract ID: 6532658},
url={https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6532658},
doi={10.5281/zenodo.19447547}
}
Reproducibility
Results are fully reproducible:
- Clone this repository
- Run
python validation/synthetic_validation.py - Verify SHA-256 matches:
40bc6e227e30f5292796b3c8df60c68a8339180eea4e2379f1ab9d1e5ac8bd63
Master seed: 42 (never changes — all results are deterministic)
Legal
VERDICT WEIGHT™ is a trademark of Six Sense Enterprise Services LLC (Odingard Security). USPTO Serial Number: 99747827.
© 2026 Six Sense Enterprise Services LLC. All rights reserved.
This software is made available for research and evaluation purposes. Commercial deployment requires a license agreement.
For licensing: andre.byrd@odingard.com Paper: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6532658 DOI: https://doi.org/10.5281/zenodo.19447547
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file verdict_weight-1.0.1.tar.gz.
File metadata
- Download URL: verdict_weight-1.0.1.tar.gz
- Upload date:
- Size: 14.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
430d53daa988fd65faed1c4741506bbe399c011e38ec1f2c578a060c076875cc
|
|
| MD5 |
e737dfb77fa5d8891f2b1372ed88ed48
|
|
| BLAKE2b-256 |
192a6f301ee868015f1be10674ece162b41a6bd8252648f572654d84082af5c9
|
File details
Details for the file verdict_weight-1.0.1-py3-none-any.whl.
File metadata
- Download URL: verdict_weight-1.0.1-py3-none-any.whl
- Upload date:
- Size: 12.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7d1caf4c9b980192d7204afac905f3829b18a3b19972e0a90cc691c5ed99905c
|
|
| MD5 |
1d217e530298a70fb57e3e2afc343a71
|
|
| BLAKE2b-256 |
135b1e29de44ca8ed123b0334caa52e52ef06847aa5a6dddf3a694aab976e368
|