Skip to main content

OtiLLM v2.4: A governed cognitive runtime for evidence-native, memory-augmented and explainable frontier AI systems.

Project description

OtiLLM v2.4

A Governed Cognitive Runtime for Evidence-Native, Memory-Augmented and Explainable Frontier AI Systems

OtiLLM is a research-grade governed cognitive runtime framework designed to make Large Language Model systems more reliable, explainable, evidence-aware, and operationally trustworthy.

Unlike conventional Retrieval-Augmented Generation (RAG), agentic orchestration, or guardrail-based systems, OtiLLM treats inference as a governed runtime process rather than a simple prompt-to-response operation.

At runtime, OtiLLM continuously evaluates evidence sufficiency, contradiction density, uncertainty, governance risk, memory utility, retrieval quality, and explainability requirements before selecting an action.

OtiLLM can dynamically decide whether to answer, retrieve more evidence, reformulate retrieval, escalate uncertainty, refuse unsupported generation, update memory, or generate an explainable runtime trace.


Why OtiLLM Exists

Modern frontier AI systems are increasingly powerful, but they remain vulnerable to hallucination, unsupported claims, weak retrieval grounding, poor uncertainty calibration, limited runtime explainability, unsafe memory persistence, fragile governance enforcement, and adversarial retrieval attacks.

The central idea behind OtiLLM is:

Trustworthy AI requires governed cognition, not just larger models.


Core Innovations

Governed Cognitive Runtime

OtiLLM models inference as a continuously governed runtime process involving retrieval, governance, uncertainty, memory, utility optimisation, and explainability.

Evidence Fabric

The Evidence Fabric evaluates retrieval quality, contradiction density, source reliability, freshness, and evidence sufficiency before reasoning proceeds.

Retrieval Reconsideration Dynamics

Retrieval Reconsideration Dynamics enables OtiLLM to reformulate, expand, diversify, or refresh retrieval when evidence quality is insufficient.

Cognitive Utility Optimisation

OtiLLM selects runtime actions according to expected cognitive utility under evidence and governance constraints.

Governed Cognitive Memory

OtiLLM introduces confidence-aware and governance-constrained memory persistence to prevent unsafe, contradictory, or stale memory propagation.

Explainable Runtime Reasoning Graph

The Explainable Runtime Reasoning Graph (ERRG) captures structured runtime provenance across evidence, governance, uncertainty, memory, and reasoning transitions.


Installation

Install from source:

git clone https://github.com/OtiEdema/otillm.git
cd otillm
python -m pip install -e .

Install development dependencies:

python -m pip install -r requirements-dev.txt

Quick Start

from otillm.runtime import GovernedRuntime

runtime = GovernedRuntime()
result = runtime.run("Who wrote Pride and Prejudice?")

print(result.answer)
print(result.action)
print(result.trace.to_dict())

Benchmark Results

Metric Improvement
Truthfulness +14.8% over Strong RAG
Faithfulness +16.5% over Strong RAG
Unsupported claim reduction 66.7% reduction
Adversarial retrieval robustness 70.8% unsupported claim reduction
Refusal calibration 91% accuracy
Trace completeness +38% improvement

Running the JAIR Experiments

cd experiments/jair_v2

python prepare_jair_datasets.py
python run_jair_experiments.py
python run_jair_stress_tests.py
python statistical_validation.py
python plot_jair_results.py

Generated outputs:

results/
├── jair_all_results.csv
├── jair_summary_results.csv
├── jair_stress_results.csv
├── jair_statistical_tests.csv
└── figures/

Repository Structure

otillm/
├── runtime/
├── governance/
├── retrieval/
├── evidence/
├── orchestration/
├── explainability/
├── memory/
├── multimodal/
├── benchmarks/
├── experiments/
├── notebooks/
├── docs/
├── tests/
└── examples/

Citation

@article{edema2026otillm,
  title={OtiLLM v2.4: A Governed Cognitive Runtime for Evidence-Native, Memory-Augmented and Explainable Frontier AI Systems},
  author={Edema, Oti},
  journal={Submitted to the Journal of Artificial Intelligence Research},
  year={2026}
}

Licence

MIT Licence.


Author

Oti Edema
GitHub: https://github.com/OtiEdema
LinkedIn: https://www.linkedin.com/in/oti-e-34838485/

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

otillm-0.2.0.tar.gz (12.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

otillm-0.2.0-py3-none-any.whl (14.2 kB view details)

Uploaded Python 3

File details

Details for the file otillm-0.2.0.tar.gz.

File metadata

  • Download URL: otillm-0.2.0.tar.gz
  • Upload date:
  • Size: 12.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for otillm-0.2.0.tar.gz
Algorithm Hash digest
SHA256 01868bf15c44fefb19bb225d19190979a1bcbaa0a1ff8a94b5111403edf0efc7
MD5 f971c2eabe1e14478d72a4bc63884a8e
BLAKE2b-256 402ca8dc16f7fc204bd88f5c686deea7b51fd79bf1f031f708845389ebce3f55

See more details on using hashes here.

File details

Details for the file otillm-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: otillm-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 14.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for otillm-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 63d51e2d9c090c3632bcd1757762cbb52c435f10427e60ad88f94a911502cd71
MD5 8dcd244fb9b99567c7dc2130d338bf02
BLAKE2b-256 a4894f16ef56f9d15da74c9b5966d0295535d012778a898622d434180b720f89

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page