OtiLLM 0.1.0: Evidence-native, policy-aware AI runtime for reliable AI systems

Project description

OtiLLM 0.1.0

Evidence-Native, Policy-Aware AI Runtime for Reliable AI Systems

Overview

OtiLLM 0.1.0 is the founding open-source release of OtiLLM, a next-generation AI runtime architecture designed to improve the reliability, governance, and explainability of modern AI systems.

While large language models and retrieval systems have advanced significantly, their real-world deployment often exposes fundamental weaknesses. OtiLLM addresses these by restructuring how AI systems operate internally, introducing a runtime in which evidence, policy, memory, and explainability are tightly integrated and enforced.

This repository provides a working, extensible implementation of that architecture for researchers, engineers, and organisations building high-trust AI systems.

The Problem OtiLLM Solves

Modern AI systems frequently fail in high-stakes environments due to:

answers generated without sufficient or verifiable evidence
weak or non-existent policy enforcement
uncontrolled or low-quality memory accumulation
limited visibility into reasoning and decision processes
unreliable behaviour in long-running or agent-based workflows

These limitations are not purely model problems. They are system design problems.

The OtiLLM Approach

OtiLLM introduces a structured runtime in which every meaningful output follows a controlled lifecycle:

Input → Evidence → Reason → Verify → Align → Act → Explain

This replaces loosely coupled pipelines with a bounded, auditable, and evidence-driven execution model.

How OtiLLM Differs from Existing Approaches

Standard LLM Pipelines

rely heavily on prompt engineering
limited visibility into reasoning
no explicit evidence validation
no runtime governance

Traditional RAG Systems

improve factual grounding
but often rely on naive retrieval
lack policy awareness
limited explainability
no structured memory control

Agent-Based Systems

powerful but often unbounded
difficult to control or audit
prone to unsafe or inconsistent behaviour

OtiLLM

OtiLLM combines the strengths of these approaches while addressing their weaknesses:

evidence is explicitly retrieved, scored, and validated
policies are enforced before execution
memory is gated and quality-controlled
outputs are traceable and explainable
system behaviour is bounded and auditable

Key Components

Evidence Fabric

A hybrid retrieval layer that evaluates information using multiple signals:

semantic relevance
keyword overlap
temporal freshness
graph-aware signals
source trust (provenance)

This enables more reliable evidence selection than standard retrieval pipelines.

Policy Engine

A runtime governance layer that evaluates whether a request or action is allowed before execution.

This enables safer deployment in regulated and high-trust environments.

Memory Engine

A gated memory system that only stores information when it is:

sufficiently high quality
policy-compliant
novel

This prevents uncontrolled accumulation and improves long-term reliability.

Cognitive Orchestrator

The central coordination layer that integrates retrieval, validation, scoring, and generation.

It ensures that outputs are only produced when evidence and confidence thresholds are satisfied.

Explainability Layer

A built-in tracing system that provides visibility into how each response is generated, including:

retrieved sources
evidence scores
confidence estimation
policy decisions
execution outcomes
memory updates

Architecture Overview

OtiLLM is organised as a structured runtime pipeline:

Multimodal Input Perception Layer Evidence Fabric Cognitive Orchestrator Policy Engine Memory Engine Generator / Action Layer Explainability Trace Output

This design enables controlled, interpretable, and verifiable AI behaviour.

What This Release Includes

This initial release provides:

a modular Python package implementing the OtiLLM runtime
evidence ingestion and hybrid retrieval
policy-aware request handling
memory-gated storage logic
explainability trace generation
working examples demonstrating usage
a test suite for core components
packaging configuration for distribution

What This Release Does Not Claim

OtiLLM 0.1.0 is a foundational runtime framework.

It does not claim:

state-of-the-art benchmark performance
a fully trained large-scale foundation model
production-grade distributed infrastructure
complete multimodal training pipelines

Instead, it establishes the architectural and implementation foundation required for those capabilities.

Installation

Clone the repository:

git clone https://github.com/YOUR_GITHUB_USERNAME/OtiLLM.git
cd OtiLLM

Install locally:

pip install -e .

For development:

pip install -e .[dev]

Quick Start

from otillm import OtiLLM

model = OtiLLM()

model.add_evidence(
    content="Retrieval-Augmented Generation reduces hallucination by grounding outputs in external knowledge.",
    source="rag_reference",
    trust_score=0.9
)

model.add_evidence(
    content="Policy-aware AI systems are essential in regulated environments such as healthcare and finance.",
    source="governance_reference",
    trust_score=0.95
)

response = model.query("Why is policy-aware retrieval important?")

print(response.answer)
print(model.explain(response))

Example Output Behaviour

The system produces:

A grounded response based on retrieved evidence
A detailed trace explaining:

which sources were used
how they were scored
confidence level
evidence sufficiency
policy decision
execution outcome
whether memory was updated

This makes OtiLLM suitable for applications where transparency and accountability are required.

Use Cases

OtiLLM is particularly suited for:

enterprise-grade RAG systems
explainable AI assistants
policy-aware AI copilots
regulated decision-support systems
multimodal intelligence applications
auditable AI workflows

Repository Structure

OtiLLM/
├── otillm/
│   ├── core/
│   ├── evidence/
│   ├── multimodal/
│   ├── explainability/
│   └── utils/
├── tests/
├── examples/
├── README.md
├── LICENSE
├── pyproject.toml
└── setup.py

Running Tests

pytest

Roadmap

Future versions will extend this release with:

vector database integration
pluggable LLM backends
benchmark evaluation framework
CI/CD pipelines
enhanced multimodal processing
domain-specific policy modules
advanced memory and retrieval optimisation

Research Positioning

OtiLLM represents a shift from model-centric AI design to runtime-centric AI systems.

Rather than relying solely on model scale, OtiLLM focuses on:

structured evidence grounding
policy-aware execution
controlled memory evolution
built-in explainability

The framework supports ongoing research into reliable and governed AI systems.

Author

Oti Edema AI/ML Research Engineer and Data Scientist

LinkedIn: https://www.linkedin.com/in/oti-e-34838485/

Contributing

Contributions are welcome.

Areas of interest include:

retrieval system improvements
multimodal extensions
policy and governance modules
benchmarking and evaluation
documentation and examples

License

This project is released under the MIT License.

Project details

Release history Release notifications | RSS feed

0.2.0

May 13, 2026

This version

0.1.0

Apr 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

otillm-0.1.0.tar.gz (15.9 kB view details)

Uploaded Apr 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

otillm-0.1.0-py3-none-any.whl (15.8 kB view details)

Uploaded Apr 18, 2026 Python 3

File details

Details for the file otillm-0.1.0.tar.gz.

File metadata

Download URL: otillm-0.1.0.tar.gz
Upload date: Apr 18, 2026
Size: 15.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for otillm-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`900bb155f218b660365f6a6bd3a9877f72692f3c297325e4e5c41eaa35c7909d`
MD5	`96a5cfb149b733ba354922bdc17abadc`
BLAKE2b-256	`821aad19531f66201266af74215a50e7fc2c8b0febd56f4e797fb16c04253e33`

See more details on using hashes here.

File details

Details for the file otillm-0.1.0-py3-none-any.whl.

File metadata

Download URL: otillm-0.1.0-py3-none-any.whl
Upload date: Apr 18, 2026
Size: 15.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for otillm-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bf405c418497551009509b172b9f9d672aa648d7c79e45495f752e1964b607e3`
MD5	`fdaac9ad99e6948bac145257e1dfe3fe`
BLAKE2b-256	`80554327e2ccac35af498db396107410c2ac607e2bc068a0e9edd47fa48f33ce`

See more details on using hashes here.

otillm 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

OtiLLM 0.1.0

Overview

The Problem OtiLLM Solves

The OtiLLM Approach

How OtiLLM Differs from Existing Approaches

Standard LLM Pipelines

Traditional RAG Systems

Agent-Based Systems

OtiLLM

Key Components

Evidence Fabric

Policy Engine

Memory Engine

Cognitive Orchestrator

Explainability Layer

Architecture Overview

What This Release Includes

What This Release Does Not Claim

Installation

Quick Start

Example Output Behaviour

Use Cases

Repository Structure

Running Tests

Roadmap

Research Positioning

Author

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes