Skip to main content

OtiLLM 0.1.0: Evidence-native, policy-aware AI runtime for reliable AI systems

Project description

OtiLLM 0.1.0

Evidence-Native, Policy-Aware AI Runtime for Reliable AI Systems


Overview

OtiLLM 0.1.0 is the founding open-source release of OtiLLM, a next-generation AI runtime architecture designed to improve the reliability, governance, and explainability of modern AI systems.

While large language models and retrieval systems have advanced significantly, their real-world deployment often exposes fundamental weaknesses. OtiLLM addresses these by restructuring how AI systems operate internally, introducing a runtime in which evidence, policy, memory, and explainability are tightly integrated and enforced.

This repository provides a working, extensible implementation of that architecture for researchers, engineers, and organisations building high-trust AI systems.


The Problem OtiLLM Solves

Modern AI systems frequently fail in high-stakes environments due to:

  • answers generated without sufficient or verifiable evidence
  • weak or non-existent policy enforcement
  • uncontrolled or low-quality memory accumulation
  • limited visibility into reasoning and decision processes
  • unreliable behaviour in long-running or agent-based workflows

These limitations are not purely model problems. They are system design problems.


The OtiLLM Approach

OtiLLM introduces a structured runtime in which every meaningful output follows a controlled lifecycle:

Input → Evidence → Reason → Verify → Align → Act → Explain

This replaces loosely coupled pipelines with a bounded, auditable, and evidence-driven execution model.


How OtiLLM Differs from Existing Approaches

Standard LLM Pipelines

  • rely heavily on prompt engineering
  • limited visibility into reasoning
  • no explicit evidence validation
  • no runtime governance

Traditional RAG Systems

  • improve factual grounding
  • but often rely on naive retrieval
  • lack policy awareness
  • limited explainability
  • no structured memory control

Agent-Based Systems

  • powerful but often unbounded
  • difficult to control or audit
  • prone to unsafe or inconsistent behaviour

OtiLLM

OtiLLM combines the strengths of these approaches while addressing their weaknesses:

  • evidence is explicitly retrieved, scored, and validated
  • policies are enforced before execution
  • memory is gated and quality-controlled
  • outputs are traceable and explainable
  • system behaviour is bounded and auditable

Key Components

Evidence Fabric

A hybrid retrieval layer that evaluates information using multiple signals:

  • semantic relevance
  • keyword overlap
  • temporal freshness
  • graph-aware signals
  • source trust (provenance)

This enables more reliable evidence selection than standard retrieval pipelines.

Policy Engine

A runtime governance layer that evaluates whether a request or action is allowed before execution.

This enables safer deployment in regulated and high-trust environments.

Memory Engine

A gated memory system that only stores information when it is:

  • sufficiently high quality
  • policy-compliant
  • novel

This prevents uncontrolled accumulation and improves long-term reliability.

Cognitive Orchestrator

The central coordination layer that integrates retrieval, validation, scoring, and generation.

It ensures that outputs are only produced when evidence and confidence thresholds are satisfied.

Explainability Layer

A built-in tracing system that provides visibility into how each response is generated, including:

  • retrieved sources
  • evidence scores
  • confidence estimation
  • policy decisions
  • execution outcomes
  • memory updates

Architecture Overview

OtiLLM is organised as a structured runtime pipeline:

Multimodal Input Perception Layer Evidence Fabric Cognitive Orchestrator Policy Engine Memory Engine Generator / Action Layer Explainability Trace Output

This design enables controlled, interpretable, and verifiable AI behaviour.


What This Release Includes

This initial release provides:

  • a modular Python package implementing the OtiLLM runtime
  • evidence ingestion and hybrid retrieval
  • policy-aware request handling
  • memory-gated storage logic
  • explainability trace generation
  • working examples demonstrating usage
  • a test suite for core components
  • packaging configuration for distribution

What This Release Does Not Claim

OtiLLM 0.1.0 is a foundational runtime framework.

It does not claim:

  • state-of-the-art benchmark performance
  • a fully trained large-scale foundation model
  • production-grade distributed infrastructure
  • complete multimodal training pipelines

Instead, it establishes the architectural and implementation foundation required for those capabilities.


Installation

Clone the repository:

git clone https://github.com/YOUR_GITHUB_USERNAME/OtiLLM.git
cd OtiLLM

Install locally:

pip install -e .

For development:

pip install -e .[dev]

Quick Start

from otillm import OtiLLM

model = OtiLLM()

model.add_evidence(
    content="Retrieval-Augmented Generation reduces hallucination by grounding outputs in external knowledge.",
    source="rag_reference",
    trust_score=0.9
)

model.add_evidence(
    content="Policy-aware AI systems are essential in regulated environments such as healthcare and finance.",
    source="governance_reference",
    trust_score=0.95
)

response = model.query("Why is policy-aware retrieval important?")

print(response.answer)
print(model.explain(response))

Example Output Behaviour

The system produces:

  1. A grounded response based on retrieved evidence
  2. A detailed trace explaining:
  • which sources were used
  • how they were scored
  • confidence level
  • evidence sufficiency
  • policy decision
  • execution outcome
  • whether memory was updated

This makes OtiLLM suitable for applications where transparency and accountability are required.


Use Cases

OtiLLM is particularly suited for:

  • enterprise-grade RAG systems
  • explainable AI assistants
  • policy-aware AI copilots
  • regulated decision-support systems
  • multimodal intelligence applications
  • auditable AI workflows

Repository Structure

OtiLLM/
├── otillm/
│   ├── core/
│   ├── evidence/
│   ├── multimodal/
│   ├── explainability/
│   └── utils/
├── tests/
├── examples/
├── README.md
├── LICENSE
├── pyproject.toml
└── setup.py

Running Tests

pytest

Roadmap

Future versions will extend this release with:

  • vector database integration
  • pluggable LLM backends
  • benchmark evaluation framework
  • CI/CD pipelines
  • enhanced multimodal processing
  • domain-specific policy modules
  • advanced memory and retrieval optimisation

Research Positioning

OtiLLM represents a shift from model-centric AI design to runtime-centric AI systems.

Rather than relying solely on model scale, OtiLLM focuses on:

  • structured evidence grounding
  • policy-aware execution
  • controlled memory evolution
  • built-in explainability

The framework supports ongoing research into reliable and governed AI systems.


Author

Oti Edema AI/ML Research Engineer and Data Scientist

LinkedIn: https://www.linkedin.com/in/oti-e-34838485/


Contributing

Contributions are welcome.

Areas of interest include:

  • retrieval system improvements
  • multimodal extensions
  • policy and governance modules
  • benchmarking and evaluation
  • documentation and examples

License

This project is released under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

otillm-0.1.0.tar.gz (15.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

otillm-0.1.0-py3-none-any.whl (15.8 kB view details)

Uploaded Python 3

File details

Details for the file otillm-0.1.0.tar.gz.

File metadata

  • Download URL: otillm-0.1.0.tar.gz
  • Upload date:
  • Size: 15.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for otillm-0.1.0.tar.gz
Algorithm Hash digest
SHA256 900bb155f218b660365f6a6bd3a9877f72692f3c297325e4e5c41eaa35c7909d
MD5 96a5cfb149b733ba354922bdc17abadc
BLAKE2b-256 821aad19531f66201266af74215a50e7fc2c8b0febd56f4e797fb16c04253e33

See more details on using hashes here.

File details

Details for the file otillm-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: otillm-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 15.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for otillm-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bf405c418497551009509b172b9f9d672aa648d7c79e45495f752e1964b607e3
MD5 fdaac9ad99e6948bac145257e1dfe3fe
BLAKE2b-256 80554327e2ccac35af498db396107410c2ac607e2bc068a0e9edd47fa48f33ce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page