Skip to main content

Epistemic uncertainty layer for LLMs. Stop hallucinations. Let AI say 'I don't know.'

Project description

KATERYNA

Epistemic Uncertainty Layer for LLMs

Stop your LLMs from hallucinating. Let them say "I don't know."

PyPI version License: MIT Docs

Documentation | PyPI | GitHub


The Problem

User: "What is the capital of Freedonia?"

GPT-4 (binary): "The capital of Freedonia is Fredville."
                 ^ Confident. Wrong. Freedonia is fictional.

Kateryna Layer:  "I don't know. I found no grounded information
                  about Freedonia in my knowledge base."
                 ^ Abstained. Correct response.

LLMs hallucinate because binary architecture forces yes/no responses. There's no native way to represent "I don't know."

Kateryna adds that capability.


The Solution: Ternary Logic

Based on Nikolai Brusentsov's 1958 Setun computer - the first (and only) balanced ternary computer ever mass-produced.

State Value Meaning Action
CONFIDENT +1 Strong RAG evidence, response matches grounding Return answer
UNCERTAIN 0 Weak/no evidence, model hedging Abstain
OVERCONFIDENT -1 Confident language WITHOUT evidence DANGER FLAG

The Critical Insight

The -1 state is the breakthrough.

Traditional confidence scores miss this:

  • Binary: "Is the model confident?" Yes/No
  • Probability: "How confident?" 0-100%

Neither asks: "Is the confidence justified?"

Kateryna asks: Does the confidence level match the grounding evidence?

Confident response + Strong RAG grounding = +1 (Trust it)
Uncertain response + Weak RAG grounding   =  0 (Appropriate uncertainty)
Confident response + Weak RAG grounding   = -1 (HALLUCINATION RISK)

The -1 state catches confident bullshit.


Installation

# Core package (works with any LLM)
pip install kateryna

# With OpenAI support
pip install kateryna[openai]

# With Anthropic support
pip install kateryna[anthropic]

# With local Ollama support
pip install kateryna[ollama]

# All adapters
pip install kateryna[all]

Quick Start

Standalone Detector (Any LLM)

from kateryna import EpistemicDetector, TernaryState

detector = EpistemicDetector()

# Analyze any LLM output
state = detector.analyze(
    text="The capital of Freedonia is definitely Fredville.",
    question="What is the capital of Freedonia?",
    retrieval_confidence=0.05,  # Low RAG score
    chunks_found=0
)

if state.is_danger_zone:
    print(f"DANGER: {state.reason}")
    # "DANGER: Confident response without grounding (RAG: 5%)"

# The -1 state catches hallucinations that LOOK confident
print(state.state)  # TernaryState.OVERCONFIDENT

With OpenAI

from openai import OpenAI
from kateryna.adapters.openai import OpenAISyncEpistemicAdapter

client = OpenAISyncEpistemicAdapter(OpenAI(), model="gpt-4")

response = client.generate_with_rag(
    prompt="How does this function work?",
    rag_chunks=[
        {"content": "def add(a, b): return a + b", "distance": 0.1},
        {"content": "# Adds two numbers together", "distance": 0.15},
    ]
)

if response.epistemic_state.grounded:
    print(f"Confident answer: {response.content}")
elif response.epistemic_state.is_danger_zone:
    print("WARNING: Potential hallucination detected")

With Local Ollama

from kateryna.adapters.ollama import OllamaSyncEpistemicAdapter

client = OllamaSyncEpistemicAdapter(model="llama3.2")

response = client.generate_with_rag(
    prompt="Explain this COBOL code",
    rag_chunks=my_retrieved_chunks
)

Pre-Question Filtering (Save Tokens!)

detector = EpistemicDetector()

# Don't even call the LLM for unanswerable questions
should_abstain, reason = detector.should_abstain_on_question(
    "What will Bitcoin be worth in 2030?"
)

if should_abstain:
    print("Abstaining - question asks for prediction")

RAG Integration

Kateryna works with any vector database. Just pass chunks with a score field:

# Pinecone / ChromaDB (distance)
chunks = [{"content": "...", "distance": 0.1}]

# Weaviate (score)
chunks = [{"text": "...", "score": 0.85}]

# Custom (relevance or similarity)
chunks = [{"content": "...", "relevance": 0.9}]
chunks = [{"content": "...", "similarity": 0.88}]

Ternary State Mapping

RAG Confidence LLM Response State Grounded Action
High (>0.7) Confident +1 Yes Return
High (>0.7) Uncertain 0 Yes Abstain
Medium (0.3-0.7) Confident +1 Yes Return
Medium (0.3-0.7) Uncertain 0 No Abstain
Low (<0.3) Confident -1 No DANGER
Low (<0.3) Uncertain 0 No Abstain
None Any 0 No Abstain

Key insight: A confident response without evidence is MORE dangerous than an uncertain one.


Named After

Kateryna Yushchenko (1919-2001) - Ukrainian computer scientist who invented indirect addressing (pointers) in 1955, systematically erased from Western computing history.

Her work on indirect addressing made modern programming possible. This work on epistemic addressing makes AI trustworthy.


Research Foundation

  • Setun Computer (1958) - Nikolai Brusentsov, Moscow State University
  • Three-Valued Logic - Jan Lukasiewicz (1920s)
  • Ternary Inference - DOI: 10.5281/zenodo.17875182

License

MIT


"AI that knows when it doesn't know."

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kateryna-1.0.4.tar.gz (32.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kateryna-1.0.4-py3-none-any.whl (24.1 kB view details)

Uploaded Python 3

File details

Details for the file kateryna-1.0.4.tar.gz.

File metadata

  • Download URL: kateryna-1.0.4.tar.gz
  • Upload date:
  • Size: 32.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for kateryna-1.0.4.tar.gz
Algorithm Hash digest
SHA256 7cb2cdece0fbb0969ceba8a1e98cff5815558725d9479e24e11b2675a5acb023
MD5 5fef91861b6b4136eaa278b2682dac92
BLAKE2b-256 1f461e1db34a637c33f1e1b2127cee2ac45cc59db3f80fcb6f5e2009166000a9

See more details on using hashes here.

File details

Details for the file kateryna-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: kateryna-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 24.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for kateryna-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 3c5dde654ecc7c9f507e367dcccd591ac0ecdcb41855e8780f1e494670e99d88
MD5 66a196f6d5b18f3bc3c7cee0bed46184
BLAKE2b-256 ac86e403d4d5b9b3974937f1734789d862beca473e725b97846ef6f7700d35bd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page