Real-time LLM hallucination guardrail — NLI + RAG fact-checking with token-level streaming halt

Maintainers

anulum

Project description

Director-AI — Real-time LLM Hallucination Guardrail

Sales Pitch & Pricing · Contact Sales · invest@anulum.li

What It Does

Director-AI sits between your LLM and the user. It scores every output for hallucination before it reaches anyone — and can halt generation mid-stream if coherence drops below threshold.

from director_ai import CoherenceAgent

agent = CoherenceAgent()
result = agent.process("What color is the sky?")

print(result.coherence.score)      # 0.90 (high coherence: 1 - (0.6 * 0.10 + 0.4 * 0.10))
print(result.coherence.approved)   # True
print(result.coherence.h_logical)  # 0.10 — low contradiction probability
print(result.coherence.h_factual)  # 0.10 — low factual deviation

Three things make it different:

Token-level streaming halt — not post-hoc review. The safety kernel monitors coherence token-by-token and severs output the moment it degrades.
Dual-entropy scoring — NLI contradiction detection (DeBERTa) + RAG fact-checking against your own knowledge base. Both must pass.
Your data, your rules — ingest PDFs, directories, or any text into a ChromaDB-backed knowledge base. The scorer checks LLM output against your ground truth, not a generic model.

Architecture

          ┌──────────────────────────┐
          │    Coherence Agent       │
          │    (Orchestrator)        │
          └─────────┬────────────────┘
                    │
       ┌────────────┼────────────────┐
       │            │                │
┌──────▼──────┐ ┌───▼──────────┐ ┌───▼────────────┐
│  Generator  │ │  Coherence   │ │  Safety        │
│  (LLM       │ │  Scorer      │ │  Kernel        │
│   backend)  │ │              │ │  (streaming    │
│             │ │  NLI + RAG   │ │   interlock)   │
└─────────────┘ └───┬──────────┘ └────────────────┘
                    │
          ┌─────────▼─────────┐
          │  Ground Truth     │
          │  Store            │
          │  (ChromaDB / RAM) │
          └───────────────────┘

Installation

# Basic install (heuristic scoring, no GPU needed)
pip install director-ai

# Extras are quoted so that shells such as zsh do not treat the brackets as glob patterns.

# With NLI model (DeBERTa-based contradiction detection)
pip install "director-ai[nli]"

# With vector store (ChromaDB for custom knowledge bases)
pip install "director-ai[vector]"

# With LangChain or LlamaIndex
pip install "director-ai[langchain]"
pip install "director-ai[llamaindex]"

# With REST API server
pip install "director-ai[server]"

# Fine-tuning pipeline
pip install "director-ai[train]"

# NLI, vector store, and REST server together
pip install "director-ai[nli,vector,server]"

# Development
git clone https://github.com/anulum/director-ai.git
cd director-ai
pip install -e ".[dev]"

Usage

Score a single response

from director_ai.core import CoherenceScorer, GroundTruthStore

store = GroundTruthStore()
store.add("sky color", "The sky is blue due to Rayleigh scattering.")

scorer = CoherenceScorer(threshold=0.6, ground_truth_store=store)
approved, score = scorer.review("What color is the sky?", "The sky is green.")

print(approved)     # False — contradicts ground truth
print(score.score)  # 0.42

With a real LLM backend

from director_ai import CoherenceAgent

# Works with any OpenAI-compatible endpoint (llama.cpp, vLLM, Ollama, etc.)
agent = CoherenceAgent(llm_api_url="http://localhost:8080/completion")
result = agent.process("Explain quantum entanglement")

if result.halted:
    print("Output blocked — coherence too low")
else:
    print(result.output)

Token-level streaming with halt

from director_ai.core import StreamingKernel

kernel = StreamingKernel(hard_limit=0.4, window_size=5, window_threshold=0.5)

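# my_token_iterator: placeholder for your own iterable of LLM tokens
# my_scorer: placeholder for your own callable that maps a token to a coherence score
session = kernel.stream_tokens(
    token_generator=my_token_iterator,
    coherence_callback=my_scorer,
)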

for event in session.events:
    if event.halted:
        print(f"\n[HALTED — {session.halt_reason}]")
        break
    print(event.token, end="")
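
To make the constructor thresholds concrete, here is a simplified sketch of a hard-limit check combined with a sliding-window average, in plain Python. It is an assumption about how such a gate can work, not the StreamingKernel implementation; `gate_tokens`, its return shape, and the halt-reason strings are hypothetical.

```python
from collections import deque
from typing import Callable, Iterable, List, Optional, Tuple


def gate_tokens(
    tokens: Iterable[str],
    score_fn: Callable[[str], float],
    hard_limit: float = 0.4,
    window_size: int = 5,
    window_threshold: float = 0.5,
) -> Tuple[List[str], Optional[str]]:
    """Emit tokens until a halt condition fires; return (emitted, halt_reason)."""
    window: deque = deque(maxlen=window_size)  # keeps only the most recent scores
    emitted: List[str] = []
    for tok in tokens:
        score = score_fn(tok)
        # Condition 1: a single score below the hard limit stops output at once.
        if score < hard_limit:
            return emitted, "hard_limit"
        window.append(score)
        # Condition 2: once the window is full, its mean must stay above the threshold.
        if len(window) == window_size and sum(window) / window_size < window_threshold:
            return emitted, "window_average"
        emitted.append(tok)
    return emitted, None


# Toy per-token scores (illustrative values only).
scores = {"The": 0.9, "sky": 0.9, "is": 0.8, "green": 0.3}
out, reason = gate_tokens(["The", "sky", "is", "green"], scores.__getitem__)
print(out, reason)
```

In this sketch the offending token is never emitted: the check runs before the token is appended to the output.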

NLI-based scoring (requires torch)

from director_ai.core import CoherenceScorer

scorer = CoherenceScorer(use_nli=True, threshold=0.6)
approved, score = scorer.review(
    "The Earth orbits the Sun.",
    "The Sun orbits the Earth."
)
print(score.h_logical)  # High — NLI detects contradiction

Custom knowledge base with ChromaDB

from director_ai.core import CoherenceScorer, VectorGroundTruthStore

store = VectorGroundTruthStore()  # Uses ChromaDB
store.add_fact("company policy", "Refunds are available within 30 days.")
store.add_fact("pricing", "Enterprise plan starts at $99/month.")

scorer = CoherenceScorer(ground_truth_store=store)
approved, score = scorer.review(
    "What is the refund policy?",
    "We offer full refunds within 90 days."  # Wrong
)
# approved = False — contradicts your KB

LangChain integration

pip install "director-ai[langchain,nli]"

from director_ai.integrations.langchain import DirectorAIGuard

guard = DirectorAIGuard(
    facts={"refund": "Refunds available within 30 days."},
    threshold=0.6,
    use_nli=True,
)

# Pipe after any LLM in a chain (my_llm is a placeholder for your own LangChain runnable)
chain = my_llm | guard
result = chain.invoke({"query": "What is the refund policy?"})

print(result["approved"])  # False if hallucinated
print(result["score"])     # 0.0–1.0 coherence

Raises HallucinationError if raise_on_fail=True. Async supported via ainvoke().

LlamaIndex integration

pip install "director-ai[llamaindex,nli]"

from director_ai.integrations.llamaindex import DirectorAIPostprocessor

postprocessor = DirectorAIPostprocessor(
    facts={"pricing": "Enterprise plan starts at $99/month."},
    threshold=0.6,
)

# Filters out hallucinated nodes before they reach the user
# (index is a placeholder for your existing LlamaIndex index)
query_engine = index.as_query_engine(
    node_postprocessors=[postprocessor]
)
response = query_engine.query("What does Enterprise cost?")

Adds director_ai_score metadata to surviving nodes. Also usable standalone via postprocessor.check(query, response).

More examples

Example	Backend	What it shows
`quickstart.py`	None	Guard any output in 10 lines
`openai_guard.py`	OpenAI	Score + streaming halt for GPT-4o
`ollama_guard.py`	Ollama	Local LLM guard with Llama 3
`langchain_guard.py`	LangChain	Full chain guardrail
`streaming_halt_demo.py`	Simulated	All 3 halt mechanisms visualised

Interactive demo

pip install director-ai gradio
python demo/app.py

Scoring Formula

Coherence = 1 - (0.6 * H_logical + 0.4 * H_factual)

Component	Source	Range	Meaning
H_logical	NLI model (DeBERTa)	0-1	Contradiction probability
H_factual	RAG retrieval	0-1	Ground truth deviation

Score >= 0.6 → approved (configurable)
Score < 0.5 → safety kernel emergency halt
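
As a worked example of the formula and thresholds above, here is a minimal sketch in plain Python. The function names and the three decision labels are illustrative assumptions, not part of the director_ai API; only the weights (0.6 and 0.4) and the thresholds (0.6 and 0.5) come from this section.

```python
def coherence(h_logical: float, h_factual: float) -> float:
    """Weighted combination from the formula above (weights 0.6 and 0.4)."""
    return 1.0 - (0.6 * h_logical + 0.4 * h_factual)


def decide(score: float, approve_at: float = 0.6, halt_below: float = 0.5) -> str:
    """Hypothetical helper: map a score onto the two thresholds listed above."""
    if score >= approve_at:
        return "approved"
    if score < halt_below:
        return "halt"
    # Between the two thresholds: not approved, but no emergency halt either.
    return "rejected"


score = coherence(h_logical=0.10, h_factual=0.10)
print(round(score, 2), decide(score))
```

Because H_logical carries the larger weight, a contradiction flagged by the NLI model lowers the score more than an equally large factual deviation.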

Benchmarks

Evaluated on LLM-AggreFact (29,320 samples across 11 datasets):

Model	AggreFact Balanced Acc	Latency (avg)
DeBERTa-v3-base (baseline)	66.2%	220 ms
Fine-tuned DeBERTa-v3-large	64.7%	223 ms
Fine-tuned DeBERTa-v3-base	59.0%	220 ms
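
Balanced accuracy is the mean of per-class recall, so a model that labels every claim as supported scores 50% however imbalanced the classes are. A minimal sketch for binary labels follows; the function name and toy labels are illustrative, and this is not the evaluation script in benchmarks/.

```python
from typing import Sequence


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of recall on the negative class (0) and the positive class (1)."""
    recalls = []
    for cls in (0, 1):
        idx = [i for i, y in enumerate(y_true) if y == cls]
        if not idx:
            raise ValueError(f"no examples of class {cls}")
        correct = sum(1 for i in idx if y_pred[i] == cls)
        recalls.append(correct / len(idx))
    return sum(recalls) / len(recalls)


# Toy data: 8 supported claims (1) and 2 unsupported claims (0).
y_true = [1] * 8 + [0] * 2
always_supported = [1] * 10
print(balanced_accuracy(y_true, always_supported))  # 0.5, although plain accuracy is 0.8
```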

Per-dataset highlights:

Dataset	Balanced Accuracy	Notes
Reveal	80.7%	Strong on factual claims
FactCheck-GPT	71.7%	Good on GPT-generated text
Lfqa	64.8%	Long-form QA
RAGTruth	58.9%	RAG-specific hallucination
AggreFact-CNN	53.0%	Summarization (known weak spot)

Head-to-head (same benchmark, same metric — LLM-AggreFact leaderboard):

Tool	Bal. Acc	Params	Latency	Streaming
Bespoke-MiniCheck-7B	77.4%	7B	~100 ms (GPU)	No
MiniCheck-Flan-T5-L	75.0%	0.8B	~120 ms	No
MiniCheck-DeBERTa-L	72.6%	0.4B	~120 ms	No
HHEM-2.1-Open	71.8%	~0.4B	~200 ms	No
Director-AI	66.2%	0.4B	220 ms	Yes

Honest assessment: The NLI scorer alone is not state-of-the-art. Director-AI's value is in the system: combining NLI with your own KB facts, streaming token-level gating, and configurable halt thresholds. None of the tools in the table above offers real-time streaming halt. The NLI component is pluggable; swap in any model that improves on these numbers.

Full comparison with SelfCheckGPT, RAGAS, NeMo Guardrails, Lynx, and others in benchmarks/comparison/. Benchmark scripts in benchmarks/. Fine-tuning pipeline in training/.

Package Structure

src/director_ai/
├── core/                           # Production API
│   ├── agent.py                    # CoherenceAgent — main orchestrator
│   ├── scorer.py                   # Dual-entropy coherence scorer
│   ├── kernel.py                   # Safety kernel (streaming interlock)
│   ├── streaming.py                # Token-level streaming oversight
│   ├── async_streaming.py          # Non-blocking async streaming
│   ├── nli.py                      # NLI scorer (DeBERTa)
│   ├── actor.py                    # LLM generator interface
│   ├── knowledge.py                # Ground truth store (in-memory)
│   ├── vector_store.py             # Vector store (ChromaDB backend)
│   ├── policy.py                   # YAML declarative policy engine
│   ├── audit.py                    # Structured JSONL audit logger
│   ├── tenant.py                   # Multi-tenant KB isolation
│   ├── sanitizer.py                # Prompt injection hardening
│   └── types.py                    # CoherenceScore, ReviewResult
├── integrations/                   # Framework integrations
│   ├── langchain.py                # LangChain Runnable guardrail
│   └── llamaindex.py               # LlamaIndex postprocessor
├── cli.py                          # CLI: review, process, batch, serve
├── server.py                       # FastAPI REST wrapper
benchmarks/                         # AggreFact evaluation suite
training/                           # DeBERTa fine-tuning pipeline

Testing

pytest tests/ -v

License & Pricing

Dual-licensed:

Open-Source: GNU AGPL v3.0 — research, personal use, open-source projects. Full source, self-host, no restrictions beyond AGPL copyleft obligations.
Commercial: Proprietary license from ANULUM — removes copyleft, allows closed-source and SaaS deployment.

Commercial Tiers

Tier	Monthly	Yearly	Best for
Hobbyist	$9	$90	Students, side projects, experiments. 1 local deployment, community support (GitHub/Discord), delayed updates.
Indie	$49	$490	Solo devs, bootstrapped teams (<$2M ARR). 1 production deployment, email support, 12 months updates.
Pro	$249	$2,490	Startups & scale-ups. Unlimited internal devs, multiple envs, Slack priority support, early releases.
Enterprise	Custom	Custom	Large orgs. SLA (99.9%), on-prem/air-gapped, SOC2/HIPAA-ready, dedicated engineer, custom NLI fine-tunes.

Perpetual license: $1,299 one-time (Indie equivalent). First 50 commercial licensees: 50% off first year.

Contact: anulum.li/contact or invest@anulum.li

See NOTICE for full terms and third-party acknowledgements.

Roadmap

Next Training Run (v1.1)

Current NLI baseline: 66.2% balanced accuracy on LLM-AggreFact. Target: 72%+ through:

Dataset rebalancing: downsample VitaminC from 370K to 100K examples. VitaminC currently makes up 50% of the training data, which biases the model toward fact verification.
Contamination fix: remove HaluEval from the training data. It appears in both the training set and the benchmark, which inflates reported numbers.
Threshold calibration: add a post-training calibration pass. The fine-tuned models underperform the baseline on AggreFact despite 91% raw accuracy.
MiniCheck backend: offer MiniCheck-DeBERTa-L as a pluggable alternative (72.6% on the same benchmark, no retraining needed).
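
The first two items amount to capping one data source and dropping another. A minimal standard-library sketch follows, assuming each training example is a dict with a hypothetical `source` field; the real pipeline in training/ may store its data differently.

```python
import random
from typing import Dict, List


def rebalance(
    examples: List[Dict[str, str]],
    cap_source: str,
    cap: int,
    drop_source: str,
    seed: int = 0,
) -> List[Dict[str, str]]:
    """Drop one source entirely and randomly downsample another to at most `cap` rows."""
    kept = [ex for ex in examples if ex["source"] not in (cap_source, drop_source)]
    capped = [ex for ex in examples if ex["source"] == cap_source]
    if len(capped) > cap:
        # A seeded generator keeps the subsample reproducible between runs.
        capped = random.Random(seed).sample(capped, cap)
    return kept + capped


# Toy data standing in for the real corpus (the roadmap targets 370K -> 100K).
data = (
    [{"source": "vitaminc", "text": f"v{i}"} for i in range(37)]
    + [{"source": "halueval", "text": f"h{i}"} for i in range(5)]
    + [{"source": "other", "text": f"o{i}"} for i in range(10)]
)
out = rebalance(data, cap_source="vitaminc", cap=10, drop_source="halueval")
print(len(out))  # 20: 10 "other" rows plus 10 downsampled "vitaminc" rows
```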

Planned Features

director-ai eval — structured CLI benchmarking
Webhook/callback on halt events
SQLite-backed usage dashboard at /v1/dashboard
Native OpenAI/Anthropic SDK in CoherenceAgent
HuggingFace Spaces live demo
Chunked NLI scoring for long documents
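
Chunked scoring is listed as planned, so its design is not documented here. One common approach, shown purely as an assumption, is to split the document into overlapping word windows, score each window, and keep the highest contradiction probability. `score_fn` stands in for any NLI scorer, and both helper names are hypothetical.

```python
from typing import Callable, List


def chunk_words(text: str, size: int = 200, overlap: int = 50) -> List[str]:
    """Split text into windows of `size` words, each sharing `overlap` words with the previous one."""
    if not 0 <= overlap < size:
        raise ValueError("overlap must be in [0, size)")
    words = text.split()
    step = size - overlap
    chunks = []
    for start in range(0, max(len(words), 1), step):
        chunks.append(" ".join(words[start:start + size]))
        if start + size >= len(words):
            break  # this window already reaches the end of the text
    return chunks


def max_contradiction(
    document: str,
    hypothesis: str,
    score_fn: Callable[[str, str], float],
    **chunk_kwargs: int,
) -> float:
    """Score the hypothesis against every chunk and return the highest contradiction probability."""
    return max(score_fn(chunk, hypothesis) for chunk in chunk_words(document, **chunk_kwargs))


print(chunk_words("a b c d e f g", size=4, overlap=2))
```

Taking the maximum is a conservative choice: a single contradicting passage is enough to flag the output, at the cost of more false positives on long documents.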

Citation

@software{sotek2026director,
  author    = {Sotek, Miroslav},
  title     = {Director-AI: Real-time LLM Hallucination Guardrail},
  year      = {2026},
  url       = {https://github.com/anulum/director-ai},
  version   = {1.0.0},
  license   = {AGPL-3.0-or-later}
}

Contributing

See CONTRIBUTING.md for guidelines. By contributing, you agree to the Code of Conduct and AGPL v3 licensing terms.

Security

See SECURITY.md for reporting vulnerabilities.

Release history

Version	Release date
3.12.0	Apr 5, 2026
3.11.1	Mar 27, 2026
3.11.0	Mar 27, 2026
3.10.0	Mar 24, 2026
3.9.5	Mar 22, 2026
3.9.4	Mar 20, 2026
3.9.3	Mar 19, 2026
3.9.2	Mar 19, 2026
3.9.0	Mar 18, 2026
3.4.0	Mar 9, 2026
3.0.0	Mar 6, 2026
2.7.1	Mar 3, 2026
2.6.0	Mar 3, 2026
2.4.0	Mar 2, 2026
2.3.0	Mar 2, 2026
2.2.0	Mar 2, 2026
2.0.0	Mar 2, 2026
1.7.0	Mar 1, 2026
1.6.0	Mar 1, 2026
1.4.1	Mar 1, 2026
1.4.0	Mar 1, 2026
1.3.0	Mar 1, 2026
1.2.1	Feb 27, 2026
1.2.0	Feb 27, 2026
1.1.0 (this version)	Feb 26, 2026
1.0.0	Feb 26, 2026
0.9.0	Feb 25, 2026
0.8.2	Feb 25, 2026
0.8.1	Feb 23, 2026
0.8.0	Feb 22, 2026

Download files

Source Distribution

director_ai-1.1.0.tar.gz (94.8 kB)

Uploaded Feb 26, 2026 Source

Built Distribution

director_ai-1.1.0-py3-none-any.whl (72.5 kB)

Uploaded Feb 26, 2026 Python 3

File details

Details for the file director_ai-1.1.0.tar.gz.

File metadata

Download URL: director_ai-1.1.0.tar.gz
Upload date: Feb 26, 2026
Size: 94.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for director_ai-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`4d3f00567447e802cd6720ddc6dcf6e40fe49af6f4a19634ac2eb109fbbf4daa`
MD5	`f7d3b476a2539431440208e4356d3e9f`
BLAKE2b-256	`0f726829eaebf741c0f63eb7f17e7d5b453a5106505d5f4a1942732483a82f6e`
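
To check a download against the SHA256 digest listed above, a short standard-library sketch is enough. It assumes the archive has been saved to the current directory under its published file name; the helper name is illustrative.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for director_ai-1.1.0.tar.gz.
EXPECTED_SHA256 = "4d3f00567447e802cd6720ddc6dcf6e40fe49af6f4a19634ac2eb109fbbf4daa"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so large archives are not read into memory at once."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for block in iter(lambda: fh.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


archive = Path("director_ai-1.1.0.tar.gz")
if archive.exists():
    actual = sha256_of(archive)
    print("OK" if actual == EXPECTED_SHA256 else f"MISMATCH: {actual}")
else:
    print(f"{archive} not found; download it first")
```

The same function works for the wheel if `EXPECTED_SHA256` and the file name are swapped for the values in its own hash table.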

Provenance

The following attestation bundles were made for director_ai-1.1.0.tar.gz:

Publisher: publish.yml on anulum/director-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: director_ai-1.1.0.tar.gz
- Subject digest: 4d3f00567447e802cd6720ddc6dcf6e40fe49af6f4a19634ac2eb109fbbf4daa
- Sigstore transparency entry: 1000393582
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: anulum/director-ai@5808ebaa02be6cbb14b53d46dd0390fde8b4fd77
- Branch / Tag: refs/tags/v1.1.0
- Owner: https://github.com/anulum
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5808ebaa02be6cbb14b53d46dd0390fde8b4fd77
- Trigger Event: release

File details

Details for the file director_ai-1.1.0-py3-none-any.whl.

File metadata

Download URL: director_ai-1.1.0-py3-none-any.whl
Upload date: Feb 26, 2026
Size: 72.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for director_ai-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a04893265b33c6b3d6b040763b3181ce10e0bb6431968979e742d841f9e0a3ff`
MD5	`5ab26b849480daf5d85f1de712ae53d8`
BLAKE2b-256	`dfabec8530403e5ee42f389bf745db9e929fc8dcb87a1f2760ecc2443025cb40`

Provenance

The following attestation bundles were made for director_ai-1.1.0-py3-none-any.whl:

Publisher: publish.yml on anulum/director-ai

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: director_ai-1.1.0-py3-none-any.whl
- Subject digest: a04893265b33c6b3d6b040763b3181ce10e0bb6431968979e742d841f9e0a3ff
- Sigstore transparency entry: 1000393633
- Sigstore integration time: Feb 26, 2026
Source repository:
- Permalink: anulum/director-ai@5808ebaa02be6cbb14b53d46dd0390fde8b4fd77
- Branch / Tag: refs/tags/v1.1.0
- Owner: https://github.com/anulum
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@5808ebaa02be6cbb14b53d46dd0390fde8b4fd77
- Trigger Event: release
