Self-improving retrieval orchestration framework with RL-based routing, conditional graph activation, and evaluation-driven learning.

These details have not been verified by PyPI

Project links

Project description

adaptive-intelligence

Self-improving retrieval orchestration framework for document intelligence. Drop documents, ask questions, the system learns how to retrieve better over time.

RL-based retrieval routing, conditional graph activation, evaluation-driven learning, vectorless mode, and zero-configuration architecture. Works with any LLM.

Install

pip install adaptive-intelligence
pip install adaptive-intelligence[pdf]          # PDF support
pip install adaptive-intelligence[sql]          # SQL connector
pip install adaptive-intelligence[all]          # all document formats
pip install adaptive-intelligence[huggingface]  # local HuggingFace models

Quick Start

from adaptive_intelligence import AdaptiveAI

# Zero config — defaults to Ollama (free, local, private)
engine = AdaptiveAI()
engine.ingest("./documents")
response = engine.ask("What are the key operational risks?")

print(response.answer)
print(f"Confidence: {response.confidence:.0%}")
print(response.evaluation.display())

# Vectorless mode — no embeddings, no ChromaDB, zero dependencies
engine = AdaptiveAI(vectorless=True)
engine.ingest("./documents")
response = engine.ask("Revenue details?")
print(response.citations[0].page)  # Page number citation

# Structured output
response = engine.ask("Extract vendors", output_format="json")
print(response.structured)  # Parsed dict

# User feedback → RL reward
engine.feedback(response.query_id, "good")
engine.feedback(response.query_id, "bad", reason="Missing data")

Comparison: Traditional RAG vs GraphRAG vs Adaptive Intelligence

S.No.	Capability	Traditional RAG	GraphRAG	Adaptive Intelligence
1	Retrieval strategy	Static vector similarity	Always graph + vector	RL-learned per query (6+ routes)
2	Knowledge graph	None	Always on	Conditional (5-signal gate)
3	Learning	None — same forever	None — static	Improves every query (RL reward loop)
4	Evaluation	None built-in	None built-in	6 metrics automatic per response
5	Prompt engineering	Fixed template	Fixed template	Domain-aware, evolving from feedback
6	Query understanding	None	Basic	Type + complexity + domain + entities
7	Keyword retrieval	No (vector only)	No	BM25 + hybrid RRF fusion
8	Vectorless option	No	No	Page BM25 + graph (zero dependencies)
9	Graph compute waste	N/A	Yes — runs on every query	No — conditional activation
10	Output formats	Text only	Text only	JSON, CSV, YAML, DataFrame
11	User feedback	None	None	Thumbs up/down → RL update
12	Crash recovery	None	Partial	Auto-checkpoint + graceful shutdown
13	LLM agnostic	Usually tied to one	Usually tied to one	Any provider (10+ supported)
14	Audit trail	None	None	Full per-query decision log

Version Evolution (v1 → v2)

S.No.	Feature	v1	v2 (current)
1	RL policy engine	Thompson Sampling	+ configurable warmup
2	Conditional graph	5-signal gate + BFS	+ disk persistence
3	Evaluation	6 metrics → reward	+ user feedback as reward
4	Ingestion	Basic parsing	Hardened (every edge case)
5	SQL connector	No	PostgreSQL, MySQL, SQLite
6	Vectorless mode	No	Page BM25, zero dependencies
7	Output formats	Text only	JSON, CSV, YAML, DataFrame, custom schema
8	User feedback	No	Thumbs up/down → RL reward
9	Crash recovery	Partial (ChromaDB only)	Full (BM25 + graph + RL + memory)
10	Incremental ingest	No	add / remove / update
11	Parallel ingestion	No	ThreadPoolExecutor
12	System prompt	Hardcoded	Custom per-init, per-query
13	Page citations	chunk_id only	Page numbers
14	Chunk quality	No filtering	Quality scoring + dedup (SimHash)
15	Providers	Ollama + OpenAI + HF	10+ (NVIDIA, Groq, Gemini, Grok, Azure...)

Experimental Results

20 queries, 5 categories. Same LLM backend, same document corpus.

S.No.	Query category	Traditional RAG	Adaptive Intelligence	Delta
1	Factual	85%	90%	+5%
2	Relational	45%	78%	+33%
3	Analytical	55%	75%	+20%
4	Comparative	50%	80%	+30%
5	Multi-hop	35%	72%	+37%
6	Overall	54%	79%	+25%

Metric: keyword coverage (fraction of expected keywords in answer). Both systems use identical LLM and corpus. Adaptive Intelligence used 5 different retrieval strategies; Traditional RAG used 1. Graph activated for 5/20 queries (conditional, not wasted on simple lookups).

Three Core Innovations

1. RL Policy Engine

Contextual bandits with Thompson Sampling learn which retrieval strategy works best for each query type. First 15 queries use heuristic defaults, then RL takes over. No hardcoded rules.

2. Conditional Graph Activation

Knowledge graph built automatically during ingestion. Activated only when the query needs relational reasoning — not wasted on simple factual lookups. Five signals gate activation.

3. Self-Adaptive Retrieval

Every response evaluated on 6 metrics. Composite score becomes RL reward. System measurably improves over queries.

v2 Features

Vectorless Mode

No embeddings. No ChromaDB. No vector DB at all. Pure Python BM25 + knowledge graph + RL routing.

engine = AdaptiveAI(vectorless=True)
# Page-level BM25 index, page citations, zero dependencies
# RL + graph + evaluation still fully active

Output Formats

response = engine.ask("Extract vendors", output_format="json")
response = engine.ask("List items", output_format="csv")
response = engine.ask("Show data", output_format="yaml")
response = engine.ask("Revenue breakdown", output_format="dataframe")

# Custom schema
response = engine.ask("Contract details", output_format="json",
    schema={"parties": ["str"], "value": "float", "date": "str"})

User Feedback

engine.feedback(response.query_id, "good")   # RL reward boost
engine.feedback(response.query_id, "bad", reason="Wrong data")  # RL penalty + prompt evolution

Incremental Ingestion

engine.ingest("./new_report.pdf")       # Add
engine.remove("old_report.pdf")          # Remove
engine.update("./updated_report.pdf")    # Re-index

# Parallel
engine.ingest("./docs/", parallel=True, workers=4)

SQL Connector

engine.ingest("sqlite:///data.db")
engine.ingest("postgresql://user:pass@host/db", tables=["invoices", "vendors"])
engine.ingest("mysql://user:pass@host/db", query="SELECT * FROM orders WHERE year=2025")

Crash Recovery

Auto-checkpoint every 5 minutes. BM25, graph, RL policy, and memory all persist to disk. Graceful shutdown on SIGTERM/SIGINT. Auto-recovery on startup.

Hardened Ingestion

Handles every edge case: corrupted PDFs, password-protected files, scanned images (auto-OCR), Excel merged cells, hidden sheets, formula evaluation, CSV wrong delimiters, mixed encodings, malformed rows.

Supported Formats

S.No.	Format	Extension	Required Package
1	Text / Markdown	.txt, .md	—
2	CSV / TSV	.csv, .tsv	—
3	JSON	.json	—
4	HTML	.html	—
5	XML	.xml	—
6	PDF	.pdf	`PyMuPDF` or `pdfplumber`
7	Word	.docx	`python-docx`
8	Excel	.xlsx	`openpyxl`
9	PowerPoint	.pptx	`python-pptx`
10	Images (OCR)	.png, .jpg	`pytesseract`, `Pillow`
11	SQL databases	—	`sqlalchemy`

Providers — Copy, Paste, Run

Free Providers

# Ollama (default, local, free)
engine = AdaptiveAI()

# Ollama specific model
engine = AdaptiveAI(llm_model="llama3.2")

# NVIDIA NIM (free)
engine = AdaptiveAI(api_key="nvapi-...",
    base_url="https://integrate.api.nvidia.com/v1",
    llm_model="meta/llama-3.1-70b-instruct")

# Groq (free tier)
engine = AdaptiveAI(api_key="gsk_...",
    base_url="https://api.groq.com/openai/v1",
    llm_model="llama-3.3-70b-versatile")

# Google Gemini (free tier)
engine = AdaptiveAI(api_key="...",
    base_url="https://generativelanguage.googleapis.com/v1beta/openai",
    llm_model="gemini-2.0-flash")

# HuggingFace (local, any model)
engine = AdaptiveAI(llm_backend="huggingface", llm_model="microsoft/phi-2")

# Together AI (free tier)
engine = AdaptiveAI(api_key="...",
    base_url="https://api.together.xyz/v1",
    llm_model="meta-llama/Llama-3-70b-chat-hf")

Paid Providers

# OpenAI
engine = AdaptiveAI(api_key="sk-...", llm_model="gpt-4o")

# Grok (xAI)
engine = AdaptiveAI(api_key="xai-...",
    base_url="https://api.x.ai/v1", llm_model="grok-3-mini")

# Azure OpenAI
engine = AdaptiveAI(llm_backend="azure_openai", api_key="...",
    azure_endpoint="https://your-resource.openai.azure.com",
    deployment_name="gpt-4o")

# AWS Bedrock (via gateway)
engine = AdaptiveAI(api_key="...",
    base_url="https://your-bedrock-gateway/v1",
    llm_model="anthropic.claude-v2")

No LLM

# Retrieval only — returns ranked excerpts
engine = AdaptiveAI(llm_backend="none")

# Full zero-dependency mode
engine = AdaptiveAI(llm_backend="none", vectorless=True)

Code Examples

Inspect the Full Pipeline

response = engine.ask("Compare Q2 and Q3 revenue")

# What did the system understand?
print(response.query_analysis)

# What strategy did the RL policy choose?
pd = response.policy_decision
print(f"Route: {pd.retrieval_route}")
print(f"Graph: {pd.graph_activation}")
print(f"Explored: {pd.was_exploration}")

# Evaluation scores
print(response.evaluation.display())

# Citations with page numbers
for c in response.citations:
    print(f"  {c.source_document}, Page {c.page} ({c.confidence:.0%})")

Dashboard and Monitoring

print(engine.dashboard())

stats = engine.rl.get_stats()
print(f"Warmup: {stats['is_warmup']}")
print(f"Arms learned: {stats['total_arms']}")

curve = engine.learning_curve()
print(engine.memory.get_learning_summary())

engine.audit.export("audit.json")

System Prompt

# Set at init
engine = AdaptiveAI(system_prompt="You are a financial analyst. Cite page numbers.")

# Override per query
response = engine.ask("Risks?",
    system_prompt="You are a risk specialist. Rate each HIGH/MEDIUM/LOW.")

# Update anytime
engine.set_system_prompt("You are a legal reviewer. Flag violations.")
engine.set_system_prompt(None)  # Reset to default

Advanced Configuration

from adaptive_intelligence.core.config import (
    AdaptiveConfig, RLConfig, GraphConfig, EvaluationConfig,
    LLMBackend, Domain, SecurityLevel,
)

config = AdaptiveConfig(
    llm_backend=LLMBackend.OLLAMA,
    llm_model="llama3.2",
    domain=Domain.FINANCIAL,
    security_level=SecurityLevel.HIGH,
    rl=RLConfig(warmup_queries=20, exploration_rate=0.15),
    graph=GraphConfig(conditional_activation=True, max_hops=3),
    evaluation=EvaluationConfig(faithfulness_weight=0.35, enable_llm_judge=True),
)
engine = AdaptiveAI(config=config)

Why This Exists

The world has 500+ LLMs. Every cloud provider (Azure, AWS, GCP), every platform (HuggingFace, Ollama), every company (NVIDIA, Meta, Google, xAI) is producing models. Many are free.

But every LLM has the same problem: garbage in, garbage out.

adaptive-intelligence is the layer that learns WHAT to feed the LLM. The LLM is replaceable. The retrieval intelligence is not.

FAQ

Q: How does ingestion handle mixed content (text + tables) from PDFs?

Tables are extracted with structure preserved (is_table=True) and indexed into the same indexes as text. The RL policy learns to use the table_first retrieval route for structured queries. Separation happens at retrieval time via learned routing, not at ingestion time.

Q: How is this different from just using ChatGPT / Claude?

adaptive-intelligence is not an LLM — it's the retrieval layer that decides what context to feed TO the LLM. It uses ChatGPT, Claude, Grok, or Ollama as backends. The system learns which retrieval strategy works best for each query type on YOUR specific documents.

Q: What is vectorless mode?

No ChromaDB. No embeddings. No vector DB. Pure Python BM25 keyword search over full pages + knowledge graph + RL routing. Everything still learns and improves — just without vector similarity. Best for financial/legal/medical documents with standardized terminology, or air-gapped environments.

Q: Does the RL policy persist across sessions?

Yes. RL state, BM25 index, knowledge graph, and learning memory all persist to disk. Auto-checkpoint every 5 minutes. Auto-recovery on startup.

Q: Can I use this without an LLM (offline)?

Yes. AdaptiveAI(llm_backend="none") returns ranked source excerpts directly. RL, graph, and evaluation still work.

Roadmap

v2.0 (Current)

RL policy engine (Thompson Sampling)
Conditional graph activation (5-signal gate)
6-metric evaluation → RL reward loop
Vectorless mode (page BM25, zero dependencies)
Output formats (JSON, CSV, YAML, DataFrame)
User feedback → RL reward
Crash recovery (auto-checkpoint, graceful shutdown)
Hardened ingestion (every PDF/Excel/CSV edge case)
SQL connector (PostgreSQL, MySQL, SQLite)
Incremental ingestion (add/remove/update)
Parallel ingestion
Chunk quality scoring + deduplication
Custom system prompt support
10+ providers copy-paste ready
Page-number citations

v3.0 — Intelligence Upgrades

PPO/DQN alongside Thompson Sampling
GraphSAGE embeddings replacing BFS
Cross-encoder reranking
Multi-query decomposition
Pre-trained domain policies (financial, legal, healthcare)
Transfer learning across deployments
llmevalkit compliance integration
Multi-modal ingestion (images, charts, audio)
Enterprise connectors (S3, Notion, Confluence)
Plugin API for community connectors

Also by the Author

llmevalkit — LLM evaluation, hallucination detection, compliance, and 61 metrics
Responsible AI Series — HIPAA, GDPR, NIST AI RMF, CoSAI, EU AI Act

Citation

@article{venkatkumar2026adaptive,
  title={Adaptive Retrieval Orchestration for Self-Learning Knowledge Systems},
  author={Venkatkumar, Rajan},
  year={2026},
  url={https://www.researchgate.net/publication/405076088},
  note={Available at ResearchGate and GitHub: github.com/VK-Ant/adaptive-intelligence}
}

License

Apache License 2.0

Author

Venkatkumar Rajan

LinkedIn: https://linkedin.com/in/venkatkumarvk
GitHub: https://github.com/VK-Ant
Portfolio: https://vk-ant.github.io/Venkatkumar/
PyPI: https://pypi.org/project/adaptive-intelligence/
ResearchGate: https://www.researchgate.net/publication/405076088

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

3.0.0

May 24, 2026

This version

2.0.1

May 22, 2026

2.0.0

May 22, 2026

1.0.3

May 16, 2026

1.0.2

May 16, 2026

1.0.1

May 16, 2026

1.0.0

May 16, 2026

0.1.0

May 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

adaptive_intelligence-2.0.1.tar.gz (79.6 kB view details)

Uploaded May 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

adaptive_intelligence-2.0.1-py3-none-any.whl (74.8 kB view details)

Uploaded May 22, 2026 Python 3

File details

Details for the file adaptive_intelligence-2.0.1.tar.gz.

File metadata

Download URL: adaptive_intelligence-2.0.1.tar.gz
Upload date: May 22, 2026
Size: 79.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for adaptive_intelligence-2.0.1.tar.gz
Algorithm	Hash digest
SHA256	`15e3bd5cc02883fdf970ff6c8cc8f13011a0ba80b03ce96dac61bf2441e4f34b`
MD5	`eba96c72b2e3789baaa5bfe36d7e9a26`
BLAKE2b-256	`daf6d8da3e717af662c88c3d489e9f537e3cec187b98fa05f9c9493cdae28229`

See more details on using hashes here.

File details

Details for the file adaptive_intelligence-2.0.1-py3-none-any.whl.

File metadata

Download URL: adaptive_intelligence-2.0.1-py3-none-any.whl
Upload date: May 22, 2026
Size: 74.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for adaptive_intelligence-2.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2b6c97c709bb0ed6e6bced0c1afe26e0085c9430a41f34f7b1f4c5117a2b8d00`
MD5	`2f814eea72e88c337ff7f99ce0040ce4`
BLAKE2b-256	`e0d1918859055e14b3289a5aff0957c9cc959f59f0dd844f7aa1351c1c58b945`

See more details on using hashes here.

adaptive-intelligence 2.0.1

Navigation

Verified details

Project links

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

adaptive-intelligence

Install

Quick Start

Comparison: Traditional RAG vs GraphRAG vs Adaptive Intelligence

Version Evolution (v1 → v2)

Experimental Results

Three Core Innovations

1. RL Policy Engine

2. Conditional Graph Activation

3. Self-Adaptive Retrieval

v2 Features

Vectorless Mode

Output Formats

User Feedback

Incremental Ingestion

SQL Connector

Crash Recovery

Hardened Ingestion

Supported Formats

Providers — Copy, Paste, Run

Free Providers

Paid Providers

No LLM

Code Examples

Inspect the Full Pipeline

Dashboard and Monitoring

System Prompt

Advanced Configuration

Why This Exists

FAQ

Roadmap

v2.0 (Current)

v3.0 — Intelligence Upgrades

Also by the Author

Citation

License

Author

Project details

Verified details

Project links

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes