Safe, explainability-first AutoML pipeline with AI vulnerability detection and autonomous research agent.

These details have not been verified by PyPI

Project links

Project description

ADIS — Automated Data Intelligence System

An explainability-first AutoML library with built-in AI vulnerability detection.

ADIS runs a complete data science pipeline — ingestion, cleaning, EDA, feature engineering, model benchmarking — and produces a human-readable explanation at every step. Its AI Critic then audits the entire pipeline for data leakage, metric illusions, overfitting risks, and production readiness.

Quick Start

Install

pip install -e .

Basic Usage (3 lines)

from adis import ADISPipeline

pipeline = ADISPipeline(target_column="target")
results = pipeline.run("data.csv")
pipeline.save_report()   # Saves report.json + report.md + cleaned_data.csv

Use Individual Modules

from adis import run_ingestion, run_cleaning, run_eda, run_critic

# Just ingest and inspect
result = run_ingestion("data.csv")
print(result["column_info"])     # Per-column type detection
print(result["validation"])      # Schema issues & warnings

# Clean a DataFrame
from adis import run_cleaning
cleaned = run_cleaning(df, column_info, strategy="knn")
print(cleaned["log"])            # Every cleaning action logged

# Run the AI Critic on any pipeline results
critic = run_critic(pipeline_results)
for vuln in critic["vulnerabilities"]:
    print(f"[{vuln['severity']}] {vuln['issue']}")

Use the Autonomous Agent (Experimental)

from adis.agent import AutoResearchAgent

agent = AutoResearchAgent(
    filepath="data.csv",
    target_column="price",
    max_iterations=10,
)
# Requires: GEMINI_API_KEY env var + ADIS_ALLOW_EXEC=1
results = agent.optimize()

What Makes ADIS Different

Feature	Typical AutoML	ADIS
Explainability	Post-hoc (SHAP/LIME)	Built into every step — `what_happened`, `why`, `impact`
Safety Audit	None	AI Critic detects leakage, metric illusions, overfitting
Pipeline Report	Metrics table	Full Markdown/JSON narrative with rationale
Leakage Prevention	Manual	Automatic — train/test split before feature engineering
Target	Best score	Best score that's safe for production

Pipeline Stages

CSV File
  │
  ▼
┌─────────────────┐
│   Ingestion     │  → Type detection, schema validation, warnings
├─────────────────┤
│   Cleaning      │  → Imputation, dedup, outlier detection, type coercion
├─────────────────┤
│   EDA           │  → Distributions, correlations, class imbalance, flags
├─────────────────┤
│   Feature Eng.  │  → Log/sqrt transforms, binning, OHE, datetime decomposition
├─────────────────┤
│   Feature Sel.  │  → Variance filter, correlation filter, mutual information
├─────────────────┤
│   Benchmarking  │  → 3-4 models + dummy baseline, full metric suite
├─────────────────┤
│   AI Critic     │  → Cross-signal vulnerability detection
└─────────────────┘
  │
  ▼
JSON/Markdown Report + Cleaned CSV

Each stage returns a structured result dict with:

df — The transformed DataFrame
explanation — Human-readable {title, what_happened, why, impact}
step — Stage identifier

AI Critic — Vulnerability Detection

The Critic cross-references signals from across the pipeline to flag issues that single-stage analysis would miss:

Vulnerability	What It Catches
Metric Illusion	High accuracy + low AUC on imbalanced data = model is lazy
Target Leakage	Near-perfect score driven by one dominant feature
Overfitting Risk	Complex model on tiny dataset
Temporal Leakage	Random split on time-series data
Production Blockers	Composite check — is this model safe to deploy?

critic = results["critic"]
print(critic["is_structurally_safe"])   # True/False
for v in critic["vulnerabilities"]:
    print(f"  [{v['severity']}] {v['issue']} (confidence: {v['confidence']})")

Configuration

Environment Variables

Variable	Required	Description
`GEMINI_API_KEY`	For agent only	API key for LLM-powered research agent
`ADIS_ALLOW_EXEC`	For agent only	Set to `1` to enable code execution sandbox

Optional Dependencies

pip install -e ".[ui]"          # Streamlit dashboard
pip install -e ".[agent]"       # Autonomous research agent
pip install -e ".[imbalanced]"  # SMOTE oversampling
pip install -e ".[all]"         # Everything
pip install -e ".[dev]"         # pytest + ruff

Streamlit Dashboard

A visual frontend is included for interactive exploration:

pip install -e ".[ui]"
streamlit run app.py

Development

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest tests/ -v

# Lint
ruff check adis/ tests/

Project Structure

adis/
├── __init__.py              # Public API: ADISPipeline + all run_* functions
├── schemas.py               # Pydantic data contracts
├── pipeline.py              # Pipeline orchestrator
├── agent.py                 # Autonomous research agent (experimental)
├── ingestion.py             # CSV loading, type detection, validation
├── cleaning.py              # Imputation, dedup, outliers, coercion
├── eda.py                   # Distributions, correlations, imbalance
├── feature_engineering.py   # Transforms, binning, encoding, datetime
├── feature_selection.py     # Variance, correlation, mutual information
├── model_recommendation.py  # Problem type detection, model ranking
├── benchmarking.py          # Multi-model training + evaluation
└── critic.py                # AI vulnerability detection

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.4

May 9, 2026

0.1.3

May 9, 2026

0.1.2

May 9, 2026

0.1.1

May 9, 2026

This version

0.1.0

May 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

adis_autoresearch-0.1.0.tar.gz (42.7 kB view details)

Uploaded May 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

adis_autoresearch-0.1.0-py3-none-any.whl (42.6 kB view details)

Uploaded May 9, 2026 Python 3

File details

Details for the file adis_autoresearch-0.1.0.tar.gz.

File metadata

Download URL: adis_autoresearch-0.1.0.tar.gz
Upload date: May 9, 2026
Size: 42.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for adis_autoresearch-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`dbcc142faeeddece8b793b512939584691665ccbe98e825b34f4e847d2341eac`
MD5	`1e41200fe655007d8d35a9eee1d3bf4c`
BLAKE2b-256	`41846d62cfe441c6efeaf8c6ea0eb497982a7b63027ff9b27d7e9e92b2c65a28`

See more details on using hashes here.

File details

Details for the file adis_autoresearch-0.1.0-py3-none-any.whl.

File metadata

Download URL: adis_autoresearch-0.1.0-py3-none-any.whl
Upload date: May 9, 2026
Size: 42.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for adis_autoresearch-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dbdec0addb0b5ae6f41a55d61260e8b9b9cad8d9bf6ccd1a0dfb155e8f0c3f1c`
MD5	`ad549b8b89403a19d4fefe99dc0057b0`
BLAKE2b-256	`e1c8f03d8c96910b9981ffb849c8ab4557183027d1abd605b129af9592b951e4`

See more details on using hashes here.

adis-autoresearch 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ADIS — Automated Data Intelligence System

Quick Start

Install

Basic Usage (3 lines)

Use Individual Modules

Use the Autonomous Agent (Experimental)

What Makes ADIS Different

Pipeline Stages

AI Critic — Vulnerability Detection

Configuration

Environment Variables

Optional Dependencies

Streamlit Dashboard

Development

Project Structure

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes