Complete ML automation platform with AutoML, Feature Engineering, AI Insights, and Interactive Dashboards.

🎨 EssentiaX - Next-Generation Data Analysis Library

Smart EDA, Cleaning, and Visualization with AI-Powered Insights

Python 3.7+ | License: MIT

🚀 What Makes EssentiaX Special?

EssentiaX is not just another data analysis library. It's a next-generation toolkit that combines:

  • 🤖 AI-Powered Variable Selection - Let AI choose the best variables to visualize
  • 🎨 Stunning Interactive Visualizations - Beautiful Plotly charts with insights
  • 🧠 Smart Insights Generation - Automatic interpretation of every chart
  • 🧹 Intelligent Data Cleaning - One-function ML-ready preprocessing
  • 📊 Professional EDA Reports - HTML reports that impress stakeholders
  • 💡 ML Model Recommendations - Get model suggestions based on your data

🎯 Quick Start

pip install essentiax

from essentiax import smart_read, smart_viz, smart_clean, problem_card

# 1. Load data with beautiful console output
df = smart_read("your_data.csv")

# 2. AI-powered visualization with insights
smart_viz(df, mode="auto", interactive=True)

# 3. Get ML insights and model recommendations
problem_card(df, target="your_target_column")

# 4. Clean data for ML in one line
clean_df = smart_clean(df)

🎨 Smart Visualization Engine

🤖 Automatic Mode (AI Selection)

Let AI choose the best variables and create stunning visualizations:

smart_viz(
    df=df,
    mode="auto",           # AI selects best variables
    target="target_col",   # Optional target variable
    max_plots=8,          # Control number of plots
    interactive=True      # Beautiful interactive charts
)

What you get:

  • 📊 Smart Variable Selection - AI picks the most informative variables
  • 🔥 Interactive Correlation Heatmaps - Hover for detailed insights
  • 📈 Distribution Analysis - With statistical interpretations
  • 🎯 Multi-variable Relationships - Scatter plot matrices
  • 💡 AI-Generated Insights - Automatic interpretation of every chart
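How automatic mode ranks variables is not described here. As a rough illustration of one common heuristic, the sketch below scores numeric columns by their absolute Pearson correlation with the target and keeps the strongest few. The names `pearson` and `rank_by_target_correlation` are hypothetical and are not part of the EssentiaX API.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no linear signal
    return cov / sqrt(var_x * var_y)

def rank_by_target_correlation(columns, target, max_plots=8):
    """Return up to max_plots column names, strongest |correlation| first."""
    scores = {name: abs(pearson(values, target)) for name, values in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)[:max_plots]

# Toy data, invented for illustration only
columns = {
    "age": [25, 32, 47, 51, 62],
    "noise": [3, 1, 4, 1, 5],
    "constant": [7, 7, 7, 7, 7],
}
salary = [30, 42, 61, 70, 88]
ranked = rank_by_target_correlation(columns, salary, max_plots=2)
print(ranked)
```

Linear correlation misses non-linear relationships and ignores categorical columns, so a real selector would likely combine several signals.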

👤 Manual Mode (User Selection)

Choose specific variables you want to analyze:

smart_viz(
    df=df,
    mode="manual",
    columns=["age", "salary", "department"],  # Your chosen variables
    target="promotion",
    interactive=True
)

🎨 Features That Make It GOATED

1. AI-Powered Insights 🧠

Every chart comes with automatic interpretation:

  • Statistical significance analysis
  • Pattern recognition
  • Outlier detection
  • Correlation explanations
  • Feature engineering suggestions
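To make "correlation explanations" concrete, here is a minimal sketch of turning a correlation coefficient into a one-line insight. The function `describe_correlation` and its cut-offs (0.7, 0.4, 0.2) are assumptions chosen for illustration; they are not taken from EssentiaX.

```python
def describe_correlation(name_a, name_b, r):
    """Turn a correlation coefficient into a one-line plain-text insight."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("r must be between -1 and 1")
    strength = abs(r)
    # Cut-offs below are illustrative conventions, not EssentiaX's thresholds.
    if strength >= 0.7:
        label = "strong"
    elif strength >= 0.4:
        label = "moderate"
    elif strength >= 0.2:
        label = "weak"
    else:
        return f"{name_a} and {name_b} show little linear relationship (r={r:.2f})."
    direction = "positive" if r > 0 else "negative"
    return f"{name_a} and {name_b} have a {label} {direction} correlation (r={r:.2f})."

print(describe_correlation("age", "salary", 0.82))
print(describe_correlation("age", "tenure", -0.05))
```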

2. Interactive Visualizations ⚡

  • Plotly-powered interactive charts
  • Hover for detailed information
  • Zoom, pan, and explore your data
  • Professional styling that impresses

3. Beautiful Console UI 🎨

  • Rich console output with colors and formatting
  • Progress bars and spinners
  • Organized panels and tables
  • Professional presentation

4. Smart Chart Selection 🎯

AI automatically chooses the best chart type:

  • Distribution plots for continuous variables
  • Box plots for outlier detection
  • Correlation heatmaps for relationships
  • Categorical analysis for discrete variables
  • Scatter matrices for multi-variable analysis
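The mapping above can be approximated with simple rules on value type and cardinality. The sketch below is one possible rule set, not the library's actual logic; `suggest_chart` and the `max_categories` threshold are hypothetical.

```python
def suggest_chart(values, max_categories=10):
    """Suggest a chart type for one column using simple type/cardinality rules."""
    present = [v for v in values if v is not None]
    if not present:
        return "none"
    # bool is a subclass of int, so exclude it from the numeric check
    numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
    )
    distinct = len(set(present))
    if numeric and distinct > max_categories:
        return "histogram"        # continuous variable: show its distribution
    if distinct <= max_categories:
        return "bar chart"        # discrete variable: show category counts
    return "top-N bar chart"      # high-cardinality text: show only frequent values

print(suggest_chart([0.5 * i for i in range(50)]))
print(suggest_chart(["sales", "hr", "sales", None]))
```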

🧹 Smart Data Cleaning

Transform messy data into ML-ready datasets:

clean_df = smart_clean(
    df,
    missing_strategy="auto",    # Smart missing value handling
    outlier_strategy="iqr",     # Intelligent outlier removal
    scale_numeric=True,         # Automatic scaling
    encode_categorical=True,    # Smart encoding
    verbose=True               # Beautiful progress output
)
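For readers unfamiliar with the terms, the standard-library sketch below shows what median imputation and IQR-based outlier filtering usually mean. It is an assumption about the general technique, not a description of what `smart_clean` does internally; `impute_median` and `iqr_bounds` are hypothetical helpers, and `statistics.quantiles` needs Python 3.8 or later.

```python
from statistics import median, quantiles

def impute_median(values):
    """Replace None entries with the median of the observed values."""
    observed = [v for v in values if v is not None]
    fill = median(observed)
    return [fill if v is None else v for v in values]

def iqr_bounds(values, k=1.5):
    """Return (low, high) fences at k * IQR beyond the first and third quartiles."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr

# Toy column with one missing value and one extreme value
raw = [10, 12, None, 11, 13, 12, 300]
filled = impute_median(raw)
low, high = iqr_bounds(filled)
kept = [v for v in filled if low <= v <= high]
print(kept)
```

Dropping rows is only one way to treat outliers; capping values at the fences is a common alternative when rows are too valuable to lose.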

📊 Problem Card & Model Recommendations

Get instant ML insights:

problem_card(df, target="your_target")

Provides:

  • 🎯 Problem Type Detection (Classification/Regression/NLP)
  • 🤖 Model Recommendations (Baseline + Advanced)
  • ⚖️ Class Imbalance Analysis
  • 🔍 Data Quality Score
  • 💡 Actionable Insights
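As a rough idea of how problem type and class imbalance can be estimated from a target column, consider the sketch below. It covers only classification versus regression (not NLP), and `detect_problem_type`, `imbalance_ratio`, and the `max_classes` threshold are hypothetical, not EssentiaX functions.

```python
from collections import Counter

def detect_problem_type(target, max_classes=20):
    """Guess 'classification' or 'regression' from the target values."""
    distinct = set(target)
    all_numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool) for v in target
    )
    # Many distinct numeric values suggest a continuous target
    if all_numeric and len(distinct) > max_classes:
        return "regression"
    return "classification"

def imbalance_ratio(target):
    """Ratio of the most common class count to the least common class count."""
    counts = Counter(target)
    return max(counts.values()) / min(counts.values())

labels = ["yes"] * 90 + ["no"] * 10
print(detect_problem_type(labels), imbalance_ratio(labels))
```

A ratio far above 1 is a hint to consider stratified splits, class weights, or resampling before training.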

📈 Professional EDA Reports

Generate stunning HTML reports:

from essentiax import smart_eda_pro

smart_eda_pro(
    df, 
    target="target_column",
    report_path="my_analysis.html"
)
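The contents of the generated report are not specified here. Purely to illustrate the idea of an HTML summary, the sketch below renders per-column row counts, missing counts, and means with the standard library. `summary_html` is a hypothetical helper and does not reflect the layout of `smart_eda_pro` output.

```python
from html import escape
from statistics import mean

def summary_html(columns, title="EDA summary"):
    """Render per-column row count, missing count, and mean as a small HTML page."""
    rows = []
    for name, values in columns.items():
        observed = [v for v in values if v is not None]
        missing = len(values) - len(observed)
        avg = f"{mean(observed):.2f}" if observed else "n/a"
        rows.append(
            f"<tr><td>{escape(name)}</td><td>{len(values)}</td>"
            f"<td>{missing}</td><td>{avg}</td></tr>"
        )
    header = "<tr><th>column</th><th>rows</th><th>missing</th><th>mean</th></tr>"
    return (
        f"<html><head><title>{escape(title)}</title></head><body>"
        f"<h1>{escape(title)}</h1><table>{header}{''.join(rows)}</table>"
        "</body></html>"
    )

# Toy numeric columns, invented for illustration
page = summary_html({"age": [25, None, 35], "income": [40.0, 50.0, 60.0]})
print(len(page))
```

The returned string can be written to disk with `pathlib.Path("my_analysis.html").write_text(page)`.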

🎯 Real-World Examples

Example 1: Sales Data Analysis

from essentiax import smart_read, smart_viz, problem_card

# Load sales data
df = smart_read("sales_data.csv")

# AI-powered visualization
smart_viz(df, mode="auto", target="revenue")

# Get insights and recommendations
problem_card(df, target="revenue")

Example 2: Customer Segmentation

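from essentiax import smart_read, smart_viz

# Load customer data (the file name is a placeholder)
customer_df = smart_read("customers.csv")

# Manual analysis of specific features
smart_viz(
    df=customer_df,
    mode="manual",
    columns=["age", "income", "spending_score", "loyalty_years"],
    interactive=True
)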

Example 3: ML Pipeline

from essentiax import smart_read, problem_card, smart_clean

# Complete ML preprocessing pipeline
df = smart_read("dataset.csv")
problem_card(df, target="target")
clean_df = smart_clean(df)
# Now ready for model training!

🆚 Why Choose EssentiaX?

Feature                 EssentiaX   pandas-profiling   sweetviz
AI Variable Selection   ✅          ❌                 ❌
Interactive Charts      ✅          ❌                 ❌
Real-time Insights      ✅          ❌                 ❌
ML Recommendations      ✅          ❌                 ❌
Beautiful Console UI    ✅          ❌                 ❌
One-line Cleaning       ✅          ❌                 ❌

📦 Installation

# Basic installation
pip install essentiax

# With all dependencies (the quotes stop shells such as zsh from expanding the brackets)
pip install "essentiax[complete]"

🛠️ Requirements

  • Python 3.7+
  • pandas >= 1.0
  • numpy >= 1.20
  • matplotlib >= 3.0
  • seaborn >= 0.11
  • plotly >= 5.0
  • rich >= 10.0
  • scikit-learn >= 1.0
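To check which of these packages are already present in an environment, a small standard-library sketch follows. It uses `importlib.metadata`, which requires Python 3.8 or later (one minor version above the stated minimum), and `installed_versions` is a hypothetical helper, not part of EssentiaX.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as listed in the requirements above
REQUIRED = ["pandas", "numpy", "matplotlib", "seaborn", "plotly", "rich", "scikit-learn"]

def installed_versions(packages):
    """Map each package name to its installed version, or None if missing."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found

for name, ver in installed_versions(REQUIRED).items():
    print(f"{name}: {ver or 'not installed'}")
```

The sketch only reports versions; comparing them against the minimums is left to pip, which enforces them at install time.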

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

  • Built with ❤️ by Shubham Wagh
  • Powered by the amazing Python data science ecosystem
  • Special thanks to the Plotly and Rich communities

⭐ Star this repo if EssentiaX helps you build better ML models!

🔗 GitHub | 📧 Contact | 🐦 Twitter
