Skip to main content

Complete ML automation platform with AutoML, Feature Engineering, AI Insights, and Interactive Dashboards.

Project description

๐ŸŽจ EssentiaX - Next-Generation Data Analysis Library

Smart EDA, Cleaning, and Visualization with AI-Powered Insights

Python 3.7+ License: MIT Version

๐Ÿš€ What Makes EssentiaX Special?

EssentiaX is not just another data analysis library. It's a next-generation toolkit that combines:

  • ๐Ÿค– AI-Powered Variable Selection - Let AI choose the best variables to visualize
  • ๐ŸŽจ Stunning Interactive Visualizations - Beautiful Plotly charts with insights
  • ๐Ÿง  Smart Insights Generation - Automatic interpretation of every chart
  • ๐Ÿงน Intelligent Data Cleaning - One-function ML-ready preprocessing
  • ๐Ÿ“Š Professional EDA Reports - HTML reports that impress stakeholders
  • ๐Ÿ’ก ML Model Recommendations - Get model suggestions based on your data

๐ŸŽฏ Quick Start

pip install essentiax
from essentiax import smart_read, smart_viz, smart_clean, problem_card

# 1. Load data with beautiful console output
df = smart_read("your_data.csv")

# 2. AI-powered visualization with insights
smart_viz(df, mode="auto", interactive=True)

# 3. Get ML insights and model recommendations
problem_card(df, target="your_target_column")

# 4. Clean data for ML in one line
clean_df = smart_clean(df)

๐ŸŽจ Smart Visualization Engine

๐Ÿค– Automatic Mode (AI Selection)

Let AI choose the best variables and create stunning visualizations:

smart_viz(
    df=df,
    mode="auto",           # AI selects best variables
    target="target_col",   # Optional target variable
    max_plots=8,          # Control number of plots
    interactive=True      # Beautiful interactive charts
)

What you get:

  • ๐Ÿ“Š Smart Variable Selection - AI picks the most informative variables
  • ๐Ÿ”ฅ Interactive Correlation Heatmaps - Hover for detailed insights
  • ๐Ÿ“ˆ Distribution Analysis - With statistical interpretations
  • ๐ŸŽฏ Multi-variable Relationships - Scatter plot matrices
  • ๐Ÿ’ก AI-Generated Insights - Automatic interpretation of every chart

๐Ÿ‘ค Manual Mode (User Selection)

Choose specific variables you want to analyze:

smart_viz(
    df=df,
    mode="manual",
    columns=["age", "salary", "department"],  # Your chosen variables
    target="promotion",
    interactive=True
)

๐ŸŽจ Features That Make It GOATED

1. AI-Powered Insights ๐Ÿง 

Every chart comes with automatic interpretation:

  • Statistical significance analysis
  • Pattern recognition
  • Outlier detection
  • Correlation explanations
  • Feature engineering suggestions

2. Interactive Visualizations โšก

  • Plotly-powered interactive charts
  • Hover for detailed information
  • Zoom, pan, and explore your data
  • Professional styling that impresses

3. Beautiful Console UI ๐ŸŽจ

  • Rich console output with colors and formatting
  • Progress bars and spinners
  • Organized panels and tables
  • Professional presentation

4. Smart Chart Selection ๐ŸŽฏ

AI automatically chooses the best chart type:

  • Distribution plots for continuous variables
  • Box plots for outlier detection
  • Correlation heatmaps for relationships
  • Categorical analysis for discrete variables
  • Scatter matrices for multi-variable analysis

๐Ÿงน Smart Data Cleaning

Transform messy data into ML-ready datasets:

clean_df = smart_clean(
    df,
    missing_strategy="auto",    # Smart missing value handling
    outlier_strategy="iqr",     # Intelligent outlier removal
    scale_numeric=True,         # Automatic scaling
    encode_categorical=True,    # Smart encoding
    verbose=True               # Beautiful progress output
)

๐Ÿ“Š Problem Card & Model Recommendations

Get instant ML insights:

problem_card(df, target="your_target")

Provides:

  • ๐ŸŽฏ Problem Type Detection (Classification/Regression/NLP)
  • ๐Ÿค– Model Recommendations (Baseline + Advanced)
  • โš–๏ธ Class Imbalance Analysis
  • ๐Ÿ” Data Quality Score
  • ๐Ÿ’ก Actionable Insights

๐Ÿ“ˆ Professional EDA Reports

Generate stunning HTML reports:

from essentiax import smart_eda_pro

smart_eda_pro(
    df, 
    target="target_column",
    report_path="my_analysis.html"
)

๐ŸŽฏ Real-World Examples

Example 1: Sales Data Analysis

# Load sales data
df = smart_read("sales_data.csv")

# AI-powered visualization
smart_viz(df, mode="auto", target="revenue")

# Get insights and recommendations
problem_card(df, target="revenue")

Example 2: Customer Segmentation

# Manual analysis of specific features
smart_viz(
    df=customer_df,
    mode="manual", 
    columns=["age", "income", "spending_score", "loyalty_years"],
    interactive=True
)

Example 3: ML Pipeline

# Complete ML preprocessing pipeline
df = smart_read("dataset.csv")
problem_card(df, target="target")
clean_df = smart_clean(df)
# Now ready for model training!

๐Ÿ†š Why Choose EssentiaX?

Feature EssentiaX pandas-profiling sweetviz
AI Variable Selection โœ… โŒ โŒ
Interactive Charts โœ… โŒ โŒ
Real-time Insights โœ… โŒ โŒ
ML Recommendations โœ… โŒ โŒ
Beautiful Console UI โœ… โŒ โŒ
One-line Cleaning โœ… โŒ โŒ

๐Ÿ“ฆ Installation

# Basic installation
pip install essentiax

# With all dependencies
pip install essentiax[complete]

๐Ÿ› ๏ธ Requirements

  • Python 3.7+
  • pandas >= 1.0
  • numpy >= 1.20
  • matplotlib >= 3.0
  • seaborn >= 0.11
  • plotly >= 5.0
  • rich >= 10.0
  • scikit-learn >= 1.0

๐Ÿค Contributing

We welcome contributions! Please see our Contributing Guide for details.

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

๐Ÿ™ Acknowledgments

  • Built with โค๏ธ by Shubham Wagh
  • Powered by the amazing Python data science ecosystem
  • Special thanks to the Plotly and Rich communities

โญ Star this repo if EssentiaX helps you build better ML models!

๐Ÿ”— GitHub | ๐Ÿ“ง Contact | ๐Ÿฆ Twitter

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

essentiax-1.0.6.tar.gz (183.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

essentiax-1.0.6-py3-none-any.whl (211.0 kB view details)

Uploaded Python 3

File details

Details for the file essentiax-1.0.6.tar.gz.

File metadata

  • Download URL: essentiax-1.0.6.tar.gz
  • Upload date:
  • Size: 183.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for essentiax-1.0.6.tar.gz
Algorithm Hash digest
SHA256 e4d3ea05e6ade8784aeb13fd9946b8970faf93a61b13eb76eea7f330b98dc526
MD5 1732c0fc7ff28bd10fd266ac493d7c50
BLAKE2b-256 49702472846c9ccb1e920ab1dbbc739409b397af13a6086cf125458fc78bc8d6

See more details on using hashes here.

File details

Details for the file essentiax-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: essentiax-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 211.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for essentiax-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 743b69f99f6b2be8d7646421da23a9bb78b44d3156e39d92c937d8a0dfd12b1f
MD5 f00504049ad709153b32b150430c7234
BLAKE2b-256 a51764b9db5a18f923ec338618845f0eb0510ad5869254f9ccd46f0a51d8c4b0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page