Skip to main content

Complete ML automation platform with AutoML, Feature Engineering, AI Insights, and Interactive Dashboards.

Project description

๐ŸŽจ EssentiaX - Next-Generation Data Analysis Library

Smart EDA, Cleaning, and Visualization with AI-Powered Insights

Python 3.7+ License: MIT Version

๐Ÿš€ What Makes EssentiaX Special?

EssentiaX is not just another data analysis library. It's a next-generation toolkit that combines:

  • ๐Ÿค– AI-Powered Variable Selection - Let AI choose the best variables to visualize
  • ๐ŸŽจ Stunning Interactive Visualizations - Beautiful Plotly charts with insights
  • ๐Ÿง  Smart Insights Generation - Automatic interpretation of every chart
  • ๐Ÿงน Intelligent Data Cleaning - One-function ML-ready preprocessing
  • ๐Ÿ“Š Professional EDA Reports - HTML reports that impress stakeholders
  • ๐Ÿ’ก ML Model Recommendations - Get model suggestions based on your data

๐ŸŽฏ Quick Start

pip install essentiax
from essentiax import smart_read, smart_viz, smart_clean, problem_card

# 1. Load data with beautiful console output
df = smart_read("your_data.csv")

# 2. AI-powered visualization with insights
smart_viz(df, mode="auto", interactive=True)

# 3. Get ML insights and model recommendations
problem_card(df, target="your_target_column")

# 4. Clean data for ML in one line
clean_df = smart_clean(df)

๐ŸŽจ Smart Visualization Engine

๐Ÿค– Automatic Mode (AI Selection)

Let AI choose the best variables and create stunning visualizations:

smart_viz(
    df=df,
    mode="auto",           # AI selects best variables
    target="target_col",   # Optional target variable
    max_plots=8,          # Control number of plots
    interactive=True      # Beautiful interactive charts
)

What you get:

  • ๐Ÿ“Š Smart Variable Selection - AI picks the most informative variables
  • ๐Ÿ”ฅ Interactive Correlation Heatmaps - Hover for detailed insights
  • ๐Ÿ“ˆ Distribution Analysis - With statistical interpretations
  • ๐ŸŽฏ Multi-variable Relationships - Scatter plot matrices
  • ๐Ÿ’ก AI-Generated Insights - Automatic interpretation of every chart

๐Ÿ‘ค Manual Mode (User Selection)

Choose specific variables you want to analyze:

smart_viz(
    df=df,
    mode="manual",
    columns=["age", "salary", "department"],  # Your chosen variables
    target="promotion",
    interactive=True
)

๐ŸŽจ Features That Make It GOATED

1. AI-Powered Insights ๐Ÿง 

Every chart comes with automatic interpretation:

  • Statistical significance analysis
  • Pattern recognition
  • Outlier detection
  • Correlation explanations
  • Feature engineering suggestions

2. Interactive Visualizations โšก

  • Plotly-powered interactive charts
  • Hover for detailed information
  • Zoom, pan, and explore your data
  • Professional styling that impresses

3. Beautiful Console UI ๐ŸŽจ

  • Rich console output with colors and formatting
  • Progress bars and spinners
  • Organized panels and tables
  • Professional presentation

4. Smart Chart Selection ๐ŸŽฏ

AI automatically chooses the best chart type:

  • Distribution plots for continuous variables
  • Box plots for outlier detection
  • Correlation heatmaps for relationships
  • Categorical analysis for discrete variables
  • Scatter matrices for multi-variable analysis

๐ŸŽจ NEW: Advanced 3D & Interactive Visualizations

Transform boring charts into stunning visualizations!

from essentiax.visuals import advanced_viz

# Auto mode - AI selects best advanced visualizations
advanced_viz(df, viz_type='auto')

๐Ÿš€ 10 Advanced Visualization Types

Visualization Description Perfect For
๐ŸŽจ 3D Scatter + Clustering 3D scatter with K-means clustering Multi-dimensional patterns
๐ŸŒŠ 3D Surface Plot Beautiful 3D surfaces Density visualization
โ˜€๏ธ Sunburst Chart Hierarchical circular viz Categorical hierarchy
๐ŸŒŠ Sankey Diagram Flow visualization Process flows
๐ŸŽป Advanced Violin Plots Distribution + statistics Feature comparison
๐Ÿ“Š Parallel Coordinates Multi-dimensional data High-dim exploration
๐Ÿ—บ๏ธ Treemap Hierarchical rectangles Category proportions
๐ŸŽฌ Animated Scatter Time-series animations Temporal analysis
๐ŸŽญ Advanced Correlation Interactive correlation Feature relationships
๐Ÿ”๏ธ Ridge Plot Overlapping distributions Category comparison

โœจ Key Features

  • โœ… Fully Interactive - Hover, zoom, pan, rotate
  • โœ… 3D Capabilities - True 3D with rotation
  • โœ… Auto Clustering - K-means built-in
  • โœ… Production Ready - Beautiful aesthetics
  • โœ… One-Line Usage - Simple API

๐Ÿ“ Quick Examples

from essentiax.visuals import Advanced3DViz

engine = Advanced3DViz()

# 3D scatter with clustering
engine.plot_3d_scatter_clusters(df, n_clusters=3)

# 3D surface plot
engine.plot_3d_surface(df, x_col='feature1', y_col='feature2')

# Advanced violin plots
engine.plot_violin_advanced(df, columns=['f1', 'f2', 'f3'])

# Parallel coordinates
engine.plot_parallel_coordinates(df, color_col='target')

# Sunburst chart
engine.plot_sunburst(df, path_columns=['cat1', 'cat2'])

See ADVANCED_VIZ_GUIDE.md for complete documentation!

๐Ÿงน Smart Data Cleaning

Transform messy data into ML-ready datasets:

clean_df = smart_clean(
    df,
    missing_strategy="auto",    # Smart missing value handling
    outlier_strategy="iqr",     # Intelligent outlier removal
    scale_numeric=True,         # Automatic scaling
    encode_categorical=True,    # Smart encoding
    verbose=True               # Beautiful progress output
)

๐Ÿ“Š Problem Card & Model Recommendations

Get instant ML insights:

problem_card(df, target="your_target")

Provides:

  • ๐ŸŽฏ Problem Type Detection (Classification/Regression/NLP)
  • ๐Ÿค– Model Recommendations (Baseline + Advanced)
  • โš–๏ธ Class Imbalance Analysis
  • ๐Ÿ” Data Quality Score
  • ๐Ÿ’ก Actionable Insights

๐Ÿ“ˆ Professional EDA Reports

Generate stunning HTML reports:

from essentiax import smart_eda_pro

smart_eda_pro(
    df, 
    target="target_column",
    report_path="my_analysis.html"
)

๐ŸŽฏ Real-World Examples

Example 1: Sales Data Analysis

# Load sales data
df = smart_read("sales_data.csv")

# AI-powered visualization
smart_viz(df, mode="auto", target="revenue")

# Get insights and recommendations
problem_card(df, target="revenue")

Example 2: Customer Segmentation

# Manual analysis of specific features
smart_viz(
    df=customer_df,
    mode="manual", 
    columns=["age", "income", "spending_score", "loyalty_years"],
    interactive=True
)

Example 3: ML Pipeline

# Complete ML preprocessing pipeline
df = smart_read("dataset.csv")
problem_card(df, target="target")
clean_df = smart_clean(df)
# Now ready for model training!

๐Ÿ†š Why Choose EssentiaX?

Feature EssentiaX pandas-profiling sweetviz
AI Variable Selection โœ… โŒ โŒ
Interactive Charts โœ… โŒ โŒ
Real-time Insights โœ… โŒ โŒ
ML Recommendations โœ… โŒ โŒ
Beautiful Console UI โœ… โŒ โŒ
One-line Cleaning โœ… โŒ โŒ

๐Ÿ“ฆ Installation

# Basic installation
pip install essentiax

# With all dependencies
pip install essentiax[complete]

๐Ÿ› ๏ธ Requirements

  • Python 3.7+
  • pandas >= 1.0
  • numpy >= 1.20
  • matplotlib >= 3.0
  • seaborn >= 0.11
  • plotly >= 5.0
  • rich >= 10.0
  • scikit-learn >= 1.0

๐Ÿค Contributing

We welcome contributions! Please see our Contributing Guide for details.

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

๐Ÿ™ Acknowledgments

  • Built with โค๏ธ by Shubham Wagh
  • Powered by the amazing Python data science ecosystem
  • Special thanks to the Plotly and Rich communities

โญ Star this repo if EssentiaX helps you build better ML models!

๐Ÿ”— GitHub | ๐Ÿ“ง Contact | ๐Ÿฆ Twitter

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

essentiax-1.1.9.tar.gz (203.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

essentiax-1.1.9-py3-none-any.whl (233.4 kB view details)

Uploaded Python 3

File details

Details for the file essentiax-1.1.9.tar.gz.

File metadata

  • Download URL: essentiax-1.1.9.tar.gz
  • Upload date:
  • Size: 203.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for essentiax-1.1.9.tar.gz
Algorithm Hash digest
SHA256 65ef6cc3aa3de7b8e48e3f63b720e25f8a31e86c9266311288bdb93db368727a
MD5 b4ce2253168a1fa29c27df6a0219c138
BLAKE2b-256 e84eb84f525b2fc37a1b27b2484f03ec37c56352b1c6808a3ac951cff9caaed8

See more details on using hashes here.

File details

Details for the file essentiax-1.1.9-py3-none-any.whl.

File metadata

  • Download URL: essentiax-1.1.9-py3-none-any.whl
  • Upload date:
  • Size: 233.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.25

File hashes

Hashes for essentiax-1.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 4030bdf77f9e48e1c2d30049f5f5ac856f801b4e7c2e07a94cddeae1e10616e9
MD5 51bc0179ffe7e9aebc7e50867215522f
BLAKE2b-256 69197a8da49af1133ca7319158506217c49d22260a7c4dfe5d6252fbf9425097

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page