Skip to main content

An automated and dynamic exploratory data analysis (EDA) library for streamlined data insights using Large Language Model.

Project description

AutoXEDA - Automated Exploratory Data Analysis (EDA)

PyPI Version Python Versions License LinkedIn GitHub

🚀 Introduction

AutoXEDA is an advanced Python library designed for automated exploratory data analysis (EDA). It streamlines the data analysis process by providing detailed insights, business intelligence summaries, and AI-powered suggestions. Whether you are a data scientist, analyst, or business professional, AutoXEDA helps you make data-driven decisions with ease.

✨ Features

Automated Statistical Analysis - Get key descriptive statistics with one function call.
Business Insights - Generate structured insights tailored for decision-making.
AI-Powered EDA - Leverages LLM-based models (via API key) for intelligent data summaries.
Customizable Detail Levels - Choose from basic, intermediate, or detailed reports.
Data Cleaning & Missing Values Handling - Detects and suggests fixes for data inconsistencies.
Correlation & Distribution Analysis - Understand relationships between variables.
Seamless Pandas Integration - Works with pandas DataFrames effortlessly.


📌 Installation

AutoXEDA is available on PyPI and can be installed easily using pip:

pip install autoxeda

🔥 Quick Start

Here's a quick example to get started with AutoXEDA:

import pandas as pd
from autoxeda.core import autoeda

# Create a sample dataset
data = {"x": [10, 20, 30, 40, 50], "y": [5, 15, 25, 35, 45]}
df = pd.DataFrame(data)

# Run AutoXEDA
result = autoeda(df, analysis_type="business", api_key=None, detail_level='basic')

# Print the output
print(result)

📊 Example Output

{
    "status": "success",
    "summary": {
        "total_rows": 1000,
        "total_columns": 5,
        "missing_values": 0,
        "correlation_matrix": {...},
        "business_insights": "The data shows a positive trend..."
    }
}

🛠️ Advanced Usage

1️⃣ Using AI-Powered Analysis

If you want to leverage AI-powered insights, provide an API key:

result = autoeda(df, analysis_type="business", api_key="your-api-key", temperature=0.5)

2️⃣ Custom Detail Levels

result = autoeda(df, analysis_type="business", detail_level='detailed')

3️⃣ Handling Large Datasets

If your dataset is large, you can optimize performance by sampling:

result = autoeda(df.sample(500), analysis_type="business", detail_level='basic')

Hyperparameters

AutoXEDA allows customization through hyperparameters:

Parameter Type Default Description
data data Required Input data (DataFrame, CSV, or SQL query)
analysis_type str 'business' Type of analysis ('business' or 'prediction')
api_key str None API key (Your API key from Groq)
max_retries int 2 Number of retries for failed actions
columns int None (all columns) Subset of columns to analyze
detail_level str 'basic' Level of detail ('basic', 'advanced' or 'intermediate')
temperature float 1.0 LLM creativity level (0.0-1.0)

📝 Contributing

We welcome contributions from the community! To contribute:

  1. Fork the repository.
  2. Clone it: git clone https://github.com/Jahanzeb-git/autoxeda.git
  3. Create a new branch: git checkout -b feature-branch
  4. Make your changes and commit: git commit -m "Add new feature"
  5. Push the changes: git push origin feature-branch
  6. Submit a pull request with a clear description of your contribution.

📄 License

This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.


🌎 Connect with Me


If you like this project, don't forget to star it on GitHub!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoxeda-0.1.3.tar.gz (28.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoxeda-0.1.3-py3-none-any.whl (26.4 kB view details)

Uploaded Python 3

File details

Details for the file autoxeda-0.1.3.tar.gz.

File metadata

  • Download URL: autoxeda-0.1.3.tar.gz
  • Upload date:
  • Size: 28.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for autoxeda-0.1.3.tar.gz
Algorithm Hash digest
SHA256 f1be18d80a9db04ef2b224dd98a7380d8477d13dc76d8a5a154b72eac06c2d99
MD5 2e68299366aaef08c64440cb05f73683
BLAKE2b-256 550d1b6b09cbd51995ea23a650f6c0e7f74d04ede0c3ef29e2ee49157334c5a5

See more details on using hashes here.

File details

Details for the file autoxeda-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: autoxeda-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 26.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for autoxeda-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 384f0bee56f6307cf8cdaa137a31dce78066e53f370515b6481368776d961983
MD5 901fbd2d7674529e9c4d6b6f3ad94e3c
BLAKE2b-256 5fe9b98461b99c869cdf7ff3793f376a374db6c2b73f1d42b790b9987844e4e8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page