Skip to main content

An automated and dynamic exploratory data analysis (EDA) library for streamlined data insights using Large Language Model.

Project description

AutoXEDA - Automated Exploratory Data Analysis (EDA)

AutoXEDA Banner

PyPI Version Python Versions License LinkedIn GitHub

🚀 Introduction

AutoXEDA is an advanced Python library designed for automated exploratory data analysis (EDA). It streamlines the data analysis process by providing detailed insights, business intelligence summaries, and AI-powered suggestions. Whether you are a data scientist, analyst, or business professional, AutoXEDA helps you make data-driven decisions with ease.

✨ Features

Automated Statistical Analysis - Get key descriptive statistics with one function call.
Business Insights - Generate structured insights tailored for decision-making.
AI-Powered EDA - Leverages LLM-based models (via API key) for intelligent data summaries.
Customizable Detail Levels - Choose from basic, intermediate, or detailed reports.
Data Cleaning & Missing Values Handling - Detects and suggests fixes for data inconsistencies.
Correlation & Distribution Analysis - Understand relationships between variables.
Seamless Pandas Integration - Works with pandas DataFrames effortlessly.


📌 Installation

AutoXEDA is available on PyPI and can be installed easily using pip:

pip install autoxeda

🔥 Quick Start

Here's a quick example to get started with AutoXEDA:

import pandas as pd
from autoxeda.core import autoeda

# Create a sample dataset
data = {"x": [10, 20, 30, 40, 50], "y": [5, 15, 25, 35, 45]}
df = pd.DataFrame(data)

# Run AutoXEDA
result = autoeda(df, analysis_type="business", api_key=None, detail_level='basic')

# Print the output
print(result)

📊 Example Output

{
    "status": "success",
    "summary": {
        "total_rows": 1000,
        "total_columns": 5,
        "missing_values": 0,
        "correlation_matrix": {...},
        "business_insights": "The data shows a positive trend..."
    }
}

🛠️ Advanced Usage

1️⃣ Using AI-Powered Analysis

If you want to leverage AI-powered insights, provide an API key:

result = autoeda(df, analysis_type="business", api_key="your-api-key", temperature=0.5)

2️⃣ Custom Detail Levels

result = autoeda(df, analysis_type="business", detail_level='detailed')

3️⃣ Handling Large Datasets

If your dataset is large, you can optimize performance by sampling:

result = autoeda(df.sample(500), analysis_type="business", detail_level='basic')

Hyperparameters

AutoXEDA allows customization through hyperparameters:

Parameter Type Default Description
data data Required Input data (DataFrame, CSV, or SQL query)
analysis_type str 'business' Type of analysis ('business' or 'prediction')
api_key str None API key (Your API key from Groq)
max_retries int 2 Number of retries for failed actions
columns int None (all columns) Subset of columns to analyze
detail_level str 'basic' Level of detail ('basic', 'advanced' or 'intermediate')
temperature float 1.0 LLM creativity level (0.0-1.0)

📝 Contributing

We welcome contributions from the community! To contribute:

  1. Fork the repository.
  2. Clone it: git clone https://github.com/Jahanzeb-git/autoxeda.git
  3. Create a new branch: git checkout -b feature-branch
  4. Make your changes and commit: git commit -m "Add new feature"
  5. Push the changes: git push origin feature-branch
  6. Submit a pull request with a clear description of your contribution.

📄 License

This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.


🌎 Connect with Me


If you like this project, don't forget to star it on GitHub!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoxeda-0.1.2.tar.gz (28.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoxeda-0.1.2-py3-none-any.whl (26.4 kB view details)

Uploaded Python 3

File details

Details for the file autoxeda-0.1.2.tar.gz.

File metadata

  • Download URL: autoxeda-0.1.2.tar.gz
  • Upload date:
  • Size: 28.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for autoxeda-0.1.2.tar.gz
Algorithm Hash digest
SHA256 35161a1b9109157c367eba08f9164a7cff67d322b60b3f34c3192c9549b487ad
MD5 208ef4f8b42f5c2f1b88ebe0429df015
BLAKE2b-256 8c2c7a5cf61116ab01d22b1ce3e9951441d11447ac26434d6c8498f12ba6b897

See more details on using hashes here.

File details

Details for the file autoxeda-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: autoxeda-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 26.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for autoxeda-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 0bbb2019d99836117eab7a314194166e0db196adb800912592315c528d6c7fdf
MD5 1dcf661ca24484bd593390be3a5ca35b
BLAKE2b-256 17d3391a575b90a6b5df43a1d58963499129e5e309e03de47d8ff65df39bc166

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page