Skip to main content

An automated and dynamic exploratory data analysis (EDA) library for streamlined data insights using Large Language Model.

Project description

AutoXEDA - Automated Exploratory Data Analysis (EDA)

AutoXEDA Banner

PyPI Version Python Versions License LinkedIn GitHub

🚀 Introduction

AutoXEDA is an advanced Python library designed for automated exploratory data analysis (EDA). It streamlines the data analysis process by providing detailed insights, business intelligence summaries, and AI-powered suggestions. Whether you are a data scientist, analyst, or business professional, AutoXEDA helps you make data-driven decisions with ease.

✨ Features

Automated Statistical Analysis - Get key descriptive statistics with one function call.
Business Insights - Generate structured insights tailored for decision-making.
AI-Powered EDA - Leverages LLM-based models (via API key) for intelligent data summaries.
Customizable Detail Levels - Choose from basic, intermediate, or detailed reports.
Data Cleaning & Missing Values Handling - Detects and suggests fixes for data inconsistencies.
Correlation & Distribution Analysis - Understand relationships between variables.
Seamless Pandas Integration - Works with pandas DataFrames effortlessly.


📌 Installation

AutoXEDA is available on PyPI and can be installed easily using pip:

pip install autoxeda

🔥 Quick Start

Here's a quick example to get started with AutoXEDA:

import pandas as pd
from autoxeda.core import autoeda

# Create a sample dataset
data = {"x": [10, 20, 30, 40, 50], "y": [5, 15, 25, 35, 45]}
df = pd.DataFrame(data)

# Run AutoXEDA
result = autoeda(df, analysis_type="business", api_key=None, detail_level='basic')

# Print the output
print(result)

📊 Example Output

{
    "status": "success",
    "summary": {
        "total_rows": 1000,
        "total_columns": 5,
        "missing_values": 0,
        "correlation_matrix": {...},
        "business_insights": "The data shows a positive trend..."
    }
}

🛠️ Advanced Usage

1️⃣ Using AI-Powered Analysis

If you want to leverage AI-powered insights, provide an API key:

result = autoeda(df, analysis_type="business", api_key="your-api-key", temperature=0.5)

2️⃣ Custom Detail Levels

result = autoeda(df, analysis_type="business", detail_level='detailed')

3️⃣ Handling Large Datasets

If your dataset is large, you can optimize performance by sampling:

result = autoeda(df.sample(500), analysis_type="business", detail_level='basic')

Hyperparameters

AutoXEDA allows customization through hyperparameters:

Parameter Type Default Description
data data Required Input data (DataFrame, CSV, or SQL query)
analysis_type str 'business' Type of analysis ('business' or 'prediction')
api_key str None API key (Your API key from Groq)
max_retries int 2 Number of retries for failed actions
columns int None (all columns) Subset of columns to analyze
detail_level str 'basic' Level of detail ('basic', 'advanced' or 'intermediate')
temperature float 1.0 LLM creativity level (0.0-1.0)

📝 Contributing

We welcome contributions from the community! To contribute:

  1. Fork the repository.
  2. Clone it: git clone https://github.com/Jahanzeb-git/autoxeda.git
  3. Create a new branch: git checkout -b feature-branch
  4. Make your changes and commit: git commit -m "Add new feature"
  5. Push the changes: git push origin feature-branch
  6. Submit a pull request with a clear description of your contribution.

📄 License

This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.


🌎 Connect with Me


If you like this project, don't forget to star it on GitHub!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoxeda-0.1.1.tar.gz (27.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoxeda-0.1.1-py3-none-any.whl (25.6 kB view details)

Uploaded Python 3

File details

Details for the file autoxeda-0.1.1.tar.gz.

File metadata

  • Download URL: autoxeda-0.1.1.tar.gz
  • Upload date:
  • Size: 27.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for autoxeda-0.1.1.tar.gz
Algorithm Hash digest
SHA256 aaf3fae70daf9911e92c5973721e85ba703180f852d41364f5abc7f0ab150146
MD5 012d4d704fb7bfb4fb5a3113698e6242
BLAKE2b-256 e263786be1f78d06a7894f50771207bbea16fb5750db396ef22c21191754bb1c

See more details on using hashes here.

File details

Details for the file autoxeda-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: autoxeda-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 25.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for autoxeda-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bb7b4510855446713dacc245e8336b388664661982a30909c5d774bb5a06c0ee
MD5 ce81a2d08830c5062e549be84061c176
BLAKE2b-256 b42d5619965faa4cae72922abb525d09a8e76acd15e819163b4075dd3a0342e1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page