An automated and dynamic exploratory data analysis (EDA) library for streamlined data insights using Large Language Model.
Project description
AutoXEDA - Automated Exploratory Data Analysis (EDA)
🚀 Introduction
AutoXEDA is an advanced Python library designed for automated exploratory data analysis (EDA). It streamlines the data analysis process by providing detailed insights, business intelligence summaries, and AI-powered suggestions. Whether you are a data scientist, analyst, or business professional, AutoXEDA helps you make data-driven decisions with ease.
✨ Features
✅ Automated Statistical Analysis - Get key descriptive statistics with one function call.
✅ Business Insights - Generate structured insights tailored for decision-making.
✅ AI-Powered EDA - Leverages LLM-based models (via API key) for intelligent data summaries.
✅ Customizable Detail Levels - Choose from basic, intermediate, or detailed reports.
✅ Data Cleaning & Missing Values Handling - Detects and suggests fixes for data inconsistencies.
✅ Correlation & Distribution Analysis - Understand relationships between variables.
✅ Seamless Pandas Integration - Works with pandas DataFrames effortlessly.
📌 Installation
AutoXEDA is available on PyPI and can be installed easily using pip:
pip install autoxeda
🔥 Quick Start
Here's a quick example to get started with AutoXEDA:
import pandas as pd
from autoxeda.core import autoeda
# Create a sample dataset
data = {"x": [10, 20, 30, 40, 50], "y": [5, 15, 25, 35, 45]}
df = pd.DataFrame(data)
# Run AutoXEDA
result = autoeda(df, analysis_type="business", api_key=None, detail_level='basic')
# Print the output
print(result)
📊 Example Output
{
"status": "success",
"summary": {
"total_rows": 1000,
"total_columns": 5,
"missing_values": 0,
"correlation_matrix": {...},
"business_insights": "The data shows a positive trend..."
}
}
🛠️ Advanced Usage
1️⃣ Using AI-Powered Analysis
If you want to leverage AI-powered insights, provide an API key:
result = autoeda(df, analysis_type="business", api_key="your-api-key", temperature=0.5)
2️⃣ Custom Detail Levels
result = autoeda(df, analysis_type="business", detail_level='detailed')
3️⃣ Handling Large Datasets
If your dataset is large, you can optimize performance by sampling:
result = autoeda(df.sample(500), analysis_type="business", detail_level='basic')
Hyperparameters
AutoXEDA allows customization through hyperparameters:
| Parameter | Type | Default | Description |
|---|---|---|---|
data |
data | Required | Input data (DataFrame, CSV, or SQL query) |
analysis_type |
str | 'business' | Type of analysis ('business' or 'prediction') |
api_key |
str | None | API key (Your API key from Groq) |
max_retries |
int | 2 | Number of retries for failed actions |
columns |
int | None (all columns) | Subset of columns to analyze |
detail_level |
str | 'basic' | Level of detail ('basic', 'advanced' or 'intermediate') |
temperature |
float | 1.0 | LLM creativity level (0.0-1.0) |
📝 Contributing
We welcome contributions from the community! To contribute:
- Fork the repository.
- Clone it:
git clone https://github.com/Jahanzeb-git/autoxeda.git - Create a new branch:
git checkout -b feature-branch - Make your changes and commit:
git commit -m "Add new feature" - Push the changes:
git push origin feature-branch - Submit a pull request with a clear description of your contribution.
📄 License
This project is licensed under the GNU General Public License v3.0. See the LICENSE file for details.
🌎 Connect with Me
- 📂 GitHub: Jahanzeb-git
- 🌐 Website: jahanzebahmed.com
- 💼 LinkedIn: Jahanzeb Ahmed
⭐ If you like this project, don't forget to star it on GitHub! ⭐
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file autoxeda-0.1.2.tar.gz.
File metadata
- Download URL: autoxeda-0.1.2.tar.gz
- Upload date:
- Size: 28.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
35161a1b9109157c367eba08f9164a7cff67d322b60b3f34c3192c9549b487ad
|
|
| MD5 |
208ef4f8b42f5c2f1b88ebe0429df015
|
|
| BLAKE2b-256 |
8c2c7a5cf61116ab01d22b1ce3e9951441d11447ac26434d6c8498f12ba6b897
|
File details
Details for the file autoxeda-0.1.2-py3-none-any.whl.
File metadata
- Download URL: autoxeda-0.1.2-py3-none-any.whl
- Upload date:
- Size: 26.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0bbb2019d99836117eab7a314194166e0db196adb800912592315c528d6c7fdf
|
|
| MD5 |
1dcf661ca24484bd593390be3a5ca35b
|
|
| BLAKE2b-256 |
17d3391a575b90a6b5df43a1d58963499129e5e309e03de47d8ff65df39bc166
|