A comprehensive toolkit for exploratory data analysis with advanced visualization and statistical analysis
Project description
InsightfulPy
A comprehensive Python toolkit for exploratory data analysis with advanced visualization and statistical analysis capabilities.
Overview
InsightfulPy simplifies the process of exploring and understanding your data through intuitive functions for statistical analysis, data quality assessment, and professional visualization. Whether you're a data scientist, analyst, or researcher, this package provides the tools you need for thorough data exploration.
Key Features
- Statistical Analysis: Comprehensive statistics, distribution analysis, and normality testing
- Data Quality Assessment: Missing value detection, outlier identification, and data type validation
- Professional Visualization: Box plots, distribution plots, correlation analysis, and categorical charts
- Dataset Comparison: Multi-dataset analysis and column linking capabilities
- Batch Processing: Handle large datasets with intelligent batching for visualizations
- Easy Integration: Works seamlessly with pandas DataFrames
Installation
pip install insightfulpy
Quick Start
import pandas as pd
import insightfulpy as ipy
# Load your data
df = pd.read_csv('your_data.csv')
# Basic data exploration
ipy.columns_info('My Dataset', df)
ipy.num_summary(df)
ipy.cat_summary(df)
# Data quality checks
ipy.missing_inf_values(df)
ipy.detect_outliers(df)
# Visualization
ipy.show_missing(df)
ipy.plot_boxplots(df)
ipy.kde_batches(df, batch_num=1)
Core Functions
Basic Analysis
num_summary(df)- Statistical summary of numerical columnscat_summary(df)- Analysis of categorical columnscolumns_info(title, df)- Dataset structure overviewmissing_inf_values(df)- Missing and infinite value detectiondetect_outliers(df)- Outlier identification using IQR method
Visualization
show_missing(df)- Missing data pattern visualizationplot_boxplots(df)- Box plots for all numerical columnskde_batches(df)- Distribution plots organized in batchescat_bar_batches(df)- Bar charts for categorical datacat_pie_chart_batches(df)- Pie charts for categorical analysis
Advanced Analysis
grouped_summary(df, groupby)- Statistical analysis by groupscompare_df_columns()- Multi-dataset comparisoninterconnected_outliers()- Cross-column outlier analysisnum_vs_num_scatterplot_pair_batch()- Numerical correlation plotscat_vs_cat_pair_batch()- Categorical relationship heatmaps
Statistical Tools
calc_stats(series)- Comprehensive statistical calculationscalculate_skewness_kurtosis(df)- Distribution shape analysisiqr_trimmed_mean(data)- Robust mean calculationmad(data)- Mean absolute deviation
Help System
InsightfulPy includes a built-in help system for easy reference:
import insightfulpy as ipy
# Get help overview
ipy.help()
# List all functions
ipy.list_all()
# Quick start guide
ipy.quick_start()
# Usage examples
ipy.examples()
Requirements
- Python 3.8+
- pandas >= 1.3.0
- numpy >= 1.20.0
- matplotlib >= 3.3.0
- seaborn >= 0.11.0
- scipy >= 1.7.0
- Additional dependencies: researchpy, tableone, missingno, tabulate
Contributing
Contributions are welcome! Please read contributing guidelines and submit pull requests to GitHub repository.
Related links
-
For detailed documentation and examples, visit GitHub repository.
-
This project is licensed under the MIT License - see the LICENSE file for details.
-
If you encounter any issues or have questions, please open an issue on GitHub Issues page.
InsightfulPy makes data exploration intuitive and comprehensive. Start exploring your data with confidence today.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file insightfulpy-0.1.8.tar.gz.
File metadata
- Download URL: insightfulpy-0.1.8.tar.gz
- Upload date:
- Size: 38.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b58252c32681a92c8ee20830de98f2ae5ae9f4e95d93bfec8c3e568f919740be
|
|
| MD5 |
9896ba5ec5b84ca536f538e1f9109c43
|
|
| BLAKE2b-256 |
3a06b9dd618a12780404d1ad4f3ea17641f45ea7c2b46bec80cfde18b5abf109
|
File details
Details for the file insightfulpy-0.1.8-py3-none-any.whl.
File metadata
- Download URL: insightfulpy-0.1.8-py3-none-any.whl
- Upload date:
- Size: 20.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d3c9dbb6e3253e07f1f305446a4b2c7071b35d5657339262fe56606bfdf7e321
|
|
| MD5 |
6aab1be4d77cf64db3fd58de2f63abf1
|
|
| BLAKE2b-256 |
09dfc184f88a68590f59873392614ae9a9b5136d06480619b75a398496b4270d
|