Skip to main content

A comprehensive toolkit for exploratory data analysis with advanced visualization and statistical analysis

Project description

InsightfulPy

A comprehensive Python toolkit for exploratory data analysis with advanced visualization and statistical analysis capabilities.

Overview

InsightfulPy simplifies the process of exploring and understanding your data through intuitive functions for statistical analysis, data quality assessment, and professional visualization. Whether you're a data scientist, analyst, or researcher, this package provides the tools you need for thorough data exploration.

Key Features

  • Statistical Analysis: Comprehensive statistics, distribution analysis, and normality testing
  • Data Quality Assessment: Missing value detection, outlier identification, and data type validation
  • Professional Visualization: Box plots, distribution plots, correlation analysis, and categorical charts
  • Dataset Comparison: Multi-dataset analysis and column linking capabilities
  • Batch Processing: Handle large datasets with intelligent batching for visualizations
  • Easy Integration: Works seamlessly with pandas DataFrames

Installation

pip install insightfulpy

Quick Start

import pandas as pd
import insightfulpy as ipy

# Load your data
df = pd.read_csv('your_data.csv')

# Basic data exploration
ipy.columns_info('My Dataset', df)
ipy.num_summary(df)
ipy.cat_summary(df)

# Data quality checks
ipy.missing_inf_values(df)
ipy.detect_outliers(df)

# Visualization
ipy.show_missing(df)
ipy.plot_boxplots(df)
ipy.kde_batches(df, batch_num=1)

Core Functions

Basic Analysis

  • num_summary(df) - Statistical summary of numerical columns
  • cat_summary(df) - Analysis of categorical columns
  • columns_info(title, df) - Dataset structure overview
  • missing_inf_values(df) - Missing and infinite value detection
  • detect_outliers(df) - Outlier identification using IQR method

Visualization

  • show_missing(df) - Missing data pattern visualization
  • plot_boxplots(df) - Box plots for all numerical columns
  • kde_batches(df) - Distribution plots organized in batches
  • cat_bar_batches(df) - Bar charts for categorical data
  • cat_pie_chart_batches(df) - Pie charts for categorical analysis

Advanced Analysis

  • grouped_summary(df, groupby) - Statistical analysis by groups
  • compare_df_columns() - Multi-dataset comparison
  • interconnected_outliers() - Cross-column outlier analysis
  • num_vs_num_scatterplot_pair_batch() - Numerical correlation plots
  • cat_vs_cat_pair_batch() - Categorical relationship heatmaps

Statistical Tools

  • calc_stats(series) - Comprehensive statistical calculations
  • calculate_skewness_kurtosis(df) - Distribution shape analysis
  • iqr_trimmed_mean(data) - Robust mean calculation
  • mad(data) - Mean absolute deviation

Help System

InsightfulPy includes a built-in help system for easy reference:

import insightfulpy as ipy

# Get help overview
ipy.help()

# List all functions
ipy.list_all()

# Quick start guide
ipy.quick_start()

# Usage examples
ipy.examples()

Requirements

  • Python 3.8+
  • pandas >= 1.3.0
  • numpy >= 1.20.0
  • matplotlib >= 3.3.0
  • seaborn >= 0.11.0
  • scipy >= 1.7.0
  • Additional dependencies: researchpy, tableone, missingno, tabulate

Contributing

Contributions are welcome! Please read contributing guidelines and submit pull requests to GitHub repository.

Related links

  • For detailed documentation and examples, visit GitHub repository.

  • This project is licensed under the MIT License - see the LICENSE file for details.

  • If you encounter any issues or have questions, please open an issue on GitHub Issues page.

InsightfulPy makes data exploration intuitive and comprehensive. Start exploring your data with confidence today.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

insightfulpy-0.1.8.tar.gz (38.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

insightfulpy-0.1.8-py3-none-any.whl (20.6 kB view details)

Uploaded Python 3

File details

Details for the file insightfulpy-0.1.8.tar.gz.

File metadata

  • Download URL: insightfulpy-0.1.8.tar.gz
  • Upload date:
  • Size: 38.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.5

File hashes

Hashes for insightfulpy-0.1.8.tar.gz
Algorithm Hash digest
SHA256 b58252c32681a92c8ee20830de98f2ae5ae9f4e95d93bfec8c3e568f919740be
MD5 9896ba5ec5b84ca536f538e1f9109c43
BLAKE2b-256 3a06b9dd618a12780404d1ad4f3ea17641f45ea7c2b46bec80cfde18b5abf109

See more details on using hashes here.

File details

Details for the file insightfulpy-0.1.8-py3-none-any.whl.

File metadata

  • Download URL: insightfulpy-0.1.8-py3-none-any.whl
  • Upload date:
  • Size: 20.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.5

File hashes

Hashes for insightfulpy-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 d3c9dbb6e3253e07f1f305446a4b2c7071b35d5657339262fe56606bfdf7e321
MD5 6aab1be4d77cf64db3fd58de2f63abf1
BLAKE2b-256 09dfc184f88a68590f59873392614ae9a9b5136d06480619b75a398496b4270d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page