Skip to main content

Python toolkit for exploratory data analysis with visualization and statistical functions

Project description

InsightfulPy

Python toolkit for exploratory data analysis with visualization and statistical functions.

PyPI version Python Version Version License Status

Overview

InsightfulPy simplifies exploratory data analysis through statistical summaries, data quality checks, and visualizations. Built with a modular architecture and constants-driven design, it works seamlessly in Jupyter notebooks, IPython, and terminal environments.

Key Features:

  • Statistical summaries for numerical and categorical data
  • Data quality checks (missing values, outliers, mixed types)
  • Batch visualization functions for large datasets
  • Individual column analysis with plots
  • Multi-dataset comparison tools
  • Environment detection (Jupyter and terminal)
  • Type hints and test coverage

Installation

PyPI

pip install insightfulpy

Source

git clone https://github.com/dhaneshbb/insightfulpy.git
cd insightfulpy
pip install .

Requirements: Python 3.8 or higher

Quick Start

import pandas as pd
import insightfulpy as ipy

# Load data
df = pd.read_csv('data.csv')

# Get help
ipy.help()         # Function overview
ipy.quick_start()  # Step-by-step guide

# Basic analysis
ipy.columns_info('Dataset', df)  # Structure
ipy.num_summary(df)               # Numerical stats
ipy.cat_summary(df)               # Categorical stats

# Data quality
ipy.missing_inf_values(df)  # Missing values
ipy.detect_outliers(df)     # Outliers

# Visualizations
ipy.show_missing(df)             # Missing patterns
ipy.kde_batches(df, batch_num=1) # Distributions

For complete workflow examples, see User Guide and Examples.

Visualization Gallery

View example visualizations in the Gallery Documentation

Function Categories

# Helper Functions
# Quick utilities for navigation, exploration, and guidance
help(), list_all(), quick_start(), examples()

# Basic Analysis
# Core analytical operations on categorical & numerical data
analyze_data(), cat_summary(), num_summary(), columns_info(),
grouped_summary(), detect_outliers(), missing_inf_values()

# Visualization
# Visual insights with distribution & categorical plots
show_missing(), plot_boxplots(), kde_batches(),
box_plot_batches(), qq_plot_batches(),
cat_bar_batches(), cat_pie_chart_batches()

# Advanced Visualization
# Multi-variable and relational data visualization tools
num_vs_num_scatterplot_pair_batch(),
cat_vs_cat_pair_batch(),
num_vs_cat_box_violin_pair_batch()

# Statistical Functions
# Deeper statistical calculations and data profiling metrics
calc_stats(), calculate_skewness_kurtosis(),
iqr_trimmed_mean(), mad()

# Individual Analysis
# Focused analysis and plotting for specific column types
num_analysis_and_plot(), cat_analyze_and_plot()

# Dataset Comparison
# Compare datasets, detect key overlaps, and highlight deltas
compare_df_columns(), display_key_columns(),
interconnected_outliers(), linked_key(),
comp_cat_analysis(), comp_num_analysis()

See API Reference for detailed documentation.

Documentation

User Documentation:

Developer Documentation:

Complete Index:

Contributing

Contributions are welcome. See CONTRIBUTING.md for complete guidelines including development setup, testing requirements, code quality standards, and pull request process.

For development workflow, see Developer Guide. For dependencies, see pyproject.toml.

License

MIT License - see LICENSE file.

Third-party components are listed in NOTICE and THIRD_PARTY_LICENSES.txt.

Links


Version: 0.2.0 | Status: Beta | Python: 3.8-3.12

Copyright 2025 dhaneshbb | License: MIT | Homepage: https://github.com/dhaneshbb/insightfulpy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

insightfulpy-0.2.0.tar.gz (123.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

insightfulpy-0.2.0-py3-none-any.whl (121.1 kB view details)

Uploaded Python 3

File details

Details for the file insightfulpy-0.2.0.tar.gz.

File metadata

  • Download URL: insightfulpy-0.2.0.tar.gz
  • Upload date:
  • Size: 123.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for insightfulpy-0.2.0.tar.gz
Algorithm Hash digest
SHA256 d396645f6c17a8e87bfbc14d30ee7a66dbd506a32a9d05baedc5f9f84531abe6
MD5 8a0ea44d83b536834c34c422e485a362
BLAKE2b-256 a8936e450f8889bd5f6dbd3de28af7225e9cb60d5a4f0daeb955f2745859665a

See more details on using hashes here.

File details

Details for the file insightfulpy-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: insightfulpy-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 121.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for insightfulpy-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 70c466f1825e7cc48abb4ce941c8ba8e5813efc68beb68092e4fd1b2918ed84f
MD5 611fdccb46707df10cf84a9a1439830f
BLAKE2b-256 f9d293729d518bb0a43feb683510c3803105f6b88a12c75ea46c5d17b4c5e8cc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page