Skip to main content

No project description provided

Project description

Skimpy Extended

A light weight tool for creating summary statistics from dataframes.

This is a replica of the original Skimpy package, but for polars!

skimpy is a light weight tool that provides summary statistics about variables in data frames within the console or your interactive Python window. Think of it as a super-charged version of df.describe(). You can find the documentation here.

MVP TODOs

  • Info tables
  • Support for numerical data
  • Support for categorical data
  • Support for timeseries data
  • Support for boolean data
  • Remove zero-variance columns
  • Identifies rare categories and groups
  • Find outliers based on Inter Quartile Range
  • Detects possibly mixed data types columns
  • Detects high cardinality features
  • Detects high correlated features
  • Detects duplicated rows (may be an option to pass when call the Skimpy class)
  • Indicate skewed ditributions
  • Detects imbalanced classes
  • Detects feature leakage

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skimpy_ext-0.1.0.tar.gz (4.4 kB view hashes)

Uploaded Source

Built Distribution

skimpy_ext-0.1.0-py3-none-any.whl (5.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page