No project description provided
Project description
Skimpy Extended
A light weight tool for creating summary statistics from dataframes.
This is a replica of the original Skimpy package, but for polars!
skimpy is a light weight tool that provides
summary statistics about variables in data frames within the console or your interactive Python window.
Think of it as a super-charged version of df.describe()
.
You can find the documentation here.
MVP TODOs
- Info tables
- Support for numerical data
- Support for categorical data
- Support for timeseries data
- Support for boolean data
- Remove zero-variance columns
- Identifies rare categories and groups
- Find outliers based on Inter Quartile Range
- Detects possibly mixed data types columns
- Detects high cardinality features
- Detects high correlated features
- Detects duplicated rows (may be an option to pass when call the Skimpy class)
- Indicate skewed ditributions
- Detects imbalanced classes
- Detects feature leakage
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
skimpy_ext-0.1.0.dev1.tar.gz
(4.9 kB
view hashes)
Built Distribution
Close
Hashes for skimpy_ext-0.1.0.dev1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 593072d95c9d860cb8e1c65b777235468f43016012e572215ccaac8081b2009b |
|
MD5 | b01597969e40e5200698187e803be129 |
|
BLAKE2b-256 | 570d19c840eacfcf7e4dc77e8aecfe05d503fb6342ff83a5b83da4cfab8f6a7a |