No project description provided
Project description
Skimpy Extended
A light weight tool for creating summary statistics from dataframes.
This is a replica of the original Skimpy package, but for polars!
skimpy is a light weight tool that provides
summary statistics about variables in data frames within the console or your interactive Python window.
Think of it as a super-charged version of df.describe()
.
You can find the documentation here.
MVP TODOs
- Info tables
- Support for numerical data
- Support for categorical data
- Support for timeseries data
- Support for boolean data
- Remove zero-variance columns
- Identifies rare categories and groups
- Find outliers based on Inter Quartile Range
- Detects possibly mixed data types columns
- Detects high cardinality features
- Detects high correlated features
- Detects duplicated rows (may be an option to pass when call the Skimpy class)
- Indicate skewed ditributions
- Detects imbalanced classes
- Detects feature leakage
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
skimpy_ext-0.1.0.tar.gz
(4.4 kB
view hashes)
Built Distribution
Close
Hashes for skimpy_ext-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 628aee86f48e3c222fc6734ee8d2548c8a531b1f5d73396b457a9112be223854 |
|
MD5 | f7283073f7c635a41be155e1eab3fb2e |
|
BLAKE2b-256 | 5731a36fc63db7af50c3414641b3e19b2ec19536fe6f32613e4fbd02982c16a8 |