Pairplotr is a Python library used to graph combinations of numerical and categorical data in a pair plot
<img src="https://github.com/JaggedParadigm/pairplotr/blob/master/pairplot_demo.png" width="500" />
# How
## Installation
For now, simply clone the repository and link to its location in your code.
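One way to "link to the location" is to put the cloned directory on Python's import path before importing. The snippet below is a minimal sketch, not part of pairplotr itself: the helper name `add_clone_to_path` and the clone location `~/src/pairplotr` are hypothetical, and the module name in the final comment is an assumption based on the repository name.

```python
import sys
from pathlib import Path


def add_clone_to_path(clone_dir):
    """Put a cloned repository directory at the front of sys.path.

    Returns the resolved absolute path. Calling it twice with the same
    directory does not add a duplicate entry.
    """
    resolved = str(Path(clone_dir).expanduser().resolve())
    if resolved not in sys.path:
        sys.path.insert(0, resolved)
    return resolved


# Hypothetical clone location; replace with wherever you cloned the repository.
repo = add_clone_to_path("~/src/pairplotr")

# After this, an import such as `import pairplotr` should resolve against the
# clone (assuming the module is named after the repository).
```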
## Use
See the [demo notebook](https://nbviewer.jupyter.org/github/JaggedParadigm/pairplotr/blob/master/pairplotr_demo.ipynb) for usage examples.
# What
Pairplotr is a Python library that graphs combinations of numerical and categorical data in a pair plot,
similar to Seaborn's `pairplot()`, given a cleaned pandas DataFrame with a mixture of categorical and numerical
values.
Here are the plot formats for each Row feature|Column feature combination, in on- and off-diagonal cells:
- On-diagonal:
  - Categorical|Categorical:
    - Value counts of feature values, ordered by ascending value count and colored by feature value
  - Numerical|Numerical:
    - Histogram of the feature with no coloring (or colored by a desired label)
- Off-diagonal:
  - Categorical|Categorical:
    - Stacked value counts of row feature values, colored by column feature values
  - Categorical|Numerical:
    - Histograms of the column feature for each row feature value, colored by row feature value
  - Numerical|Numerical:
    - Scatter plot of row feature values vs. column feature values with no coloring (or colored by a desired label)
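The rules above amount to a lookup on the dtypes of the row and column features. The sketch below illustrates that dispatch using only pandas. It is not pairplotr's actual implementation: the function names `cell_plot_kind` and `plot_kind_grid`, the returned labels, and the example columns are all hypothetical. It also assumes that any non-numeric column counts as categorical, and it labels the Numerical|Categorical case as unspecified because the list above does not describe it.

```python
import pandas as pd
from pandas.api.types import is_numeric_dtype


def cell_plot_kind(df, row, col):
    """Return a label for the plot type of the (row, col) cell."""
    row_is_num = is_numeric_dtype(df[row])
    col_is_num = is_numeric_dtype(df[col])

    # On-diagonal cells compare a feature with itself.
    if row == col:
        return "histogram" if row_is_num else "value counts"

    # Off-diagonal cells depend on both feature types.
    if row_is_num and col_is_num:
        return "scatter"
    if not row_is_num and not col_is_num:
        return "stacked value counts"
    if not row_is_num and col_is_num:
        return "histograms per category"
    # Numerical|Categorical is not covered by the list above.
    return "unspecified"


def plot_kind_grid(df):
    """Build a feature-by-feature table of plot-type labels."""
    features = list(df.columns)
    return pd.DataFrame(
        [[cell_plot_kind(df, r, c) for c in features] for r in features],
        index=features,
        columns=features,
    )


# Made-up example data: one categorical and two numerical features.
df = pd.DataFrame(
    {
        "species": ["a", "b", "a", "c"],
        "length": [1.0, 2.5, 3.1, 0.7],
        "width": [0.2, 0.4, 0.3, 0.1],
    }
)
grid = plot_kind_grid(df)
print(grid)
```

A real pair plot would go on to draw each cell with a plotting library; the table of labels only shows which branch each cell would take.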
# Why
The available tools I've found don't seem able to combine numerical and categorical feature data
quickly and easily, and I wanted to customize the comparisons to use the plot types I find most useful.