A package to allow for basic visuals with pandas dataframes
Project description
easystyle
easystyle is a python package created to allow simple and quick data analysis without having to export to excel. easystyle reads in the data as a dataframe as highlights certain cells depending on what the user calls. The package is heavily dependent on the styling in pandas and pulls some basic code from the Stying pandas documentation. Functions such as identifying, by highlighting the cell or text, the maximum, the minimum, outliers, and NaN values are used in easystyle. These functions allow the user to take a quick look at their data in Jupyter Notebook. It also serves the purpose of showing the data cleaner than with a standard dataframe so that you can present the information to others without having to go to excel.
easystyle works great in Jupyter Notebook, but I am unsure of how well, or if it will work at all, in other environments
Installing easystyle
pip install easystyle
Importing easystyle
The best way to import easystyle is:
from easystyle import Style as es
This is since it makes it simpler to call functions when using the package. For the following examples we will assume this importing method.
Highlight Negative Values
def highlight_negative(df, col = 'red'):
In order to use highlight_negative, the user must provide the dataframe. The user does not need to provide a color, but may do so if they want a color other than red. To run the function:
df = es.highlight_negative(df, color)
Highlight Positive Values
def highlight_positive(df, col = 'green'):
This function follows similarly to the function above. To run the function:
df = es.highlight_positive(df, color)
Highlight Negative and Positive Values
def highlight_neg_pos(df, colNeg = 'red', colPos = 'green'):
Follows from above. To run the function:
df = es.highlight_neg_pos(df, color_neg, color_pos)
Highlight Maximum Value
def highlight_max(df, column_names, col='yellow'):
The function is fed an additional variable, the column names. column_names
is an array of the column names that you wish to find the maximum in. For example:
column_names = ['A', 'B', 'C']
df = es.highlight_max(df, column_names, color)
Highlight Minimum Value
def highlight_min(df, column_names, col='orange'):
Follows similarly to highlighting a maximum value. For example:
column_names = ['A', 'B', 'C']
df = es.highlight_min(df, column_names, color)
Highlight Outliers
def highlight_outlier(df, column_names, colLow='red', colUp='red'):
This function uses the IQR method in order to find outliers. Again, column_names
is a array of column names. For example:
df = es.highlight_outlier(df, column_names, colorLower, colorUpper)
Highlight NaN Values
def highlight_NaN(df, color = 'red'):
If any NaN values are detected, they are highlighted. For example:
df = es.highlight_NaN(df, color)
Give a Gradient
def highlight_gradient(df, color = 'green'):
This function uses the gradients provided by the Seaborn package. For example:
df = es.highlight_gradient(df, color)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.