Skip to main content

Automate the process of visualization

Project description

Visualizer:

Visualizer is a Python package that automates the process of visualization and facilitates the plotting of any individual relationship between multiple-columns.

Visualizer package allows you to do 2 types of plotting:

  1. Visualize by an individual column:

    • Count Plot.
    • Pie Plot.
    • Histogram plot.
    • KDE plot.
    • WordCloud plot.
    • Histogram for high cardinality columns.
    • Line plot with index.
    • Point plot with index.
    • Clustered-bar Plot.
    • Bubble plot.
    • Scatter plot.
    • Density plot.
    • Box plot.
    • Violin plot.
    • Ridge plot.
    • Parallel plot.
    • Radar plot.
  2. Visualize by a relationship (multiple-columns):

    • Uni-vairate Target.
    • Uni-variate Categorical (Cat).
    • Uni-variate Numerical (Num).
    • Bi-variate Num with Index.
    • Bi-variate Cat with Index.
    • Bi-variate Num with Num.
    • Bi-variate Num with Cat.
    • Bi-variate Cat with Cat.
    • Bi-variate Cat with Target.
    • Bi-variate Num with Target.
    • Multi-variate Nums with Cat.

Installation:

pip install -U visualizer

Usage:

  1. To use the first type Individual Plotting, all the methods starts with create_, and you can use them as follows:
# Import the library
from visualizer import Visualize

# Create a count plot
Visualizer.create_count_plot(df=df, cat_col="bin_3", annot=True)

count_plot

To see more types of individual plotting, you can see the docs.

  1. To use the second type Automatic Visualization, all the methods starts with visualize_, and you can them as follows:
# Import the library
from visualizer import Visualizer

autoVis = Visualizer(df=df,                    # df: (dataframe)
                    num_cols=num_cols,         # num_cols: (list) of numerical columns.
                    cat_cols=cat_cols,         # cat_cols: (list) of categorical columns.
                    target_col=target_col,     # target_col: (string) your target column.
                    ignore_cols=ignore_cols,   # ignore_cols: (list) of columns to ignore.
                    problem_type='classification') # problem_type: (string) ['classification', 'regression']

# Visualize all the relationships between the selected columns,
# whether it's uni-variate, Bi-variate, or even multi-variate.
# This methods saves the generated figures into folder named "visualizer"
# into the current directory.
autoVis.visualize_all()

After running this command, the output in your notebook will tell you the progress, and when it's finished, it will be like this:

progress

And your current directory will have a new folder called "visualizer", which contains all the relationships between the columns.

So you can go to each particular relationship "folder", and see the related figures, does it contain any pattern?

image-20200314105541089

To know more, you can see the docs.

Further Ideas/Developments:

The following ideas are under construction and it will be added soon in upcoming versions:

  1. Visualize Sparse Columns, to see if they have a pattern.

  2. Visualize NaN/Infinite/Large numeric values across the whole dataframe, to see the pattern of the whole dataframe.

  3. Visualize Text columns.

  4. Add the functionality to arrange the structure of the folders to be by columns, so each column has all the relationships for a specific column.

Contribute:

If you've found a bug or something that you would like to improve, don't hesitate to create an issue and create a pull request.

License:

MIT License.

Authors:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

visualizer-0.0.9.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

visualizer-0.0.9-py3-none-any.whl (10.2 kB view details)

Uploaded Python 3

File details

Details for the file visualizer-0.0.9.tar.gz.

File metadata

  • Download URL: visualizer-0.0.9.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.7

File hashes

Hashes for visualizer-0.0.9.tar.gz
Algorithm Hash digest
SHA256 298dd24549e349659c8c76b455fb1f8bfc9423021d5c2843af98fb264111234e
MD5 4dc8204857841f2b0ffa2ceaaefca68b
BLAKE2b-256 f0acac97835ded310f30dae72ccdb1fc6a2a8681e884bcec06763e902b7ed93d

See more details on using hashes here.

File details

Details for the file visualizer-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: visualizer-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.7

File hashes

Hashes for visualizer-0.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 cd4e20401a8c5129e29f6b351f591475cb7110c2754d3c644a4b437fd95ff095
MD5 f25c0b7e66c7473ae5cb7b58c3f5639d
BLAKE2b-256 cd9d59193e88a8e2b3b896c4bffd9ea6f942199148e8d904fde2379fb1fd6a9e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page