Skip to main content

Automate the process of visualization

Project description

Visualizer:

Visualizer is a Python package that automates the process of visualization and facilitates the plotting of any individual relationship between multiple-columns.

Visualizer package allows you to do 2 types of plotting:

  1. Visualize by an individual column:

    • Count Plot.
    • Pie Plot.
    • Histogram plot.
    • KDE plot.
    • WordCloud plot.
    • Histogram for high cardinality columns.
    • Line plot with index.
    • Point plot with index.
    • Clustered-bar Plot.
    • Bubble plot.
    • Scatter plot.
    • Density plot.
    • Box plot.
    • Violin plot.
    • Ridge plot.
    • Parallel plot.
    • Radar plot.
  2. Visualize by a relationship (multiple-columns):

    • Uni-vairate Target.
    • Uni-variate Categorical (Cat).
    • Uni-variate Numerical (Num).
    • Bi-variate Num with Index.
    • Bi-variate Cat with Index.
    • Bi-variate Num with Num.
    • Bi-variate Num with Cat.
    • Bi-variate Cat with Cat.
    • Bi-variate Cat with Target.
    • Bi-variate Num with Target.
    • Multi-variate Nums with Cat.

Installation:

pip install -U visualizer

Usage:

  1. To use the first type Individual Plotting, all the methods starts with create_, and you can use them as follows:
# Import the library
from visualizer import Visualize

# Create a count plot
Visualizer.create_count_plot(df=df, cat_col="bin_3", annot=True)

count_plot

To see more types of individual plotting, you can see the docs.

  1. To use the second type Automatic Visualization, all the methods starts with visualize_, and you can them as follows:
# Import the library
from visualizer import Visualizer

autoVis = Visualizer(df=df,                    # df: (dataframe)
                    num_cols=num_cols,         # num_cols: (list) of numerical columns.
                    cat_cols=cat_cols,         # cat_cols: (list) of categorical columns.
                    target_col=target_col,     # target_col: (string) your target column.
                    ignore_cols=ignore_cols,   # ignore_cols: (list) of columns to ignore.
                    problem_type='classification') # problem_type: (string) ['classification', 'regression']

# Visualize all the relationships between the selected columns,
# whether it's uni-variate, Bi-variate, or even multi-variate.
# This methods saves the generated figures into folder named "visualizer"
# into the current directory.
autoVis.visualize_all()

After running this command, the output in your notebook will tell you the progress, and when it's finished, it will be like this:

progress

And your current directory will have a new folder called "visualizer", which contains all the relationships between the columns.

So you can go to each particular relationship "folder", and see the related figures, does it contain any pattern?

image-20200314105541089

To know more, you can see the docs.

Further Ideas/Developments:

The following ideas are under construction and it will be added soon in upcoming versions:

  1. Visualize Sparse Columns, to see if they have a pattern.

  2. Visualize NaN/Infinite/Large numeric values across the whole dataframe, to see the pattern of the whole dataframe.

  3. Visualize Text columns.

  4. Add the functionality to arrange the structure of the folders to be by columns, so each column has all the relationships for a specific column.

Contribute:

If you've found a bug or something that you would like to improve, don't hesitate to create an issue and create a pull request.

License:

MIT License.

Authors:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

visualizer-0.0.10.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

visualizer-0.0.10-py3-none-any.whl (10.2 kB view details)

Uploaded Python 3

File details

Details for the file visualizer-0.0.10.tar.gz.

File metadata

  • Download URL: visualizer-0.0.10.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.7

File hashes

Hashes for visualizer-0.0.10.tar.gz
Algorithm Hash digest
SHA256 0ac364b8f03dd1e7afaac8ca7866df8d412c3db0bb75da921b892dd8f9ab4ece
MD5 14229e6578c7c9d0ef169030b9b392b2
BLAKE2b-256 6c75a4d541e6e1a627c1e2f59f6e4fbb5c3b7c0f02c23be827315c122ebf2678

See more details on using hashes here.

File details

Details for the file visualizer-0.0.10-py3-none-any.whl.

File metadata

  • Download URL: visualizer-0.0.10-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.7

File hashes

Hashes for visualizer-0.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 98b396d328b3defd02ccfb3d91d70e930a721161764c258dbab54552f3b3f293
MD5 a35a05d600526aba0ef92e8e1655203b
BLAKE2b-256 467e9f0752fc97be7df10f5806ef5999c296fd48cd83558d3898d6519e4f5b1e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page