Skip to main content

Automate the process of visualization

Project description

Visualizer:

Visualizer is a Python package that automates the process of visualization and facilitates the plotting of any individual relationship between multiple-columns.

Visualizer package allows you to do 2 types of plotting:

  1. Visualize by an individual column:

    • Count Plot.
    • Pie Plot.
    • Histogram plot.
    • KDE plot.
    • WordCloud plot.
    • Histogram for high cardinality columns.
    • Line plot with index.
    • Point plot with index.
    • Clustered-bar Plot.
    • Bubble plot.
    • Scatter plot.
    • Density plot.
    • Box plot.
    • Violin plot.
    • Ridge plot.
    • Parallel plot.
    • Radar plot.
  2. Visualize by a relationship (multiple-columns):

    • Uni-vairate Target.
    • Uni-variate Categorical (Cat).
    • Uni-variate Numerical (Num).
    • Bi-variate Num with Index.
    • Bi-variate Cat with Index.
    • Bi-variate Num with Num.
    • Bi-variate Num with Cat.
    • Bi-variate Cat with Cat.
    • Bi-variate Cat with Target.
    • Bi-variate Num with Target.
    • Multi-variate Nums with Cat.

Installation:

pip install -U visualizer

Usage:

  1. To use the first type Individual Plotting, all the methods starts with create_, and you can use them as follows:
# Import the library
from visualizer import Visualize

# Create a count plot
Visualizer.create_count_plot(df=df, cat_col="bin_3", annot=True)

count_plot

To see more types of individual plotting, you can see the docs.

  1. To use the second type Automatic Visualization, all the methods starts with visualize_, and you can them as follows:
# Import the library
from visualizer import Visualizer

autoVis = Visualizer(df=df,                    # df: (dataframe)
                    num_cols=num_cols,         # num_cols: (list) of numerical columns.
                    cat_cols=cat_cols,         # cat_cols: (list) of categorical columns.
                    target_col=target_col,     # target_col: (string) your target column.
                    ignore_cols=ignore_cols,   # ignore_cols: (list) of columns to ignore.
                    problem_type='classification') # problem_type: (string) ['classification', 'regression']

# Visualize all the relationships between the selected columns,
# whether it's uni-variate, Bi-variate, or even multi-variate.
# This methods saves the generated figures into folder named "visualizer"
# into the current directory.
autoVis.visualize_all()

After running this command, the output in your notebook will tell you the progress, and when it's finished, it will be like this:

progress

And your current directory will have a new folder called "visualizer", which contains all the relationships between the columns.

So you can go to each particular relationship "folder", and see the related figures, does it contain any pattern?

image-20200314105541089

To know more, you can see the docs.

Further Ideas/Developments:

The following ideas are under construction and it will be added soon in upcoming versions:

  1. Visualize Sparse Columns, to see if they have a pattern.

  2. Visualize NaN/Infinite/Large numeric values across the whole dataframe, to see the pattern of the whole dataframe.

  3. Visualize Text columns.

  4. Add the functionality to arrange the structure of the folders to be by columns, so each column has all the relationships for a specific column.

Contribute:

If you've found a bug or something that you would like to improve, don't hesitate to create an issue and create a pull request.

License:

MIT License.

Authors:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

visualizer-0.0.8.tar.gz (4.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

visualizer-0.0.8-py3-none-any.whl (10.2 kB view details)

Uploaded Python 3

File details

Details for the file visualizer-0.0.8.tar.gz.

File metadata

  • Download URL: visualizer-0.0.8.tar.gz
  • Upload date:
  • Size: 4.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.7

File hashes

Hashes for visualizer-0.0.8.tar.gz
Algorithm Hash digest
SHA256 25a4ab04ec93dcf184df15badbdced96c9642cd901211663f5c20b3e9cb13c11
MD5 54cabf1e9a9f336c9d6909d483c44eee
BLAKE2b-256 bbfce229326986a2431f277b9e8876330337fc48ee8b8c48951b4960851ff389

See more details on using hashes here.

File details

Details for the file visualizer-0.0.8-py3-none-any.whl.

File metadata

  • Download URL: visualizer-0.0.8-py3-none-any.whl
  • Upload date:
  • Size: 10.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.43.0 CPython/3.6.7

File hashes

Hashes for visualizer-0.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 4d224ffe58eeb907c32829d842d3fcc5bd83f88656b255f48354eab19351d156
MD5 3c61e9230c916977670ab2b4cd17eae9
BLAKE2b-256 1b059100beee6b0fb5c0a78ea879afaaa5cdb46dcda42050fb5d7ad345274ccb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page