Skip to main content

Automatically Visualize any dataset, any size with a single line of code

Project description

AutoViz

banner

Pepy Downloads Pepy Downloads per week Pepy Downloads per month standard-readme compliant Python Versions PyPI Version PyPI License

Automatically Visualize any dataset, any size with a single line of code.

AutoViz performs automatic visualization of any dataset with one line. Give any input file (CSV, txt or json) and AutoViz will visualize it.

Table of Contents

Install

Prerequsites

To clone AutoViz, it's better to create a new environment, and install the required dependencies:

To install from PyPi:

conda create -n <your_env_name> python=3.7 anaconda
conda activate <your_env_name> # ON WINDOWS: `source activate <your_env_name>`
pip install autoviz

To install from source:

cd <AutoViz_Destination>
git clone git@github.com:AutoViML/AutoViz.git
# or download and unzip https://github.com/AutoViML/AutoViz/archive/master.zip
conda create -n <your_env_name> python=3.7 anaconda
conda activate <your_env_name> # ON WINDOWS: `source activate <your_env_name>`
cd AutoViz
pip install -r requirements.txt

Usage

Read this Medium article to know how to use AutoViz.

In the AutoViz directory, open a Jupyter Notebook and use this line to instantiate the library

from autoviz.AutoViz_Class import AutoViz_Class

AV = AutoViz_Class()

Load a dataset (any CSV or text file) into a Pandas dataframe or give the name of the path and filename you want to visualize. If you don't have a filename, you can simply assign the filename argument "" (empty string).

Call AutoViz using the filename (or dataframe) along with the separator and the name of the target variable in the input. AutoViz will do the rest. You will see charts and plots on your screen.

filename = ""
sep = ","
dft = AV.AutoViz(
    filename,
    sep,
    target,
    df,
    header=0,
    verbose=0,
    lowess=False,
    chart_format="svg",
    max_rows_analyzed=150000,
    max_cols_analyzed=30,
)

This is the main calling program in AV. It will call all the load, display and save programs that are currently outside AV. This program will draw scatter and other plots for the input dataset and then call the correct variable name with the add_plots function and send in the chart created by that plotting program, for example, scatter. You have to make sure that add_plots function has the exact name of the variable defined in the Class AV. If not, this will give an error.

Notes:

  • AutoViz will visualize any sized file using a statistically valid sample.
  • COMMA is assumed as default separator in file. But you can change it.
  • Assumes first row as header in file but you can change it.

API

Arguments

  • max_rows_analyzed - limits the max number of rows that is used to display charts
  • max_cols_analyzed - limits the number of continuous vars that can be analyzed
  • verbose
    • if 0, does not print any messages and goes into silent mode. This is the default.
    • if 1, print messages on the terminal and also display charts on terminal.
    • if 2, print messages but will not display charts, it will simply save them.

Maintainers

Contributing

See the contributing file!

PRs accepted.

License

Apache License, Version 2.0

DISCLAIMER

This project is not an official Google project. It is not supported by Google and Google specifically disclaims all warranties as to its quality, merchantability, or fitness for a particular purpose.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autoviz-0.0.74.tar.gz (30.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

autoviz-0.0.74-py3-none-any.whl (33.7 kB view details)

Uploaded Python 3

File details

Details for the file autoviz-0.0.74.tar.gz.

File metadata

  • Download URL: autoviz-0.0.74.tar.gz
  • Upload date:
  • Size: 30.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for autoviz-0.0.74.tar.gz
Algorithm Hash digest
SHA256 1e35242a1b82b4bfeef82810a7c4cf7f3d1376480dced2d2c553133471a61c86
MD5 ce81b0d05a68bf5c9abdaa57fc85a174
BLAKE2b-256 c084d8400fcf3f71f7fa7829efdc4da519fc1825b10aa61782ed370e86cfeb7e

See more details on using hashes here.

File details

Details for the file autoviz-0.0.74-py3-none-any.whl.

File metadata

  • Download URL: autoviz-0.0.74-py3-none-any.whl
  • Upload date:
  • Size: 33.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/45.2.0.post20200210 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for autoviz-0.0.74-py3-none-any.whl
Algorithm Hash digest
SHA256 02c83e49bd1111aad4e5ec013803db1b0f8a3f52882cefac88b1e3559e238f9d
MD5 4e2de1cf726cde98875fca911b0dbff4
BLAKE2b-256 359b16ad6755ee1bf5241b484bf07825f7f9c2002214e16a0a1dcde470146c44

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page