Skip to main content

A set of functions and demos to make machine learning projects easier to understand through effective visualizations.

Project description

MLVisualizationTools

Tests Badge Python Version Badge License Badge

MLVisualizationTools is a python library to make machine learning more understandable through the use of effective visualizations.

We support graphing with matplotlib and plotly. We implicity support all major ML libraries, such as tensorflow and sklearn.

You can use the built in apps to quickly anaylyze your existing models, or build custom projects using the modular sets of functions.

Installation

pip install MLVisualizationTools

Depending on your use case, tensorflow, plotly and matplotlib might need to be installed.

pip install tensorflow pip install plotly pip install matplotlib

To use interactive webapps, use the pip install MLVisualizationTools[dash] or pip install MLVisualizationTools[dash-notebook] flags on install.

If you are running on a notebook that doesn't have dash support (like kaggle), you might need pip install MLVisualizationTools[ngrok-tunneling]

Express

To get started using MLVisualizationTools, run one of the prebuilt apps.

import MLVisualizationTools.express.DashModelVisualizer as App

model = ... #your keras model
data = ... #your pandas dataframe with features

App.visualize(model, data)

Functions

MLVisualizationTools connects a variety of smaller functions.

Steps:

  1. Keras Model and Dataframe with features
  2. Analyzer
  3. Interface / Interface Raw (if you don't have a dataframe)
  4. Colorizers (optional)
  5. Apply Training Data Points (Optional)
  6. Colorize data points (Optional)
  7. Graphs

Analyzers take a keras model and return information about the inputs such as which ones have high variance.

Interfaces take parameters and construct a multidimensional grid of values based on plugging these numbers into the model.

(Raw interfaces allow you to use interfaces by specifying column data instead of a pandas dataframe. Column data is a list with a dict with name, min, max, and mean values for each feature column)

Colorizers mark points as being certain colors, typically above or below 0.5.

Data Interfaces render training data points on top of the graph to make it easier to tell if the model trained properly.

Graphs turn these output grids into a visual representation.

Sample

from MLVisualizationTools import Analytics, Interfaces, Graphs, Colorizers, DataInterfaces

#Displays plotly graphs with max variance inputs to model

model = ... #your model
df = ... #your dataframe
AR = Analytics.analyzeModel(model, df)
maxvar = AR.maxVariance()

grid = Interfaces.predictionGrid(model, maxvar[0], maxvar[1], df)
grid = Colorizers.binary(grid)
grid = DataInterfaces.addPercentageData(grid, df, str('OutputKey'))
fig = Graphs.plotlyGraph(grid)
fig.show()

Prebuilt Examples

Prebuilt examples run off of the pretrained model and dataset packaged with this library. They include:

  • Demo: a basic demo of library functionality that renders 2 plots
  • MatplotlibDemo: Demo but with matplotlib instead of plotly
  • DashDemo: Non-jupyter notebook version of an interactive dash website demo
  • DashNotebookDemo: Notebook version of an interactive website demo
  • DashKaggleDemo: Notebook version of an dash demo that works in kaggle notebooks
  • DataOverlayDemo: Demonstrates data overlay features

See MLVisualizationTools/Examples for more examples. Use example.main() to run the examples and set parameters such as themes.

Support for more ML Libraries

We support any ML library that has a predict() call that takes a pd Dataframe with features. If this doesn't work, use a wrapper class like in this example:

import pandas as pd

class ModelWrapper:
    def __init(self, model):
        self.model = model

    def predict(self, dataframe: pd.DataFrame):
        ... #Do whatever code you need here

Tensorflow Compatibility

MLVisualizationTools is distributed with a pretrained tensorflow model to make running examples quick and easy. It is not needed for main library functions.

For version 2.0 through 2.4, we load a v2.0 model. For version 2.5+ we load a v2.5 model.

If this causes compatibility issues you can still use the main library on your models. If you need an example model, retrain it with TrainTitanicModel.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

MLVisualizationTools-0.6.0rc0.tar.gz (78.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

MLVisualizationTools-0.6.0rc0-py3-none-any.whl (88.7 kB view details)

Uploaded Python 3

File details

Details for the file MLVisualizationTools-0.6.0rc0.tar.gz.

File metadata

  • Download URL: MLVisualizationTools-0.6.0rc0.tar.gz
  • Upload date:
  • Size: 78.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.12

File hashes

Hashes for MLVisualizationTools-0.6.0rc0.tar.gz
Algorithm Hash digest
SHA256 0eef3037aaaf23cba3dee8fc0c9b91621b26d5ed7ad7bcd24731be3ad55a7b54
MD5 de83d6b31920db296b58fafb9f68c19c
BLAKE2b-256 342d928a99400b4fb154bcccb11e53f6180b08dc51e731831cd3608113abe7e3

See more details on using hashes here.

File details

Details for the file MLVisualizationTools-0.6.0rc0-py3-none-any.whl.

File metadata

  • Download URL: MLVisualizationTools-0.6.0rc0-py3-none-any.whl
  • Upload date:
  • Size: 88.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.11.3 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.9.12

File hashes

Hashes for MLVisualizationTools-0.6.0rc0-py3-none-any.whl
Algorithm Hash digest
SHA256 b65ff77598c1c64731624269fb773a8635327e5a2b4b7d520b1edd2677a5f184
MD5 06b032ab78bde4aadf6a322cb2261a0b
BLAKE2b-256 746a3e89bcccc2d90fad56d0ec2f896c34f3b677e6d75383e288ebbe3d022337

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page