Skip to main content

Utility functions to help with exploratory data analysis on top the Pandas APIs

Project description

Pandas Data Exploration Utility Package

Table of content


Pandas Data Exploration utility is an interactive, notebook based library for quickly profiling and exploring the shape of data and the relationships between data. Using existing APIs from IpyWidget,, and Pandas, it creates a flexible point and click widget that allows the user to easily explore and visualize the dataset.
This is a work in progress, and I welcome any suggestions on features and/or enhancements.


pip install Pandas-Data-Exploration-Utility-Package


Visualization Module

import pandas as pd
import pandas_exploration_util.viz.explore as pe

global_temp = pd.read_csv("./data/GlobalTemperatures.csv", parse_dates = [0], infer_datetime_format=True)


see /test for sample data and test jupyter notebook

Pareto plot

Visualize the top values of any column as ranked by aggregation of any other column. Support aggregation functions include 'count', 'sum', 'mean', 'std', 'max', 'min', 'uniques'

Distribution plot

Visualize distribution of any numerical value. Binning is automatically determined by the histogram method.

X-Y plot

Visualize the X-Y scatter of any column vs aggregation of any other column. Support aggregation functions include 'count', 'sum', 'mean', 'std', 'max', 'min', 'uniques'

Recommended development setup

Local Dev

  1. Setup virtualenv
  2. Create a virtual environment using virtualenv /path/to/env/dir
  3. Activate virtual environment using source /path/to/env/dir/bin/activate
  4. Clone the repo locally
  5. Navigate the root directory of the repo where the lives
  6. Install the module in development mode using python develop
  7. Run the Jupyter notebook that is in the virtual environment directory, which should have installed as the part of the dependency of the module
  8. Dev away
  9. When done uninstall the package using python develop --uninstall
  10. Deactive the environment using deactivate

Building and distributing
Assuming all relevant tools are installed and the relevant project files are properly defined

  1. build the distribution using python3 sdist bdist_wheel
  2. upload the distribution using twine upload dist/*{version}*

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for Pandas-Data-Exploration-Utility-Package, version 0.0.3
Filename, size & hash File type Python version Upload date
Pandas_Data_Exploration_Utility_Package-0.0.3-py3-none-any.whl (6.4 kB) View hashes Wheel py3
Pandas Data Exploration Utility Package-0.0.3.tar.gz (5.5 kB) View hashes Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page