Skip to main content

Automated view of dataset

Project description

Description

Requirements

  • python >= 3.6
  • numpy >= 1.16
  • pandas >= 0.23
  • matplotlib >= 3.02
  • IPython >= 7.2.0

Module, provides the function view, which displays general information on the data:

  • Result of method info()
  • Result of method describe()
  • for numeric / categorical signs - The number of missions in the data (number and percentage for each column)
  • grid of histograms
  • Top-5 of the most frequent categorical signs (for each)

Parameters (function view):

  • d - table with data
  • only_numeric - True / False, default: True. True - information output only by numerical signs, False - information output by numerical and categorical signs.
  • full_stats - True / False, default: False. False - output information on numerical characteristics without interquartile range, data boundaries without outliers, True - complete output with data character.
  • histograms - True / False, default: True. True - output with building histograms for numerical signs, False - without building histograms

Top-5 elements of categorical signs

The table is formed as follows. The postfix (_name / _count) is assigned to the name of the data column:

  • _name - category name
  • _count - number of elements in this category If there are less than 5 elements in the attribute, then the values ​​in the _count field are filled -1

Usage

$ pip install data_view

$ python3

import pandas as pd

import numpy as np

from data_view import view

d = pd.DataFrame(np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]), columns=['a', 'b', 'c'])

view(d, only_numeric=True, histograms=False)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data-view-0.0.9.tar.gz (3.7 kB view hashes)

Uploaded Source

Built Distribution

data_view-0.0.9-py3-none-any.whl (3.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page