Automated view of dataset
Project description
Description
Module, provides the function view, which displays general information on the data:
- Result of method info()
- Result of method describe()
- for numeric / categorical signs - The number of missions in the data (number and percentage for each column)
- Top-5 of the most frequent categorical signs (for each)
Parameters:
- d - table with data
- only_numeric - True / False, default: True. True - information output only by numerical signs, False - information output by numerical and categorical signs.
- full_stats - True / False, default: False. False - output information on numerical characteristics without interquartile range, data boundaries without outliers, True - complete output with data character.
- histograms - True / False, default: True. True - output with building histograms for numerical signs, False - without building histograms
Top-5 elements of categorical signs
The table is formed as follows. The postfix (_name / _count) is assigned to the name of the data column:
- _name - category name
- _count - number of elements in this category If there are less than 5 elements in the attribute, then the values in the _count field are filled -1
Usage
$ pip install data_view
$ python3
import pandas as pd
import numpy as np
from data_view import view
d = pd.DataFrame(np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]]), columns=['a', 'b', 'c'])
view(d, only_numeric=True, histograms=False)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
data-view-0.0.8.tar.gz
(3.6 kB
view hashes)
Built Distribution
Close
Hashes for data_view-0.0.8-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 88c6828127379cf9c6a855366d41f07dd6854d6732f80bf2e62aa1d3ecdb11f4 |
|
MD5 | 0ceca8a5ff10b4bdec06150466c834da |
|
BLAKE2b-256 | 554eb5c4a52499d44ddbaf687799d010d6860f15ae0a7da5d709ad58b56a6d7d |