Skip to main content

Auto EDA package

Project description

Auto EDA package

  • Automatically detects numeric and categorical features
  • User may manually assign numeric and categorical features using set_numeric_features() and set_categorical_features() methods if the feature detection is incorrect (recommended).
  • get_data_structure_summary() provides basic information like head, tail, data types, missing value info, etc.
  • get_categorical_features_summary() provides information like count of unique values, unique values, data distribution, etc of categorical features.
  • get_numeric_features_summary() provides summary of numeric features along with distribution plots.
  • plot_correlation_matrix() plots the Pearson and Spearman correlation matrices of all the numeric features.
  • plot_chi_square_result() plots the p-values of the chi-square tests performed between categorical features.
  • plot_numeric_vs_numeric() plots scatter plots between the numeric features.
  • plot_categorical_vs_categorical() plots stacked bar plots between the categorical features.
  • a.plot_mutual_information(target) plots a bar plot showing the mutual information score between the input features and target.
  • a.get_vif() returns the VIF scores of all the numeric features.
  • plot_categorical_vs_numeric() plots violin plots between all the categorical features and numeric features.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data_analyzer-0.0.1.tar.gz (6.6 kB view details)

Uploaded Source

File details

Details for the file data_analyzer-0.0.1.tar.gz.

File metadata

  • Download URL: data_analyzer-0.0.1.tar.gz
  • Upload date:
  • Size: 6.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.7.13

File hashes

Hashes for data_analyzer-0.0.1.tar.gz
Algorithm Hash digest
SHA256 3bead62ac601217ef6f92ae6962455caed6221b7d08b40ee68b4fb3d948f6d02
MD5 694013fab497ea0fb06805406f8d5c8d
BLAKE2b-256 a85fc64044d067e8cccdfa1c805c080742eaf0941298db4524a9acc0889085ff

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page