Skip to main content

Exploratory data analysis tools

Project description

Installation

To install the package,

pip install edamame

the edamame package works correctly inside a .ipynb file.

import edamame as eda

Why Edamame?

Edamame is born under the inspiration of the pandas-profiling and pycaret packages. The scope of edamame is to build friendly and helpful functions for handling the EDA (exploratory data analysis) step in a dataset studied and after that train and analyze a models battery for regression or classification problems.

Exploratory data analysis functions

You can find an example of the EDA that uses the edamame package in the eda_example.ipynb notebook.

Dimensions

a prettier version of the .shape method

eda.dimensions(data)

the function displays the number of rows and columns of a pandas dataframe passed

Describe distribution

eda.describe_distribution(data)

passing a dataframe the function display the result of the .describe() method applied to a pandas dataframe, divided by quantitative/numerical and categorical/object columns.

Identify columns types

eda.identify_types(data)

passing a dataframe the function display the result of the .dtypes method and returns a list with the name of the quantitative/numerical columns and a list with the name of the columns identified as "object" by pandas.

Convert numerical columns to categorical

eda.identify_types(data, col: list[str])

passing a dataframe and a list with columns name, the function transforms the types of the columns into "object". This operation can help convert numerical columns we know to be categorical.

Missing data

eda.missing(data)

TODO

  • Finishing the documentation
  • Add the xgboost model, PCA regression and other methods for studying the goodness of fit of the other models
  • Add the classification part to the package

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edamame-0.22.tar.gz (13.5 kB view details)

Uploaded Source

Built Distribution

edamame-0.22-py3-none-any.whl (13.2 kB view details)

Uploaded Python 3

File details

Details for the file edamame-0.22.tar.gz.

File metadata

  • Download URL: edamame-0.22.tar.gz
  • Upload date:
  • Size: 13.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.12

File hashes

Hashes for edamame-0.22.tar.gz
Algorithm Hash digest
SHA256 8027739fd8858b686e816cbd22735acdf9978e6977f0035255a01e9cad976a67
MD5 324dcd753dbeadcb3adadecc99702d9a
BLAKE2b-256 75a54d2994bcc193a1e42b9ecc9cf5f28522819425dcd9e96e53f38605623c75

See more details on using hashes here.

File details

Details for the file edamame-0.22-py3-none-any.whl.

File metadata

  • Download URL: edamame-0.22-py3-none-any.whl
  • Upload date:
  • Size: 13.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.12

File hashes

Hashes for edamame-0.22-py3-none-any.whl
Algorithm Hash digest
SHA256 d4e9ac91c3b8b4006b18e7415a265162cceaf7233989472c1ce33f48e43607e9
MD5 31281e97a5e796059e69714fd2fabc08
BLAKE2b-256 33045ea15576a80f5c4687f35491bde316506354dd305ff4820858eaf898a22c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page