Skip to main content

Functions and tools for making Exploratory Data Analysis easy!

Project description

EDAeasy 😀

The package for quick exploratory data analysis

Instalation

pip install EDAeasy

Usage

The dataframe_summary function have relative simple summary of the columns of your dataframe for quick look at tabular data

Generate a summary DataFrame of the input DataFrame 'dataframe'.

Parameters
----------
dataframe : pandas.DataFrame
    The input DataFrame for which the summary needs to be generated.

Returns
-------
pandas.DataFrame
    A DataFrame containing summary information for each column in 'df':
    - Type: Data type of the column.
    - Min: Minimum value in the column.
    - Max: Maximum value in the column.
    - Nan %: Percentage of NaN values in the column.
    - # Unique Values: Total number of unique values in the column.
    - Unique values: List of unique values in the column.

Example
-------
>>> data = {
        'age': ['[40-50)', '[60-70)', '[70-80)'],
        'time_in_hospital': [8, 3, 5],
        'n_lab_procedures': [72, 34, 45],
        ...
    }
>>> dataframe = pd.DataFrame(data)
>>> result = dataframe_summary(df)
>>> print(result)
           Type       Min        Max  Nan %  # Unique Values                                  Unique values
Variables                                                                                                              
age       object   [40-50)    [90-100)    0.0        3      ['[70-80)', '[50-60)', '[60-70)', '[40-50)', '[80-90)', ...
time_in_hospital  int64    1           14    0.0        3        [8, 3, 5]
n_lab_procedures  int64    1          113    0.0        3        [72, 34, 45]
...

Note
----
The function uses vectorized operations to improve performance and memory usage.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edaeasy-1.1.3.tar.gz (3.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

edaeasy-1.1.3-py3-none-any.whl (4.7 kB view details)

Uploaded Python 3

File details

Details for the file edaeasy-1.1.3.tar.gz.

File metadata

  • Download URL: edaeasy-1.1.3.tar.gz
  • Upload date:
  • Size: 3.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.11.0 Darwin/23.5.0

File hashes

Hashes for edaeasy-1.1.3.tar.gz
Algorithm Hash digest
SHA256 0d7836f2b8ed0e4679e804ec9c0c4e324389da1b8c9cd6379a7673bc0b089510
MD5 17e5782903f1e140d18ea0a65a2dc7a2
BLAKE2b-256 62f0ca317f93b7cbf2b5d8de3a233ef49d906fdd440214aa660bc94156f4d31e

See more details on using hashes here.

File details

Details for the file edaeasy-1.1.3-py3-none-any.whl.

File metadata

  • Download URL: edaeasy-1.1.3-py3-none-any.whl
  • Upload date:
  • Size: 4.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.11.0 Darwin/23.5.0

File hashes

Hashes for edaeasy-1.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 203b66aab6b76c44e29f49c5bd107250a37692bbe4c59ad788400cdcb3efa711
MD5 318c9596c768d4fc4c9529ac94e05996
BLAKE2b-256 2034520124d84e1a32e922aaadceb10f8585aab2e17a32c9131bd5cfb1a524d7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page