Project description

EDAeasy 😀

A package for quick exploratory data analysis.

Installation

pip install EDAeasy

Usage

The dataframe_summary function returns a relatively simple summary of the columns of your DataFrame, for a quick look at tabular data.

Generate a summary DataFrame of the input DataFrame 'dataframe'.

Parameters
----------
dataframe : pandas.DataFrame
    The input DataFrame for which the summary needs to be generated.

Returns
-------
pandas.DataFrame
    A DataFrame containing summary information for each column in 'dataframe':
    - Type: Data type of the column.
    - Min: Minimum value in the column.
    - Max: Maximum value in the column.
    - Nan %: Percentage of NaN values in the column.
    - # Unique Values: Total number of unique values in the column.
    - Unique values: List of unique values in the column.

Example
-------
>>> import pandas as pd
>>> data = {
        'age': ['[40-50)', '[60-70)', '[70-80)'],
        'time_in_hospital': [8, 3, 5],
        'n_lab_procedures': [72, 34, 45],
        ...
    }
>>> dataframe = pd.DataFrame(data)
>>> result = dataframe_summary(dataframe)
>>> print(result)
                    Type      Min       Max  Nan %  # Unique Values  Unique values
Variables
age               object  [40-50)  [90-100)    0.0                3  ['[70-80)', '[50-60)', '[60-70)', '[40-50)', '[80-90)', ...
time_in_hospital   int64        1        14    0.0                3  [8, 3, 5]
n_lab_procedures   int64        1       113    0.0                3  [72, 34, 45]
...

Note
----
The function uses vectorized operations to improve performance and memory usage.
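
As a rough illustration of what a summary like this involves, the sketch below builds a similar table with plain pandas. It is not EDAeasy's implementation: the name `summarize_columns` is hypothetical, and for readability it loops over the columns one by one rather than using the vectorized approach mentioned in the note above. Only the output column names are taken from the description in the Returns section.

```python
import pandas as pd


def summarize_columns(dataframe: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical stand-in (not EDAeasy's code): one summary row per column."""
    records = []
    for column in dataframe.columns:
        values = dataframe[column]
        non_null = values.dropna()  # keep NaN out of min/max and the unique list
        records.append({
            "Type": str(values.dtype),
            "Min": non_null.min(),
            "Max": non_null.max(),
            "Nan %": values.isna().mean() * 100,  # share of missing entries
            "# Unique Values": values.nunique(),  # NaN is not counted
            "Unique values": non_null.unique().tolist(),
        })
    # Index the summary by the original column names, labelled "Variables".
    index = pd.Index(dataframe.columns, name="Variables")
    return pd.DataFrame(records, index=index)


if __name__ == "__main__":
    sample = pd.DataFrame({
        "age": ["[40-50)", "[60-70)", "[70-80)"],
        "time_in_hospital": [8, 3, 5],
        "n_lab_procedures": [72, 34, 45],
    })
    print(summarize_columns(sample))
```

Storing the unique values as a list in each cell keeps the table compact, although for columns with many distinct values it may be preferable to truncate that list.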

Project details

Release history

1.0.1 (Aug 31, 2023, this version)

1.0.0 (Aug 1, 2023)

Download files

Source Distribution

edaeasy-1.0.1.tar.gz (3.0 kB)

Uploaded Aug 31, 2023 Source

Built Distribution

edaeasy-1.0.1-py3-none-any.whl (4.1 kB)

Uploaded Aug 31, 2023 Python 3

Hashes for edaeasy-1.0.1.tar.gz

Algorithm	Hash digest
SHA256	`a79e0981753bf498e39f1697af43fc7bc668dbbcb15186cd6f441b9355fd6e86`
MD5	`2ce80b09c6df2d83bf98b600d178cee4`
BLAKE2b-256	`ca114bc9c7df999253cecc13a940a6ff3abe5c32491c885ab3cb1affcd4502b8`

Hashes for edaeasy-1.0.1-py3-none-any.whl

Algorithm	Hash digest
SHA256	`e7d68ee856db602858c9d61ec89901eb874ad77e259c7676c71f955dd7752564`
MD5	`266b357f8e2e32c9106b387e6e6607f6`
BLAKE2b-256	`c2d1567410cb689218d6c87733bf49f1a41c0c75d52862e94a2147e3f523d70e`