Skip to main content

Functions and tools for making Exploratory Data Analysis easy!

Project description

EDAeasy 😀

The package for quick exploratory data analysis

Instalation

pip install EDAeasy

Usage

The dataframe_summary function have relative simple summary of the columns of your dataframe for quick look at tabular data

Generate a summary DataFrame of the input DataFrame 'dataframe'.

Parameters
----------
dataframe : pandas.DataFrame
    The input DataFrame for which the summary needs to be generated.

Returns
-------
pandas.DataFrame
    A DataFrame containing summary information for each column in 'df':
    - Type: Data type of the column.
    - Min: Minimum value in the column.
    - Max: Maximum value in the column.
    - Nan %: Percentage of NaN values in the column.
    - # Unique Values: Total number of unique values in the column.
    - Unique values: List of unique values in the column.

Example
-------
>>> data = {
        'age': ['[40-50)', '[60-70)', '[70-80)'],
        'time_in_hospital': [8, 3, 5],
        'n_lab_procedures': [72, 34, 45],
        ...
    }
>>> dataframe = pd.DataFrame(data)
>>> result = dataframe_summary(df)
>>> print(result)
           Type       Min        Max  Nan %  # Unique Values                                  Unique values
Variables                                                                                                              
age       object   [40-50)    [90-100)    0.0        3      ['[70-80)', '[50-60)', '[60-70)', '[40-50)', '[80-90)', ...
time_in_hospital  int64    1           14    0.0        3        [8, 3, 5]
n_lab_procedures  int64    1          113    0.0        3        [72, 34, 45]
...

Note
----
The function uses vectorized operations to improve performance and memory usage.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edaeasy-1.2.5.tar.gz (7.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

edaeasy-1.2.5-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file edaeasy-1.2.5.tar.gz.

File metadata

  • Download URL: edaeasy-1.2.5.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.11.0 Darwin/23.6.0

File hashes

Hashes for edaeasy-1.2.5.tar.gz
Algorithm Hash digest
SHA256 dde883590ffe62e2d0f12c5a6348e2db8940994201a6bc29551057376ec6df2d
MD5 9df491342f0f95309186dbfcb8656409
BLAKE2b-256 43aba9fa93e8e6f03c288f0b7df0deaeda0a454b325fe8e9f52bb40ab0f48f1a

See more details on using hashes here.

File details

Details for the file edaeasy-1.2.5-py3-none-any.whl.

File metadata

  • Download URL: edaeasy-1.2.5-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.11.0 Darwin/23.6.0

File hashes

Hashes for edaeasy-1.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 8b042f416b56a95b612953228d977ad3df0d5d0fd0d602d0a1a178f7989989ba
MD5 7f8e539cde21b9e9eff0b6e7f584d172
BLAKE2b-256 686f10e0a6a9e20211615f7c202f2cfed412f80f5b1eb35aed0a15c46f834b4d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page