Skip to main content

EDA in 4 lines of Code

Project description

DataCompass

DataCompass is a Python package designed to enhance data analysis with pandas DataFrames. It provides a set of tools to quickly inspect and understand the structure and contents of DataFrames, aiding in efficient data exploration and preprocessing.

Features

  • Describe Columns: Analyze and list numerical and categorical columns in a DataFrame.
  • Display Missing Information: Calculate and display the count and percentage of missing values in each column.
  • Display Basic Information: Show basic details like the number of columns, rows, and the first few observations.
  • Display Unique Values: Enumerate unique values or the count of unique values in each column.

Installation

To install DataCompass, simply use pip:

pip install datacompass
import pandas as pd
from datacompass import describe_columns, display_missing_info, display_basic_info, display_unique_values

# Sample DataFrame
df = pd.DataFrame({'A': [1, 2, None, 4], 'B': ['a', 'b', 'b', 'c']})

# Describe Columns
describe_columns(df)

# Display Missing Information
display_missing_info(df)

# Display Basic Information
display_basic_info(df)

# Display Unique Values
display_unique_values(df)
### Describe Columns Output

Number of Numerical Columns: 1
['A']
-------------------------------------------------------------------------------------
Number of Categorical Columns: 1
['B']

### Display Missing Information Output

|   Missing Count | Missing Percentage |
|----------------:|-------------------:|
| A              |                  1 |             25.0 |
| B              |                  0 |              0.0 |

### Display Basic Information Output

Number of Columns: 2
Number of Rows: 4

First 6 Observations of Our Data:
  A    B
1.0  a
2.0  b
NaN  b
4.0  c

### Display Unique Values Output

A contains: 1.0, 2.0, nan, 4.0
B contains: a, b, c

Requirements

  • pandas

Contribution

Contributions to DataCompass are welcome! Please feel free to submit a pull request or open an issue on the GitHub repository.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

Sai Koushik Gandikota

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datacompass-0.4.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datacompass-0.4-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file datacompass-0.4.tar.gz.

File metadata

  • Download URL: datacompass-0.4.tar.gz
  • Upload date:
  • Size: 3.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.18

File hashes

Hashes for datacompass-0.4.tar.gz
Algorithm Hash digest
SHA256 52e3601f7641da51d42dc38cf2fe92b78cdacc28799f9decb58afee85eef4d24
MD5 3dbd86df53bf1b69e9d0805642eda9bb
BLAKE2b-256 bd551a6845b4665cbcfb47372a9bf451fd3625ff38e2766d0d056c79c84412e7

See more details on using hashes here.

File details

Details for the file datacompass-0.4-py3-none-any.whl.

File metadata

  • Download URL: datacompass-0.4-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.18

File hashes

Hashes for datacompass-0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 47bb242aee5d440170db21db0a5591a6c0ead3fdf000a0f17eea1e1502fec541
MD5 32cee3cc2e21f314eea71189abfdca12
BLAKE2b-256 88f4070073b37786ae2558c0894cc9b259ea765946e3180f9f5c685a721dfd0f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page