EDA in 4 lines of Code
Project description
DataCompass
DataCompass is a Python package designed to enhance data analysis with pandas DataFrames. It provides a set of tools to quickly inspect and understand the structure and contents of DataFrames, aiding in efficient data exploration and preprocessing.
Features
- Describe Columns: Analyze and list numerical and categorical columns in a DataFrame.
- Display Missing Information: Calculate and display the count and percentage of missing values in each column.
- Display Basic Information: Show basic details like the number of columns, rows, and the first few observations.
- Display Unique Values: Enumerate unique values or the count of unique values in each column.
Installation
To install DataCompass, simply use pip:
pip install datacompass
import pandas as pd
from datacompass import describe_columns, display_missing_info, display_basic_info, display_unique_values
# Sample DataFrame
df = pd.DataFrame({'A': [1, 2, None, 4], 'B': ['a', 'b', 'b', 'c']})
# Describe Columns
describe_columns(df)
# Display Missing Information
display_missing_info(df)
# Display Basic Information
display_basic_info(df)
# Display Unique Values
display_unique_values(df)
### Describe Columns Output
Number of Numerical Columns: 1
['A']
-------------------------------------------------------------------------------------
Number of Categorical Columns: 1
['B']
### Display Missing Information Output
| Missing Count | Missing Percentage |
|----------------:|-------------------:|
| A | 1 | 25.0 |
| B | 0 | 0.0 |
### Display Basic Information Output
Number of Columns: 2
Number of Rows: 4
First 6 Observations of Our Data:
A B
1.0 a
2.0 b
NaN b
4.0 c
### Display Unique Values Output
A contains: 1.0, 2.0, nan, 4.0
B contains: a, b, c
Requirements
- pandas
Contribution
Contributions to DataCompass are welcome! Please feel free to submit a pull request or open an issue on the GitHub repository.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Author
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file datacompass-0.4.tar.gz.
File metadata
- Download URL: datacompass-0.4.tar.gz
- Upload date:
- Size: 3.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
52e3601f7641da51d42dc38cf2fe92b78cdacc28799f9decb58afee85eef4d24
|
|
| MD5 |
3dbd86df53bf1b69e9d0805642eda9bb
|
|
| BLAKE2b-256 |
bd551a6845b4665cbcfb47372a9bf451fd3625ff38e2766d0d056c79c84412e7
|
File details
Details for the file datacompass-0.4-py3-none-any.whl.
File metadata
- Download URL: datacompass-0.4-py3-none-any.whl
- Upload date:
- Size: 3.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
47bb242aee5d440170db21db0a5591a6c0ead3fdf000a0f17eea1e1502fec541
|
|
| MD5 |
32cee3cc2e21f314eea71189abfdca12
|
|
| BLAKE2b-256 |
88f4070073b37786ae2558c0894cc9b259ea765946e3180f9f5c685a721dfd0f
|