A package for data analysis
Project description
suraj_datalab
suraj_datalab is a Python package designed to streamline the process of analyzing and visualizing both categorical and numerical data. It also includes utilities for data cleaning and preparing datasets for machine learning models, like creating K-Folds for cross-validation.
Table of Contents
- Features
- Installation
- Quickstart
- Usage
- Examples
- API Reference
- Contributing
- License
- Acknowledgments
- Contact
Features
- Categorical Analysis: Effortlessly analyze and visualize categorical data in relation to target variables.
- Numerical Analysis: Detect, analyze, and visualize outliers in numerical data.
- Data Cleaning: Automatically handle rare categories in your datasets.
- Cross-Validation Preparation: Create K-Folds for both classification and regression tasks, including stratified K-Folds.
- Visualization: Built-in support for generating insightful plots with minimal code.
- Extensible: Designed with flexibility in mind, allowing easy extension and integration with other data processing workflows.
Installation
Requirements
- Python 3.12 or higher
- pandas
- seaborn
- matplotlib
Install via pip
pip install suraj_datalab
Quickstart
Here’s how you can quickly get started with suraj_datalab:
import pandas as pd
from suraj_datalab.analysis import analyze_categorical_feature, analyze_numerical_feature
# Sample DataFrame
data = {'Feature': ['A', 'B', 'A', 'B'], 'Transported': [True, False, True, False]}
df = pd.DataFrame(data)
# Analyze categorical feature
result = analyze_categorical_feature(df, 'Feature', 'Transported')
print(result)
Usage
For detailed usage instructions, please refer to the Usage Guide.
Examples
Check out the Examples section for practical examples of how to use the functions and classes provided by suraj_datalab.
API Reference
For a detailed reference of all available functions and classes, see the API Reference.
Contributing
Contributions are welcome! Please read the Contributing Guidelines for more details.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Acknowledgments
Thanks to all contributors who have helped with this project.
Contact
For any questions or suggestions, please contact Suraj Wate.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file suraj_datalab-0.1.1.tar.gz.
File metadata
- Download URL: suraj_datalab-0.1.1.tar.gz
- Upload date:
- Size: 7.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.5 Linux/6.5.0-1025-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
46d2830ef9ebc91883b07452003cfdbc16e43cd54c0ab67e5a68fb9c39730a93
|
|
| MD5 |
b496eca9a2caa160fe2e4ff615e46d5d
|
|
| BLAKE2b-256 |
0e557e500aa9cd9ec2607a9c128a4f21d1fb692479a507e44055bf80a0e93a79
|
File details
Details for the file suraj_datalab-0.1.1-py3-none-any.whl.
File metadata
- Download URL: suraj_datalab-0.1.1-py3-none-any.whl
- Upload date:
- Size: 8.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.12.5 Linux/6.5.0-1025-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8f1c8263f48427c15f8601ef25fa7f6e940e19df57100d592f8cc94c586cbf65
|
|
| MD5 |
0681e7e76fd94a5e445dbd53f51cc729
|
|
| BLAKE2b-256 |
29bf165b4605461291f19420d39d1ac96b77c0051c6d5eb849945716f7dc23a0
|