
A package for data analysis


suraj_datalab


suraj_datalab is a Python package for analyzing and visualizing categorical and numerical data. It also includes utilities for cleaning data and preparing datasets for machine learning models, such as creating K-Folds for cross-validation.


Features

  • Categorical Analysis: Effortlessly analyze and visualize categorical data in relation to target variables.
  • Numerical Analysis: Detect, analyze, and visualize outliers in numerical data.
  • Data Cleaning: Automatically handle rare categories in your datasets.
  • Cross-Validation Preparation: Create K-Folds for both classification and regression tasks, including stratified K-Folds.
  • Visualization: Built-in support for generating insightful plots with minimal code.
  • Extensible: Designed with flexibility in mind, allowing easy extension and integration with other data processing workflows.
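The feature list above does not name the package's functions for outlier detection, rare-category handling, or fold creation, so the following is only a plain-Python sketch of the underlying techniques. It is not suraj_datalab's implementation: the function names, the 1.5 × IQR rule, the `min_count` threshold, and the round-robin fold assignment are all assumptions chosen for illustration.

```python
from collections import Counter, defaultdict
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v for v in values if v < low or v > high]


def group_rare(categories, min_count=2, other="Other"):
    """Replace categories seen fewer than min_count times with one label."""
    counts = Counter(categories)
    return [c if counts[c] >= min_count else other for c in categories]


def stratified_folds(labels, n_folds):
    """Give each row a fold index, dealing every class out round-robin.

    No shuffling is done here; rows of each class are assigned to folds
    0, 1, ..., n_folds - 1 in the order they appear.
    """
    seen = defaultdict(int)
    folds = []
    for label in labels:
        folds.append(seen[label] % n_folds)
        seen[label] += 1
    return folds


print(iqr_outliers([1, 2, 3, 4, 5, 100]))           # [100]
print(group_rare(["a", "a", "b", "c", "c"]))        # ['a', 'a', 'Other', 'c', 'c']
print(stratified_folds([0, 0, 0, 0, 1, 1], 2))      # [0, 1, 0, 1, 0, 1]
```

In the last call each fold receives two rows of class 0 and one row of class 1, which is the class-balance property that stratified K-Folds are meant to preserve.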

Installation


Install via pip

pip install suraj_datalab

Quickstart

Here’s how you can quickly get started with suraj_datalab:

import pandas as pd
from suraj_datalab.analysis import analyze_categorical_feature, analyze_numerical_feature

# Sample DataFrame
data = {'Feature': ['A', 'B', 'A', 'B'], 'Transported': [True, False, True, False]}
df = pd.DataFrame(data)

# Analyze categorical feature
result = analyze_categorical_feature(df, 'Feature', 'Transported')
print(result)
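
The Quickstart does not show what `analyze_categorical_feature` returns. As a rough idea of what analyzing a categorical feature against a target can involve, here is a minimal plain-Python sketch that computes the share of positive targets per category for the same sample rows. The function name `target_rate_by_category` is hypothetical, and this is not the package's actual output format.

```python
from collections import defaultdict

# Same sample rows as the Quickstart, as plain (feature, target) pairs.
rows = [("A", True), ("B", False), ("A", True), ("B", False)]


def target_rate_by_category(pairs):
    """Return {category: share of rows whose target is truthy}."""
    totals = defaultdict(int)
    positives = defaultdict(int)
    for category, target in pairs:
        totals[category] += 1
        positives[category] += bool(target)
    return {c: positives[c] / totals[c] for c in totals}


print(target_rate_by_category(rows))  # {'A': 1.0, 'B': 0.0}
```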

Usage

For detailed usage instructions, please refer to the Usage Guide.

Examples

Check out the Examples section for practical examples of how to use the functions and classes provided by suraj_datalab.

API Reference

For a detailed reference of all available functions and classes, see the API Reference.

Contributing

Contributions are welcome! Please read the Contributing Guidelines for more details.

License

This project is licensed under the MIT License; see the LICENSE file for details.

Acknowledgments

Thanks to all contributors who have helped with this project.

Contact

For any questions or suggestions, please contact Suraj Wate.
