A comprehensive library for data preprocessing tasks

Project description

DataAnalyse is a comprehensive Python library for data preprocessing tasks. It provides a set of functions and classes for handling missing values, outliers, text manipulation, feature engineering, data type conversion, categorical encoding, date and time manipulation, and more.

Features

  • Data Cleaning: Handle missing values, outliers, and text data.
  • String Manipulation: Remove stopwords, lowercase text, remove punctuation, and lemmatize text.
  • Data Transformation: Convert data types, encode categorical variables, create new features, and handle date and time data.
  • Date and Time Manipulation: Extract date components.
  • Scalable: Works efficiently with large datasets.
  • Easy to Use: Simple and intuitive API.
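The string-manipulation steps listed above (lowercasing, punctuation removal, stopword removal) can be illustrated with a small standard-library sketch. This is not DataAnalyse's implementation: the function name `normalize_text` and the tiny `STOPWORDS` set are hypothetical and exist only for illustration. Lemmatization is left out because it needs an external linguistic resource.

```python
import string

# Hypothetical, deliberately tiny stopword set for illustration only.
STOPWORDS = {"a", "an", "the", "is", "of", "and"}


def normalize_text(text, stopwords=STOPWORDS):
    """Lowercase, strip ASCII punctuation, and drop stopwords."""
    lowered = text.lower()
    # str.maketrans with a third argument maps those characters to None.
    no_punct = lowered.translate(str.maketrans("", "", string.punctuation))
    tokens = [token for token in no_punct.split() if token not in stopwords]
    return " ".join(tokens)


cleaned = normalize_text("The quick, brown fox is a friend of the dog!")
print(cleaned)
```

Lowercasing comes first so that the stopword comparison is case-insensitive. Punctuation is removed before tokenizing so that "quick," and "quick" end up as the same token.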

Installation

You can install DataAnalyse from PyPI using pip:

Usage

import pandas as pd

# Import the specific helpers used below from DataAnalyse
from DataAnalyse import OutlierHandler, StringManipulation

# Load the dataset into a DataFrame
data = pd.read_csv('data.csv')

# Handle outliers using the interquartile range (IQR) rule
cleaned_data = OutlierHandler.remove_outliers_iqr(data, threshold=1.5)

# Text preprocessing: remove stopwords from a text column
processed_text = StringManipulation.remove_stopwords(data, column='text_column')
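
For readers unfamiliar with IQR-based outlier removal, the following pandas-only sketch shows what such a filter typically does. It is not the code behind `OutlierHandler.remove_outliers_iqr`: the function name `iqr_filter` and its `column` parameter are hypothetical, and it assumes `threshold` is the multiplier applied to the IQR.

```python
import pandas as pd


def iqr_filter(df, column, threshold=1.5):
    """Keep rows whose value lies within [Q1 - t*IQR, Q3 + t*IQR]."""
    q1 = df[column].quantile(0.25)
    q3 = df[column].quantile(0.75)
    iqr = q3 - q1
    lower = q1 - threshold * iqr
    upper = q3 + threshold * iqr
    # between() is inclusive on both ends by default.
    return df[df[column].between(lower, upper)]


sample = pd.DataFrame({"value": [10, 12, 11, 13, 12, 100]})
filtered = iqr_filter(sample, "value")
print(filtered)
```

With the conventional multiplier of 1.5, the value 100 falls outside the fences and is dropped, while the remaining rows are kept.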

Download files

Source Distributions

No source distribution files are available for this release.

Built Distribution

dataMnp-0.1.0-py3-none-any.whl (8.1 kB)

Uploaded Python 3

File details

Details for the file dataMnp-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: dataMnp-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.2

File hashes

Hashes for dataMnp-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0cefb7d6f932bfa070337e37a5ca9d61c3ceb4388b77566f26eba9184a0a9141
MD5 f3c69653bd0ad70fc07954a1b22aa89e
BLAKE2b-256 ca9edb0ab15b36389b2ea860b307c230e27943ed588696e52371d1ff8cd95ccb
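
A downloaded wheel can be checked against the SHA256 digest listed above. The sketch below uses only the standard library; the helper names `sha256_of` and `matches_expected` are hypothetical, and it assumes the wheel has already been saved locally.

```python
import hashlib

# SHA256 digest published for dataMnp-0.1.0-py3-none-any.whl (copied from the table above).
EXPECTED_SHA256 = "0cefb7d6f932bfa070337e37a5ca9d61c3ceb4388b77566f26eba9184a0a9141"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected.lower()
```

After downloading, `matches_expected("dataMnp-0.1.0-py3-none-any.whl")` should return `True` for an unmodified file. pip can also enforce hashes itself through its hash-checking mode.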
