A comprehensive library for data preprocessing tasks
Project description
DataAnalyse is a comprehensive Python library for data preprocessing tasks. It provides a set of functions and classes for handling missing values, outliers, text manipulation, feature engineering, data type conversion, categorical encoding, date and time manipulation, and more.
Features
- Data Cleaning: Handle missing values, outliers, and text data.
- String Manipulation: Remove stopwords, lowercase text, remove punctuation, and lemmatize text.
- Data Transformation: Convert data types, encode categorical variables, create new features, and handle date and time data.
- Date and Time Manipulation: Extract date components.
- Scalable: Works efficiently with large datasets.
- Easy to Use: Simple and intuitive API.
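The kinds of steps listed above can be sketched in plain pandas. This is not DataAnalyse's own API, which this page does not document; it is a minimal illustration, and the sample data and column names (`age`, `city`, `signup`) are hypothetical.

```python
import pandas as pd

# Hypothetical sample data; the column names are illustrative only.
df = pd.DataFrame({
    "age": ["25", "31", None, "47"],
    "city": ["Paris", "Lima", "Paris", None],
    "signup": ["2024-01-15", "2024-02-20", "2024-03-05", "2024-04-10"],
})

# Data type conversion: numeric strings become floats, missing entries become NaN.
df["age"] = pd.to_numeric(df["age"])

# Missing values: fill the numeric column with its median,
# and the categorical column with an explicit placeholder.
df["age"] = df["age"].fillna(df["age"].median())
df["city"] = df["city"].fillna("unknown")

# Date and time: parse the strings, then extract individual components.
df["signup"] = pd.to_datetime(df["signup"])
df["signup_year"] = df["signup"].dt.year
df["signup_month"] = df["signup"].dt.month
df["signup_weekday"] = df["signup"].dt.day_name()

# Categorical encoding: one indicator column per category value.
encoded = pd.get_dummies(df, columns=["city"], prefix="city")

print(encoded.head())
```

Filling with the median rather than the mean is one common choice because it is less sensitive to extreme values; whether DataAnalyse uses the same default is not stated here.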
Installation
You can install DataAnalyse from PyPI using pip:
Usage
# Import the whole package, or only the specific handlers you need
import pandas as pd
import DataAnalyse
from DataAnalyse import OutlierHandler, StringManipulation

# Load the input data
data = pd.read_csv('data.csv')

# Handle outliers using the IQR rule
cleaned_data = OutlierHandler.remove_outliers_iqr(data, threshold=1.5)

# Text preprocessing: remove stopwords from a text column
processed_text = StringManipulation.remove_stopwords(data, column='text_column')
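For readers who want to see what the two use cases above typically compute, here is a rough sketch in plain pandas. It is an assumption about the general technique, not the library's actual implementation: the function names, the explicit `column` argument on the outlier helper, and the tiny stopword list are all hypothetical.

```python
import pandas as pd

# A tiny hand-picked stopword list, for illustration only (an assumption;
# real pipelines usually rely on a larger curated list).
STOPWORDS = {"a", "an", "the", "is", "of", "and", "to", "in"}


def remove_outliers_iqr_sketch(df, column, threshold=1.5):
    """Keep rows whose value lies within [Q1 - t*IQR, Q3 + t*IQR]."""
    q1 = df[column].quantile(0.25)
    q3 = df[column].quantile(0.75)
    iqr = q3 - q1
    lower = q1 - threshold * iqr
    upper = q3 + threshold * iqr
    # between() is inclusive on both ends by default.
    return df[df[column].between(lower, upper)]


def remove_stopwords_sketch(df, column):
    """Return a copy of df with stopwords dropped from a text column."""
    out = df.copy()
    out[column] = out[column].apply(
        lambda text: " ".join(
            word for word in text.split() if word.lower() not in STOPWORDS
        )
    )
    return out


# Hypothetical data: one obvious outlier (95) in the numeric column.
data = pd.DataFrame({
    "value": [10, 12, 11, 13, 12, 95],
    "text_column": [
        "The cat is in the hat",
        "A tale of two cities",
        "An apple and a pear",
        "Back to the future",
        "The end",
        "Lord of the rings",
    ],
})

cleaned = remove_outliers_iqr_sketch(data, "value")
processed = remove_stopwords_sketch(cleaned, "text_column")
print(processed)
```

A `threshold` of 1.5 is the conventional multiplier for the IQR rule; larger values keep more of the tails.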
Built Distribution
File details
Details for the file dataMnp-0.1.0-py3-none-any.whl.
File metadata
- Download URL: dataMnp-0.1.0-py3-none-any.whl
- Upload date:
- Size: 8.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.2
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 0cefb7d6f932bfa070337e37a5ca9d61c3ceb4388b77566f26eba9184a0a9141 |
| MD5 | f3c69653bd0ad70fc07954a1b22aa89e |
| BLAKE2b-256 | ca9edb0ab15b36389b2ea860b307c230e27943ed588696e52371d1ff8cd95ccb |