Skip to main content

No project description provided

Project description

A Deep Learning Data Analysis Package

DataPrep and Visualization Toolkit

This is a Python package designed to streamline the process of preparing datasets for machine learning workflows and visualizing time-series data. This package provides essential functionality for splitting datasets, applying data scaling techniques, and visualizing feature trends, making it easier to prepare data for modeling. This is version 0.3 of the package, and we plan to add more features in future updates!

Key Features

Exponential Weighted Mean Smoothing:

Smooths input features using an exponential weighted mean (EWM) to help reduce noise in the data before training.

Train-Test Split with Optional Validation Split:

The data_prep() function handles the splitting of data into training, testing, and (optionally) validation sets, with a variety of user-defined parameters for customization.

Scaling Options:

Choose between two widely-used scaling methods—MinMaxScaler and StandardScaler—to normalize your data and ensure that it’s well-prepared for machine learning models.

Support for Oversampling (SMOTE):

The package offers optional oversampling using the SMOTE technique to handle imbalanced datasets effectively.

Dataset Visualization:

The dataset_visualize() function allows you to easily visualize time-series data for selected features, providing insights into trends and patterns in the dataset.

Installation

You can install the package using pip:

pip install dl-data-analysis

Data Preparation

import pandas as pd
from your_package_name import data_prep

# Example usage
X_train, X_test, y_train, y_test = data_prep(
    x_dataframe=my_data, 
    y_data=labels, 
    test_ratio=0.3, 
    validation=True, 
    scaler_type="min_max", 
    oversample=True
)

Visualization

from your_package_name import dataset_visualize

# Example visualization
dataset_visualize(
    pd_dataframe=my_data, 
    feature_list=['feature_1', 'feature_2'], 
    Name='Sensor', 
    list=[1, 2, 3]
)

Planned Updates

This is just the first version of the package. We have plans to introduce additional features in the future, including:

  • More scaling and normalization techniques.
  • Advanced data preprocessing capabilities.
  • Enhanced visualization functions.
  • Support for more types of datasets and tasks.

Stay tuned for more!

Contributing

Contributions are welcome! If you have any ideas or would like to contribute to the project, please open an issue or submit a pull request.

License

This project is licensed under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dl_data_analysis-0.3.tar.gz (3.3 kB view details)

Uploaded Source

Built Distribution

dl_data_analysis-0.3-py3-none-any.whl (3.8 kB view details)

Uploaded Python 3

File details

Details for the file dl_data_analysis-0.3.tar.gz.

File metadata

  • Download URL: dl_data_analysis-0.3.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.5

File hashes

Hashes for dl_data_analysis-0.3.tar.gz
Algorithm Hash digest
SHA256 1e874c5aa3019667e51552f3e0547f9a7632a0db2f20e21d70644b9a49b5c1c4
MD5 527950a6bd7e59d0bad4ee4150293c09
BLAKE2b-256 7406460156b9589e55cd5b046e29a721076371bc60b6d61002e7d4d155aba85d

See more details on using hashes here.

File details

Details for the file dl_data_analysis-0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for dl_data_analysis-0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 e9e9c4eb885e0ab310a131081b3b1609ceee7c8b1d0bdaed4c6af850d81a1687
MD5 1a9fd21fb754bcedc93e69dcfcb492a8
BLAKE2b-256 3a3985f80d13e9c3ce6802226f9a250816439e19836ac5e7dc11ef1b5e94a0ac

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page