Skip to main content

A package to automize some of the steps before modeling and in the modeling stage

Project description

Data Science Core Functionalities

Modules included in this package:

    1. regressor_utils.py
    1. classifier_utils.py
    1. exploratory_data_analyzer.py

1) regressor_utils.py

Containing Regressor class which has certain type of functions that make life easier for regression problems. These functions are quite various for different type of problems. The following functionalities are included in this module:

  • Data splitting
  • Oversampling
  • Experimenting different regression algorithms
  • Training given model
  • Calculating residual difference between the target feature and predicted or calculated feature with visualization
  • Regression plots
  • Regression scoring metrics
  • Quantile regression

2) classifier_utils.py

Having Classifier class which contains a set of functions for modeling ML classification problems in the shortest time. The functions included in the class are quite various, these can be seen as follows:

  • Data splitting
  • Experimenting different regression algorithms
  • Training given model
  • Cross validation score of the given model
  • Confusion matrix visualization
  • Creating a stack model
  • Evaluating model in the test dataset with classification metrics

3) exploratory_data_analyzer.py

This module has EDA_Preprocessor class in it where the class functions serve as a baseline for all kinds of EDA. The functions in this module are including the following analysis tasks:

  • filling missing values in the data
  • showing distributions / counts of the columns
  • dummification of the categorical data columns
  • PCA decomposition of the given data
  • standardization of the data
  • applying transformation function for handling data skewness
  • showing heatmap correlation of the features before modeling
  • checking the correlation of the categorical features compare to target feature
  • feature importances of a default model in the given problem domain

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ds-core-sanpier-0.1.4.tar.gz (15.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ds_core_sanpier-0.1.4-py3-none-any.whl (30.9 kB view details)

Uploaded Python 3

File details

Details for the file ds-core-sanpier-0.1.4.tar.gz.

File metadata

  • Download URL: ds-core-sanpier-0.1.4.tar.gz
  • Upload date:
  • Size: 15.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.7

File hashes

Hashes for ds-core-sanpier-0.1.4.tar.gz
Algorithm Hash digest
SHA256 740aa4ccc8472016b75ccc2358e2198fc95a8c9559015c256f3cc71b506faecd
MD5 ad9f182570c32b3e15cc8beeb2894023
BLAKE2b-256 191b5d11399a6594d981b3d04beed17061c7566bd2529f09b09c4315101fccf4

See more details on using hashes here.

File details

Details for the file ds_core_sanpier-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for ds_core_sanpier-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 2862e075c801b3bf39d55112a9562850b1745df558a5d023077eab51142912ba
MD5 42813c6081211cfd78480ec058301070
BLAKE2b-256 35737bd661541d70245d823dc8f6aec8d38f88e46873b90a7f6eeec8d29aa59f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page