Skip to main content

package for dsci310 report

Project description

dsci_310_group_11_pkg

This package contains functions used for generating dsci_310_group_11's report. The main functions inside this package is to split the data, generate pipeline for predictive models, conduct hyperparameter tunning and make graphs.

The main difference between our package to other similar functions is that the functions in our package is designed to improve the efficiency for analyzing the wine data. Using our pipeline generator as example, while directly using functions in sklearns can generate the same model, we simplifies this process by integrating repetitive steps into our functions.

Code Coverage Badge

codecov

Installation

  1. Clone the git repository onto your local machine using:
git clone https://github.com/DSCI-310/dsci-310-group-11-pkg.git
  1. Navigate to the local repository through the terminal (this may differ based on your home directory setups):
cd dsci-310-group-11-pkg
  1. Run pip to install the package locally (from pypi):
pip install dsci-310-group-11-pkg

You should now be able to import the package into relevant projects and notebooks.

Usage

The most basic usage of this package is to split the data into training or testing data:

from dsci_310_group_11_pkg.preprocess import preprocessor

df # target dataframe
training_data = preprocessor(df, 0)
testing_data = preprocessor(df, 1)

One of the other usages of dsci_310_group_11_pkg is to generate pipeline for classification models as follows:

from dsci_310_group_11_pkg.pipeline import pipe_build

x = X_training_data # training features data
y = Y_training data # training label data
model_type = 'lr' # types of model

model = pipe_build(model_type, x, y)

Another usage is to conduct hyperparameter tuning and return it as dataframe for different model as follows:

from dsci_310_group_11_pkg.optimize import hy_optimizer

x = X_training_data # training features data
y = Y_training data # training label data
model_name = 'lr' # types of model

tuning_result = hp_optimize(model_name, x, y)

This package can also be used to generate different graphs for the analysis report, for example:

from dsci_310_group_11_pkg.grapher import correlation_table

ctb = correlation_table(df) # this generate the correlation table for dataframe df

Other functions and more detailed usage can be found in docs/example or in the website documentation of the package.

Contributing

Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

License

dsci_310_group_11_pkg was created by DSCI_310_Group_11. It is licensed under the terms of the MIT license.

Credits

dsci_310_group_11_pkg was created with cookiecutter and the py-pkgs-cookiecutter template.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dsci_310_group_11_pkg-0.1.1.tar.gz (7.2 kB view details)

Uploaded Source

Built Distribution

dsci_310_group_11_pkg-0.1.1-py3-none-any.whl (9.3 kB view details)

Uploaded Python 3

File details

Details for the file dsci_310_group_11_pkg-0.1.1.tar.gz.

File metadata

  • Download URL: dsci_310_group_11_pkg-0.1.1.tar.gz
  • Upload date:
  • Size: 7.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.4.2 CPython/3.10.9 Darwin/22.3.0

File hashes

Hashes for dsci_310_group_11_pkg-0.1.1.tar.gz
Algorithm Hash digest
SHA256 fe5612318f89a2c73bf9ae04f8a1442a8c04983a958e5afc0aa89e3b5aef9069
MD5 86ee8f237d47cfe0b045eb0381fa3f5e
BLAKE2b-256 127326d03aacb6615dd495e159ce24564ff8b119a41789a048147410b72d2d40

See more details on using hashes here.

File details

Details for the file dsci_310_group_11_pkg-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for dsci_310_group_11_pkg-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2246dee1baae4dded82bb30aa061c838320f12dc699a1b3d1ee71e1076ea0803
MD5 d9371dbb8f694c7a865cbcd9af0cacda
BLAKE2b-256 6b5820c06d2a0f66378d13e9017170e628ab239a3f3fb64c4292d50eb7af6446

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page