package for dsci310 report
Project description
dsci_310_group_11_pkg
This package contains functions used for generating dsci_310_group_11's report. The main functions inside this package is to split the data, generate pipeline for predictive models, conduct hyperparameter tunning and make graphs.
The main difference between our package to other similar functions is that the functions in our package is designed to improve the efficiency for analyzing the wine data. Using our pipeline generator as example, while directly using functions in sklearns can generate the same model, we simplifies this process by integrating repetitive steps into our functions.
Code Coverage Badge
Installation
- Clone the git repository onto your local machine using:
git clone https://github.com/DSCI-310/dsci-310-group-11-pkg.git
- Navigate to the local repository through the terminal (this may differ based on your home directory setups):
cd dsci-310-group-11-pkg
- Run pip to install the package locally (from pypi):
pip install dsci-310-group-11-pkg
You should now be able to import the package into relevant projects and notebooks.
Usage
The most basic usage of this package is to split the data into training or testing data:
from dsci_310_group_11_pkg.preprocess import preprocessor
df # target dataframe
training_data = preprocessor(df, 0)
testing_data = preprocessor(df, 1)
One of the other usages of dsci_310_group_11_pkg
is to generate pipeline for classification models as follows:
from dsci_310_group_11_pkg.pipeline import pipe_build
x = X_training_data # training features data
y = Y_training data # training label data
model_type = 'lr' # types of model
model = pipe_build(model_type, x, y)
Another usage is to conduct hyperparameter tuning and return it as dataframe for different model as follows:
from dsci_310_group_11_pkg.optimize import hy_optimizer
x = X_training_data # training features data
y = Y_training data # training label data
model_name = 'lr' # types of model
tuning_result = hp_optimize(model_name, x, y)
This package can also be used to generate different graphs for the analysis report, for example:
from dsci_310_group_11_pkg.grapher import correlation_table
ctb = correlation_table(df) # this generate the correlation table for dataframe df
Other functions and more detailed usage can be found in docs/example or in the website documentation of the package.
Contributing
Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.
License
dsci_310_group_11_pkg
was created by DSCI_310_Group_11. It is licensed under the terms of the MIT license.
Credits
dsci_310_group_11_pkg
was created with cookiecutter
and the py-pkgs-cookiecutter
template.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dsci_310_group_11_pkg-0.1.1.tar.gz
.
File metadata
- Download URL: dsci_310_group_11_pkg-0.1.1.tar.gz
- Upload date:
- Size: 7.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.10.9 Darwin/22.3.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fe5612318f89a2c73bf9ae04f8a1442a8c04983a958e5afc0aa89e3b5aef9069 |
|
MD5 | 86ee8f237d47cfe0b045eb0381fa3f5e |
|
BLAKE2b-256 | 127326d03aacb6615dd495e159ce24564ff8b119a41789a048147410b72d2d40 |
File details
Details for the file dsci_310_group_11_pkg-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: dsci_310_group_11_pkg-0.1.1-py3-none-any.whl
- Upload date:
- Size: 9.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.4.2 CPython/3.10.9 Darwin/22.3.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2246dee1baae4dded82bb30aa061c838320f12dc699a1b3d1ee71e1076ea0803 |
|
MD5 | d9371dbb8f694c7a865cbcd9af0cacda |
|
BLAKE2b-256 | 6b5820c06d2a0f66378d13e9017170e628ab239a3f3fb64c4292d50eb7af6446 |