Skip to main content

This package is to facilitate model selection in Machine Learning.

Project description

Machine Learning Model Selection

This package aims to facilitate model selection in Machine Learning. It is a common issue that ML practitioners often struggle to decide on the most appropriate model prior to optimization, as tuning hyperparameters can be time-consuming and computationally demanding. To simplify the process, this package enables users to train several machine learning models using their default hyperparameters and compare their performance, helping them determine the most suitable model to select.

Usage

pip install mlms -U

pypi package

Then instantiate and use it like this:

from mlms.ModelSelection import Select_Regressor, Select_Classifier

Select some models to tune, this list should be the abbreviation of models as below, for example

MODELS = ['LGR', 'AB', 'CART', 'GBC', 'XGBC', 'RFC', 'ETC', 'KNN', 'NB', 'SVC', 'MLP', 'SGDC', 'GPC', 'PAC']

df_performance, fitted_classifiers = Select_Classifier('accuracy', 10, X_train, X_test, y_train, y_test, MODELS)

df_performance, fitted_regressors = Select_Classifier('neg_mean_squared_erro', 10, X_train, X_test, y_train, y_test)

For classifiers, the performance can set as accuracy , 'f1_score , precision, recall, roc_auc , balanced_accuracy_score and so on. Available classifiers are below

  • ('LGR', LogisticRegression(n_jobs=-1)),
  • ('AB', AdaBoostClassifier()),
  • ('CART', DecisionTreeClassifier()),
  • ('GBC', GradientBoostingClassifier()),
  • ('XGBC', XGBClassifier()),
  • ('RFC', RandomForestClassifier()),
  • ('ETC', ExtraTreeClassifier()),
  • ('KNN', KNeighborsClassifier(n_jobs=-1)),
  • ('NB', GaussianNB()),
  • ('SVC', SVC()),
  • ('MLP', MLPClassifier()),
  • ('SGDC', SGDClassifier(n_jobs=-1)),
  • ('GPC', GaussianProcessClassifier(n_jobs=-1)),
  • ('PAC', PassiveAggressiveClassifier(n_jobs=-1))

(The charts is an classifier selection example using Iris dataset)

1679444303986

1679443565646

1679443664816

For regressors, the performance can set as r2_score, neg_mean_squared_error and so on. Available regressors are below:

  • ('KNN', KNeighborsRegressor()),
  • ('CART', DecisionTreeRegressor()),
  • ('SVR', SVR()),
  • ('MLP', MLPRegressor()),
  • ('ABR', AdaBoostRegressor()),
  • ('GBR', GradientBoostingRegressor()),
  • ('XGB', XGBRegressor()),
  • ('RFR', RandomForestRegressor()),
  • ('ETR', ExtraTreesRegressor())

1679487197758

Additonally, this package also alow users to plot ROC_Curve

from mlms.plot_roc_curve import Multiclass_ROC_Curve, Binary_ROC_Curve

Multiclass_ROC_Curve(X_test, y_test, fitted_model, chart_title:str)

1679785680784

Binary_ROC_Curve(y_true, y_pred,chart_name:str)

GitHub

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlms-0.11.0.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mlms-0.11.0-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file mlms-0.11.0.tar.gz.

File metadata

  • Download URL: mlms-0.11.0.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.0

File hashes

Hashes for mlms-0.11.0.tar.gz
Algorithm Hash digest
SHA256 e793953d333e9d7ed61c84c5470893705d200fd998eba91a2fd75486152b1039
MD5 de5316a5cee81f40254d9958ed55ec6d
BLAKE2b-256 79a4eb645d548787f4ce44e60b00e177e4543db7e0948c7b5a43b104319687dd

See more details on using hashes here.

File details

Details for the file mlms-0.11.0-py3-none-any.whl.

File metadata

  • Download URL: mlms-0.11.0-py3-none-any.whl
  • Upload date:
  • Size: 8.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.0

File hashes

Hashes for mlms-0.11.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6bebe6891cbcd6b5000e766efc9e26880b53e02fd56b342b8037a7383a8033ac
MD5 42424258b2d54231bd643762109f84eb
BLAKE2b-256 e1b0880bd8403a43ce0800938831ae3d72bc7b4b0388098b1bb81afae0cbe363

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page