Skip to main content

A small package for all useful ML things

Project description

Kowalsky, analysis!

A simple package for handful ML things and more.

What's inside?

  1. analysis - method for evaluation of specified model with given dataframe. With export_test_set=False it exports ready for submission predictions.

  2. df - module for working with dataframe:

    • corr - sort all correlated features.
    • handle_outliers - fill or drop columns with outliers.
    • log_transform - transform columns with log function.
    • group_by_mean - make additional columns with aggregated mean
    • group_by_max - make additional columns with aggregated max
    • group_by_min - make additional columns with aggregated min
    • scale - scale columns with Standard of MinMax scalers
  3. kaggle:

    • submit - make submit-file for kaggle based on sample
  4. metrics:

    • rmse - RMSE scorer
    • rmsle - RMSLE scorer
  5. optuna - handful methods for working with optuna:

    • optimize - optimize model with given dataframe
    • optimize_super_learner - optimize super learner configuration with given set of models and set of heads (meta_model)
  6. colab:

    • csv - read csv file located at Google Drive with specified id
    • path - get path to Google Drive file

Example:

!pip install kowalsky --upgrade
from kowalsky.opt import optimize
optimize('RFR',
         path='../input/project/feed.csv',
         scorer='acc',
         y_label='y_label',
         trials=3000)

Avaliable models:

Gradient Boosts

    'XGBR': XGBRegressor
    'XGBC': XGBClassifier
    'LGBR': LGBMRegressor
    'LGBC': LGBMClassifier

Trees

    'RFR': RandomForestRegressor
    'RFC': RandomForestClassifier
    'DTR': DecisionTreeRegressor
    'DTC': DecisionTreeClassifier
    'ETR': ExtraTreeRegressor
    'ETC': ExtraTreeClassifier

Ensemble

    'BC': BaggingClassifier
    'BR': BaggingRegressor
    'ADAR': AdaBoostRegressor
    'ADAC': AdaBoostClassifier
    'CBR': CatBoostRegressor
    'CBC': CatBoostClassifier

KNeighbors

    'KNC': KNeighborsClassifier
    'KNR': KNeighborsRegressor

SVM

    'SVR': SVR
    'SVC': SVC

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kowalsky-0.0.7.tar.gz (5.6 kB view details)

Uploaded Source

File details

Details for the file kowalsky-0.0.7.tar.gz.

File metadata

  • Download URL: kowalsky-0.0.7.tar.gz
  • Upload date:
  • Size: 5.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/52.0.0 requests-toolbelt/0.9.1 tqdm/4.56.0 CPython/3.8.2

File hashes

Hashes for kowalsky-0.0.7.tar.gz
Algorithm Hash digest
SHA256 e809f740ca59acb295b3902fbddf37f7151cc3be0a833018813ce437a0b41916
MD5 63c5928775ab13ad644ca33583c9b6ca
BLAKE2b-256 a99bb4ab86bbb1afbb2939d16a5d0e6ad719d5fdf1aa895b0d80b11e9b174a91

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page