Skip to main content

Crucio is a python sci-kit learn inspired package for class imbalance. It use some classic methods for class balancing taking as parameters a data frame and the target column.

Project description

crucio

Crucio is a python sci-kit learn inspired package for class imbalance. It use some classic methods for class balancing taking as parameters a data frame and the target column.

This version of kydavra has the next methods of feature selection:

  • ADASYN.
  • ICOTE (Immune Centroids Oversampling).
  • MTDF (Mega-Trend Difussion Function).
  • MWMOTE (Majority Weighted Minority Oversampling Technique).
  • SMOTE (Synthetic Minority Oversampling Technique).
  • SMOTENC (Synthetic Minority Over-sampling Technique for Nominal and Continuous).
  • SMOTETOMEK (Synthetic Minority Oversampling Technique + Tomek links for undersampling).
  • SMOTEENN (Synthetic Minority Oversampling Technique + ENN for undersampling).
  • SCUT (SMOTE and Clustered Undersampling Technique).
  • SLS (Safe-Level-Synthetic Minority Over-Sampling TEchnique).
  • TKRKNN (Top-K ReverseKNN).

All these methods takes the pandas Data Frame and y column to balance on.

How to use crucio

To use balancer from crucio you should just import the balancer from crucio in the following framework:

from crucio import SMOTE

class names are written above.Next create a object of this algorithm (I will use SMOTE method as an example).

method = SMOTE()

To balance the dataset on the target column use the ‘balance’ function, using as parameters the pandas Data Frame and the column that you want to balance. Small tip, balance only the training set, not full one.

new_dataframe = method.balance(df, 'target')

Returned value is a new data frame with the target column balanced.

With love from Sigmoid.

We are open for feedback. Please send your impression to vladimir.stojoc@gmail.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crucio-0.1.94.tar.gz (15.1 kB view details)

Uploaded Source

Built Distribution

crucio-0.1.94-py3-none-any.whl (29.4 kB view details)

Uploaded Python 3

File details

Details for the file crucio-0.1.94.tar.gz.

File metadata

  • Download URL: crucio-0.1.94.tar.gz
  • Upload date:
  • Size: 15.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.2

File hashes

Hashes for crucio-0.1.94.tar.gz
Algorithm Hash digest
SHA256 ad0f39c12a11f8f4be028dd02f1ae324fc85f5cabd55f344b3a963d1a3d0c547
MD5 b3641b6be07d3ce31398674cf87deaf6
BLAKE2b-256 7c4635b8b2d5798e3a37b7df07f81abe2bb93413f4c1feed0414ccff3325720a

See more details on using hashes here.

File details

Details for the file crucio-0.1.94-py3-none-any.whl.

File metadata

  • Download URL: crucio-0.1.94-py3-none-any.whl
  • Upload date:
  • Size: 29.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.2

File hashes

Hashes for crucio-0.1.94-py3-none-any.whl
Algorithm Hash digest
SHA256 73c20ccc1544635de7f9765066b047683f70b66d1915bb6cdc69086a3924aa2e
MD5 278e3d757f21a037605072c2e8b4b166
BLAKE2b-256 c091a65f97f6f8e980901842421070a775dca64ec9d168a65535499ead5dec5b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page