A package containing a collection of bagging ensemble algorithms for imbalanced data

ImbBag

Imbalanced Bagging Ensemble Algorithms

Description

ImbBag is a specialized package that integrates a variety of bagging ensemble methods specifically designed for imbalanced data classification. This package provides a scikit-learn-based framework that simplifies the usage of these methods, making it easier for researchers and practitioners to apply them in their work, whether dealing with binary or multi-class classification problems.

Installation

pip install imbbag

Requirements

The following Python packages are required.

  • scikit-learn
  • imblearn
  • PyGAD
  • ARFS
  • mlxtend
  • scikit-learn-intelex

Also, use Python 3.11.
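
A quick way to check an environment against the list above is sketched below. It uses only the standard library. The mapping from package names to import names (`sklearn`, `pygad`, `arfs`, `sklearnex`) is an assumption based on how these distributions are commonly imported, and `missing_requirements` is a hypothetical helper, not part of ImbBag.

```python
import sys
from importlib.util import find_spec

# Assumed mapping from the package names listed above to their import names.
IMPORT_NAMES = {
    "scikit-learn": "sklearn",
    "imblearn": "imblearn",
    "PyGAD": "pygad",
    "ARFS": "arfs",
    "mlxtend": "mlxtend",
    "scikit-learn-intelex": "sklearnex",
}


def missing_requirements():
    """Return the listed packages whose import name cannot be located."""
    return [name for name, module in IMPORT_NAMES.items() if find_spec(module) is None]


# Report the interpreter version and any packages that still need installing.
print("Python 3.11 in use:", sys.version_info[:2] == (3, 11))
print("Missing packages:", missing_requirements() or "none")
```

Anything reported as missing can then be installed with pip before importing ImbBag.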

Available Bagging Ensemble Algorithms in the ImbBag Package

  • UnderBagging (UnderBag) - Multi-class
  • Exactly Balanced Bagging (EBBag) - Binary-class
  • OverBagging (OverBag) - Multi-class
  • SMOTE Bagging (SMOTEBag) - Multi-class
  • Roughly Balanced Bagging (RBBag) - Binary-class
  • Multi-class Roughly Balanced Bagging (MRBBag) - Multi-class
  • Bagging Ensemble Variation (BEV) - Binary-class
  • Lazy Bagging (LazyBag) - Multi-class
  • Multi Random Balance Bagging (MultiRandBalBag) - Multi-class
  • Neighborhood Balanced Bagging (NBBag) - Binary-class
  • Probability Threshold Bagging (PTBag) - Multi-class
  • Adaptive Synthetic Bagging (ADASYNBag) - Binary-class
  • RSYN Bagging (RSYNBag) - Binary-class
  • Resampling Ensemble Algorithm (REABag) - Multi-class
  • Under-bagging K-NN (UnderBagKNN) - Multi-class
  • Boundary Bagging (BBag) - Multi-class
  • Bagging of Extrapolation-SMOTE SVM (BEBS) - Binary-class
  • Evolutionary Under-sampling based Bagging (EUSBag) - Binary-class
  • Random Balanced Sampling with Bagging (RBSBag) - Multi-class
  • Cost-sensitive Bagging (CostBag) - Multi-class
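
Because some of the algorithms above are listed as binary-only, it can help to check the number of classes in the target before picking one. The sketch below encodes the list as a plain dictionary keyed by the short names. `applicable_algorithms` is a hypothetical helper, not part of ImbBag, and it assumes that the methods marked multi-class can also be applied to two-class problems.

```python
# Class support as listed above: True means the method is marked multi-class.
MULTICLASS_SUPPORT = {
    "UnderBag": True, "EBBag": False, "OverBag": True, "SMOTEBag": True,
    "RBBag": False, "MRBBag": True, "BEV": False, "LazyBag": True,
    "MultiRandBalBag": True, "NBBag": False, "PTBag": True, "ADASYNBag": False,
    "RSYNBag": False, "REABag": True, "UnderBagKNN": True, "BBag": True,
    "BEBS": False, "EUSBag": False, "RBSBag": True, "CostBag": True,
}


def applicable_algorithms(y):
    """Return the short names of the listed methods that fit the labels in y."""
    n_classes = len(set(y))
    if n_classes < 2:
        raise ValueError("at least two classes are needed for classification")
    if n_classes == 2:
        # Assumption: methods marked multi-class also handle the binary case.
        return sorted(MULTICLASS_SUPPORT)
    return sorted(name for name, multi in MULTICLASS_SUPPORT.items() if multi)


# Three distinct labels, so only the methods marked multi-class are returned.
print(applicable_algorithms([0, 0, 0, 1, 2]))
```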

Get Started

Here is an example:

from pandas import read_csv
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split
from ImbBag import BBag

# load the dataset; the last column is assumed to hold the class label
dataframe = read_csv('dataset.csv')
data = dataframe.values
X = data[:, :-1]
y = data[:, -1]

# split the dataset into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)

# instantiate the imbalanced bagging classifier, then train and predict
clf = BBag(estimator=DecisionTreeClassifier(), n_estimator=50)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)
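
On imbalanced data, plain accuracy can look good even when the minority class is mostly misclassified, so a stratified split and class-aware metrics are often a useful addition to the workflow above. The following is a minimal sketch using only scikit-learn. The synthetic dataset and the `BaggingClassifier` stand-in are assumptions made so the snippet runs without ImbBag installed; an ImbBag classifier such as `BBag` would take the stand-in's place.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.metrics import balanced_accuracy_score, classification_report
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic two-class data with roughly a 9:1 class ratio (illustrative only).
X, y = make_classification(
    n_samples=1000, n_features=10, weights=[0.9, 0.1], random_state=0
)

# stratify=y keeps the class ratio the same in the training and test sets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Stand-in estimator; an ImbBag classifier would be substituted here.
clf = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50, random_state=0)
clf.fit(X_train, y_train)
y_pred = clf.predict(X_test)

# Balanced accuracy averages the recall of each class, so the minority class
# counts as much as the majority class.
balanced_acc = balanced_accuracy_score(y_test, y_pred)
print(f"Balanced accuracy: {balanced_acc:.3f}")
print(classification_report(y_test, y_pred))
```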

Version History

  • v1.0.0 - Initial release - July 29, 2024

Credits

  • Yousef Abdi
  • University of Tabriz

License

This project is licensed under the MIT License.

Support

Report issues, ask questions, and provide suggestions through the project's GitHub repository: https://github.com/yousefabdi/imbbag

Download files

Source Distribution

imbbag-1.2.tar.gz (19.6 kB)

Uploaded Source

Built Distribution

imbbag-1.2-py3-none-any.whl (43.1 kB)

Uploaded Python 3

File details

Details for the file imbbag-1.2.tar.gz.

File metadata

  • Download URL: imbbag-1.2.tar.gz
  • Size: 19.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for imbbag-1.2.tar.gz

  • SHA256: c8f7e4d7a612bc1744014f491d80381b1ecf418eda69529e39572a3f395eeed3
  • MD5: fc64d6a42267a32278f09e70802cecb1
  • BLAKE2b-256: ea9e7def6b06ae43fe4219281224759bf80087f7e07d0d08adbc9abdbc0971ad

File details

Details for the file imbbag-1.2-py3-none-any.whl.

File metadata

  • Download URL: imbbag-1.2-py3-none-any.whl
  • Size: 43.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for imbbag-1.2-py3-none-any.whl

  • SHA256: 4300d24d326ee51fa143d92886c9718032f67dd4dcded5907d7881435aa202cf
  • MD5: 1e8c2f65aff86f17c1606457b762b179
  • BLAKE2b-256: d6c221c97c9c0196965e0ec73fc01a3fefc39f98ee3b344dbec6dbb71ebc334a
