Skip to main content

Package contains a collection of bagging ensemble algorithms for imbalanced data

Project description

ImbBag

Imbalanced Bagging Ensemble Algorithms

Description

ImbBag is a specialized package that integrates a variety of bagging ensemble methods specifically designed for imbalanced data classification. This package provides a scikit-learn-based framework that simplifies the usage of these methods, making it easier for researchers and practitioners to apply them in their work, whether dealing with binary or multi-class classification problems.

Installation

pip install imbbag

Requirements

The following Python packages are required.

  • scikit-learn
  • imblearn
  • PyGAD
  • ARFS
  • mlxtend
  • scikit-learn-intelex
  • ARFS

Also, use Python 3.11

Available Bagging Ensemble Algorithms in the ImbBag Package

  • UnderBagging (UnderBag) - ** Multi-class
  • Exactly Balanced Bagging (EBBag) - ** Binary-class
  • OverBagging (OverBag) - ** Multi-class
  • SMOTE Bagging (SMOTEBag) - ** Multi-class
  • Roughly Balanced Bagging (RBBag) - ** Binary-class
  • Multi-class Roughly Balanced Bagging (MRBBag) - ** Multi-class
  • Bagging Ensemble Variation (BEV) - ** Binary-class
  • Lazy Bagging (LazyBag) - ** Multi-class
  • Multi Random Balance Bagging (MultiRandBalBag) - ** Multi-class
  • Neighborhood Balanced Bagging (NBBag) - ** Binary-class
  • Probability Threshold Bagging (PTBag) - ** Multi-class
  • Adaptive Synthetic Bagging (ADASYNBag) - ** Binary-class
  • RSYN Bagging (RSYNBag) - ** Binary-class
  • Resampling Ensemble Algorithm (REABag) - ** Multi-class
  • Under-bagging K-NN (UnderBagKNN) - ** Multi-class
  • Boundary Bagging (BBag) - ** Multi-class
  • Bagging of Extrapolation-SMOTE SVM (BEBS) - ** Binary-class
  • Evolutionary Under-sampling based Bagging (EUSBag)- ** Binary-class
  • Random Balanced Sampling with Bagging (RBSBag) - ** Multi-class
  • Cost-sensitive Bagging (CostBag) - ** Multi-class

Get Started

Here is an example:

from sklearn.tree import DecisionTreeClassifier
from sklearn. model_selection import train_test_split
from ImbBag import BBag

dataframe = read_csv('dataset.csv')
data = dataframe.values    
X = data[:,:-1]
Y = data[:,:-1]

# split the dataset into training and test sets
X_train ,X_test ,y_train ,y_test = train_test_split (X, y, test_size =0.2)

# instantiate the imbalance bagging classifier, training, prediction 
cls = BBag(estimator = DecisionTreeClassifier(), n_estimator = 50)
clf.fit(X_train , y_train)
y_pred = clf.predict(X_test)

Version History

  • v1.0.0 - Initial release - July 29, 2024

Credits

  • **Yousef Abdi
  • University of Tabriz

License

This project licensed under the MIT License.

Support

Report issues, ask questions, and provide suggestions using:

The project can be accessed at https://github.com/yousefabdi/imbbag

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

imbbag-1.1.tar.gz (19.4 kB view details)

Uploaded Source

Built Distribution

imbbag-1.1-py3-none-any.whl (43.1 kB view details)

Uploaded Python 3

File details

Details for the file imbbag-1.1.tar.gz.

File metadata

  • Download URL: imbbag-1.1.tar.gz
  • Upload date:
  • Size: 19.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for imbbag-1.1.tar.gz
Algorithm Hash digest
SHA256 1e911a0f9d6a2c751023bf5d9966e40b742fbe8de81d695c27d483bf00ced4fc
MD5 c37039456f0b48e302a34a8a55270760
BLAKE2b-256 58f02bc3fa1b9df09051cf07e648754cc150625d7647a67211cab503fd1a17d7

See more details on using hashes here.

File details

Details for the file imbbag-1.1-py3-none-any.whl.

File metadata

  • Download URL: imbbag-1.1-py3-none-any.whl
  • Upload date:
  • Size: 43.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for imbbag-1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a07d439a949d57abb30b107d62cde3f1a5415aa716ff50010d4455bf70f4559f
MD5 0a3236825dd694a552a5f3e3f0fa4335
BLAKE2b-256 8871b3d4a5388fcb221f9c7213bbac344874d2afe6a215af1d20b36b0f2c5312

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page