Skip to main content

Testing dataset balancing techniques from previous works.

Project description

Travis CI build SonarCloud Quality SonarCloud Maintainability Codacy Maintainability Maintainability Pypi project

Testing dataset balancing techniques from previous works.

How do I install this package?

As usual, just download it using pip:

pip install miur_daad_balancing

Tests Coverage

Since some software handling coverages sometime get slightly different results, here’s three of them:

Coveralls Coverage SonarCloud Coverage Code Climate Coverate

Usage

Three balancing methods are available for the MIUR-DAAD project:

Umbalanced

This method just leaves the data as-is, and is used more as callback usefull to uniform the pipeline:

from miur_daad_balancing import umbalanced

training, testing = generate_my_data(...)
balanced_training, balanced_testing = umbalanced(training, testing)

Balanced

Applies a maximum threshold to every class in the training set as specified in the default package settings (3000):

from miur_daad_balancing import balanced

training, testing = generate_my_data(...)
balanced_training, balanced_testing = balanced(training, testing)

Full Balanced

Applies a maximum threshold to every class in the training set and balances to some default proportions the testing set:

from miur_daad_balancing import full_balanced

training, testing = generate_my_data(...)
balanced_training, balanced_testing = full_balanced(training, testing)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

miur_daad_balancing-1.0.0.tar.gz (5.8 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page