Using HBAC to detect biased data segments

  • Hierarchical Bias-Aware Clustering (HBAC) for regression models.
  • Input: a trained model and its test data.
  • Output: an analysis of the biased (discriminated) data segment found by HBAC:
    • A comparison of the feature distributions of the discriminated data and the remaining data.
    • Segment predictor: trains an XGBoost binary classifier on descriptive features to evaluate how well the discriminated data can be distinguished from the remaining data.
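The idea of a "discriminated" cluster can be illustrated with a small sketch. It assumes that the bias of a cluster is the mean absolute error inside the cluster minus the mean absolute error over the remaining data; this definition and the helper names `cluster_bias` and `max_bias_cluster` are assumptions for illustration, not the package's own implementation. The cluster labels are taken as given here, whereas HBAC derives them by hierarchical clustering.

```python
from statistics import mean


def cluster_bias(abs_errors, labels, cluster):
    """Mean absolute error inside `cluster` minus the mean over all other points.

    Assumed bias definition for illustration; the package may use another metric.
    """
    inside = [e for e, lab in zip(abs_errors, labels) if lab == cluster]
    outside = [e for e, lab in zip(abs_errors, labels) if lab != cluster]
    if not inside or not outside:
        raise ValueError("cluster and remainder must both be non-empty")
    return mean(inside) - mean(outside)


def max_bias_cluster(abs_errors, labels):
    """Return (cluster, bias) for the cluster with the largest bias."""
    biases = {c: cluster_bias(abs_errors, labels, c) for c in set(labels)}
    worst = max(biases, key=biases.get)
    return worst, biases[worst]


# Toy data: absolute errors of a regression model and one cluster label per point.
abs_errors = [0.5, 0.5, 2.0, 4.0, 1.0, 1.0]
labels = [0, 0, 1, 1, 2, 2]

worst, bias = max_bias_cluster(abs_errors, labels)
print(worst, bias)  # prints: 1 2.25
```

A positive bias means the model performs worse on that cluster than on the rest of the data, which is why the cluster with the largest value is the one singled out for further analysis.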

# Initialize HBAC 
hbac = HBAC_analyser()

# In this case, input includes model path, X data and Y data
hbac.hbac_on_model(model_path, X_test, y_test) 

hbac.pca_plot()
discriminated_cluster, bias = hbac.get_max_bias_cluster(print_results=True)

# Displaying results in dataframes
hbac.clustered_data

# Mean per feature: 'discriminated' cluster vs 'remaining' clusters
hbac.mean_clusters

# Plot 3 most different features' distributions
hbac.plot_distributions(plot_top_features = 3)

# Train an XGBoost binary classifier to predict whether a datapoint will be discriminated or not, without using the error as a feature.
hbac.segment_predictor(plot_roc_auc=True,shap_analysis=True)
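As a rough illustration of what comparing the discriminated cluster with the remaining data involves, the sketch below ranks features by the difference in means between the two groups, scaled by each feature's overall standard deviation. The scaling choice, the function name `rank_feature_differences`, and the toy feature names are assumptions made for this example; the source does not state how `plot_distributions` selects its top features.

```python
from statistics import mean, pstdev


def rank_feature_differences(rows, in_cluster, feature_names):
    """Sort features by |mean(cluster) - mean(rest)| / std(all), largest first."""
    ranked = []
    for j, name in enumerate(feature_names):
        column = [row[j] for row in rows]
        inside = [v for v, flag in zip(column, in_cluster) if flag]
        outside = [v for v, flag in zip(column, in_cluster) if not flag]
        # Constant features have zero spread; fall back to 1.0 to avoid dividing by zero.
        spread = pstdev(column) or 1.0
        ranked.append((name, (mean(inside) - mean(outside)) / spread))
    return sorted(ranked, key=lambda item: abs(item[1]), reverse=True)


# Toy data with hypothetical features; the flags mark the discriminated cluster.
feature_names = ["age", "income", "tenure"]
rows = [
    (30, 10, 5),
    (32, 12, 5),
    (31, 30, 5),
    (29, 32, 5),
]
in_cluster = [False, False, True, True]

ranked = rank_feature_differences(rows, in_cluster, feature_names)
for name, diff in ranked:
    print(f"{name}: {diff:+.2f}")
```

In this toy data `income` separates the two groups most strongly, so it comes first in the ranking, while the constant `tenure` feature comes last with a difference of zero.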

Also see example.ipynb.

For the use of HBAC on classification models, see https://github.com/Sm2468/msc_thesis/tree/master/hbac%20scripts, on which this project was based.
