Skip to main content

No project description provided

Project description

Using HBAC to detect biased data segments

  • Hierarchical Bias-Aware Clustering (HBAC) on regression models.
  • Input: a trained model and a model's test data.
  • Output: analysis of biased/discriminated data segment according to HBAC:
    • Comparing distributions of discriminated and remaining data.
    • Segment predictor: trains a XGBoost binary classifier to evaluate distinguishability of discriminated and remaining data with descriptive features.

alt text

# Initialize HBAC 
hbac = HBAC_analyser()

# In this case, input includes model path, X data and Y data
hbac.hbac_on_model(model_path, X_test, y_test) 

hbac.pca_plot()
discrimated_cluster, bias =  hbac.get_max_bias_cluster(print_results=True)

# Displaying results in dataframes
hbac.clustered_data

# Mean per feature 'discrimnated' cluster vs 'remaining' clusters
hbac.mean_clusters

# Plot 3 most different features' distributions
hbac.plot_distributions(plot_top_features = 3)

# Train XGBoost a binary classifier to predict whther a datapoint will be discrimnated or not, without using error as feature.
hbac.segment_predictor(plot_roc_auc=True,shap_analysis=True)

Also see example.ipynb.

For the use of HBAC on classification models, see https://github.com/Sm2468/msc_thesis/tree/master/hbac%20scripts, on which this project was based.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hbac_bias_detection-0.1.4.tar.gz (849.5 kB view details)

Uploaded Source

Built Distribution

hbac_bias_detection-0.1.4-py2.py3-none-any.whl (1.8 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file hbac_bias_detection-0.1.4.tar.gz.

File metadata

  • Download URL: hbac_bias_detection-0.1.4.tar.gz
  • Upload date:
  • Size: 849.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.2

File hashes

Hashes for hbac_bias_detection-0.1.4.tar.gz
Algorithm Hash digest
SHA256 68c248f2d264dfbeb3e278f8d8f0d98ac8d836590d51ad5c01ab5697785639a7
MD5 c374db1c689817c99b81bca6dfdc0e3d
BLAKE2b-256 119e5391223ea7e64982eda36ce01c367744f796762c2aa7da2bf40236b6180d

See more details on using hashes here.

File details

Details for the file hbac_bias_detection-0.1.4-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for hbac_bias_detection-0.1.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 547196b7f29d196bef105387583a77f278571ab29e02a407cf730fa72651d588
MD5 02fea13c49de66bcbab7e0ef6b4c1571
BLAKE2b-256 38d27ad9208035c0d2f8c7d7be78c37691b5ff842c6d3dd2cee86079c2c442b8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page