Using HBAC to detect biased data segments

  • Hierarchical Bias-Aware Clustering (HBAC) for regression models.
  • Input: a trained model and that model's test data.
  • Output: an analysis of the biased (discriminated) data segment found by HBAC:
    • A comparison of the feature distributions of the discriminated segment and the remaining data.
    • Segment predictor: trains an XGBoost binary classifier on descriptive features to evaluate how well the discriminated segment can be distinguished from the remaining data.
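
To make "biased segment" concrete, the sketch below scores each cluster by the gap between its mean absolute error and that of all remaining data, then picks the cluster with the largest gap. This is an assumption about the bias metric, not the package's actual implementation: the helper names `cluster_bias` and `max_bias_cluster`, the use of absolute error, and the sign convention (positive means the model does worse inside the cluster) are all hypothetical.

```python
from statistics import mean


def cluster_bias(errors, labels, cluster):
    """Mean absolute error inside `cluster` minus that of all other points."""
    inside = [abs(e) for e, c in zip(errors, labels) if c == cluster]
    outside = [abs(e) for e, c in zip(errors, labels) if c != cluster]
    if not inside or not outside:
        raise ValueError("both the cluster and the remainder must be non-empty")
    return mean(inside) - mean(outside)


def max_bias_cluster(errors, labels):
    """Return (cluster, bias) for the cluster with the largest error gap."""
    scores = {c: cluster_bias(errors, labels, c) for c in set(labels)}
    worst = max(scores, key=scores.get)
    return worst, scores[worst]


# Toy data (hypothetical): prediction errors and one cluster label per point.
errors = [0.1, -0.2, 0.1, 2.0, -3.0, 0.3]
labels = [0, 0, 0, 1, 1, 2]

cluster, bias = max_bias_cluster(errors, labels)
print(cluster, round(bias, 3))
```

In this toy data, cluster 1 holds the two large errors, so it is the one reported as the discriminated segment.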


# Initialize HBAC 
hbac = HBAC_analyser()

# In this case, input includes model path, X data and Y data
hbac.hbac_on_model(model_path, X_test, y_test) 

hbac.pca_plot()
discriminated_cluster, bias = hbac.get_max_bias_cluster(print_results=True)

# Displaying results in dataframes
hbac.clustered_data

# Mean per feature: 'discriminated' cluster vs 'remaining' clusters
hbac.mean_clusters

# Plot 3 most different features' distributions
hbac.plot_distributions(plot_top_features = 3)

# Train an XGBoost binary classifier to predict whether a data point will be discriminated against or not, without using the error as a feature.
hbac.segment_predictor(plot_roc_auc=True, shap_analysis=True)
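
The segment predictor is described as a way to evaluate how distinguishable the discriminated segment is from the remaining data. One common summary of distinguishability is ROC AUC, which the `plot_roc_auc` flag suggests is used here, though that is an assumption. The sketch below shows only that scoring step in plain Python, not the XGBoost training. The function `roc_auc` and the toy scores are hypothetical.

```python
def roc_auc(labels, scores):
    """ROC AUC as the chance that a positive outranks a negative (ties count half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# 1 = point belongs to the discriminated segment, 0 = remaining data.
labels = [1, 1, 0, 0, 0]
scores = [0.9, 0.4, 0.5, 0.2, 0.1]  # hypothetical classifier outputs

auc = roc_auc(labels, scores)
print(round(auc, 3))
```

An AUC near 0.5 would mean the descriptive features cannot tell the segment apart from the rest. A value near 1 would mean the segment is easy to characterize from its features alone.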

Also see example.ipynb.
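
The mean-per-feature comparison in the example above can be approximated as follows. This is a minimal sketch, assuming features are on comparable scales (for example, standardized) so that raw mean differences can be ranked against each other. The function `rank_feature_gaps` and the feature names are hypothetical and not part of the package.

```python
from statistics import mean


def rank_feature_gaps(rows, in_segment, top_n=3):
    """Rank features by |mean in segment - mean in remainder|, largest first."""
    gaps = {}
    for name in rows[0]:
        seg = [r[name] for r, flag in zip(rows, in_segment) if flag]
        rest = [r[name] for r, flag in zip(rows, in_segment) if not flag]
        gaps[name] = mean(seg) - mean(rest)
    ranked = sorted(gaps.items(), key=lambda kv: abs(kv[1]), reverse=True)
    return ranked[:top_n]


# Toy standardized data: the first two rows form the discriminated segment.
rows = [
    {"age": 1.0, "income": 0.5, "tenure": 0.0},
    {"age": 1.2, "income": 0.4, "tenure": 0.2},
    {"age": -0.8, "income": 0.5, "tenure": 0.0},
    {"age": -1.0, "income": 0.6, "tenure": -0.2},
]
in_segment = [True, True, False, False]

top = rank_feature_gaps(rows, in_segment, top_n=2)
for name, gap in top:
    print(name, round(gap, 3))
```

The features at the top of this ranking are the natural candidates for the distribution plots.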

For the use of HBAC on classification models, see https://github.com/Sm2468/msc_thesis/tree/master/hbac%20scripts, on which this project was based.
