Transcendent adaptation for multiclass problems
Project description
Transcendent Multiclass
This repository enables users to apply Transcendent-like concept drift detection to both binary and multiclass problems.
Modifications have been made specifically to the ICE (Inductive Conformal Evaluator) implementation, while the other solutions (i.e. TCE, CCE, etc.) are out of the scope.
This project adapts Transcendent for multiclass problems by implementing two Nonconformity Measures (NCM) for Random forest and LightGBM classifiers.
Prerequisites
-
Setup the train/test split directory, which should contains the following files:
time_split/ ├── X_train.pkl ├── X_test.pkl ├── X_proper_train.pkl ├── X_cal.pkl ├── y_train.pkl ├── y_test.pkl ├── y_proper_train.pkl └── y_cal.pkl
-
Make sure to have a running and active version of Docker.
Usage:
-
Clone the repository and change directory:
git clone git@github.com:w-disaster/transcendent-multiclass.git && cd transcendent-multiclass
-
Configure the env variables and Run Inductive Conformal Evaluator:
PE_DATASET_NAME=<YOUR_PE_DATASET_NAME> SPLITTED_MPH_DATASET_PATH=<YOUR_PRE_SPLITTED_DATA> BEST_HYP_DIR=<YOUR_BEST_HYP_DIR> # Based on format produced by overfitting-analysis docker run -d \ --name mph-feature-extraction-$PE_DATASET_NAME \ -e BASE_DATASET_PATH=/usr/app/dataset/ \ -e PE_DATASET_TYPE=${PE_DATASET_NAME}_mph \ -e SPLITTED_MPH_DATASET_PATH=/usr/input_data/splitted_dataset/ \ -e BEST_HYP_DIR=/usr/input_data/best_hyp/ \ -e FEATURE_TYPE=dts \ -v $BEST_HYP_DIR:/usr/input_data/best_hyp/ \ -v $SPLITTED_MPH_DATASET_PATH:/usr/input_data/splitted_dataset/ \ -v ./results_multiclass/:/usr/app/models/ \ ghcr.io/malware-concept-drift-detection/transcendent-multiclass:main
A
results_multiclass/directory will be locally created containing the credibility ($p$-values) and confidence scores for both calibration and testing sets. -
Analysis post ICE:
Check whether novel families in the testing set produce smaller $p$-values, and thus can be discriminated from seen families.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file transcendent_multiclass_cdd_wdis-1.1.1.tar.gz.
File metadata
- Download URL: transcendent_multiclass_cdd_wdis-1.1.1.tar.gz
- Upload date:
- Size: 14.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1018-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e7fee7d1aac172d9a3821ee2ed2d5bc8634cbe26778c3a983574bfc9362cd3f8
|
|
| MD5 |
7da98c25290a5f879f3cfaaff7411261
|
|
| BLAKE2b-256 |
39687232c965e779425a15fc904813dd31cfc4abd4787b30e7e25eb604573890
|
File details
Details for the file transcendent_multiclass_cdd_wdis-1.1.1-py3-none-any.whl.
File metadata
- Download URL: transcendent_multiclass_cdd_wdis-1.1.1-py3-none-any.whl
- Upload date:
- Size: 18.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.3 Linux/6.11.0-1018-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
46bbdcd7756725cc1d22d742498b4454eef646a11d0d4f07f8c2521961abb73e
|
|
| MD5 |
ff612ded24a74df356da1c8aca3625ab
|
|
| BLAKE2b-256 |
197eb8b93d0c6c522f0a816085ca87bec7210630d4c62bc1a10153595c7623e6
|