Pipeline for building clinical outcome prediction models on training dataset and transfer learning on validation datasets.
Project description
Ciclops
Cross-platform training In CLinical Outcome PredictionS (ciclops) is the winning algorithm in 2019 Malaria DREAM Challenge SubChallenge 2.
Ciclops performs transfer learning from one transcriptomic platform's samples to another.
Installation
Install this package via pip:
pip install ciclops
or clone this program to your local directory:
https://github.com/GuanLab/ciclops.git
Usage
python ciclops [-h] [--train_path TRAIN_PATH] [--valid_path VALID_PATH]
[-m MODEL_TYPE] [--no_quantile] [--shap] [-n TOP_GENES]
Pipeline for building clinical outcome prediction models on training dataset and transfer learning on validation datasets.
optional arguments:
-h, --help show this help message and exit
--train_path TRAIN_PATH
Path to your training data, in .csv format; includes sample names as first column and labels as last column
--valid_path VALID_PATH
Path to your transfer validation data, in .csv format; includes sample names as first column and labels as last column
-m MODEL_TYPE, --model_type MODEL_TYPE
Machine learning models to use:
lgb: LightGBM;
xgb: XGBoost;
rf: Random Forest;
gpr: Gaussian Process Regression;
lr: Linear Regression;
default: lgb
--no_quantile If specified, do not use quantile normalization.
--shap Conduct SHAP analysis on the training and validation set.
Only for use with LightGBM, XGBoost, and Random Forest.
-n TOP_GENES, --top_genes TOP_GENES
If --shap is specified, indicate number of top genes from both training and validation sets that will be compared in post-SHAP analysis.
Default is 20.
It will generate the following folders:
./training/
: preprocessed training datasets for model training and 10-fold cross validation
./validation/
: validation dataset for transferring test
./params/
: trained machine learning model parameters
./performance/
: model performance in 10-fold cross validation and transferring test
./SHAP/
: SHAP analysis results
References
- For the original paper, please refer to the Guan Lab's 2022 iScience paper: Machine learning for artemisinin resistance in malaria treatment across in vivo-in vitro platforms.
- STAR Protocol (TBD)
- External data for testing/example purposes:
- Shaw, P.J. et al. (2015) ‘Plasmodium parasites mount an arrest response to dihydroartemisinin, as revealed by whole transcriptome shotgun sequencing (RNA-seq) and microarray study’, BMC Genomics. doi:10.1186/s12864-015-2040-0.
- GSE59098
- GSE151189
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file ciclops-0.0.4.tar.gz
.
File metadata
- Download URL: ciclops-0.0.4.tar.gz
- Upload date:
- Size: 12.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 80202caba11f94a73e35453006ba9495ed6b45ca2655a558d4de312d7aedabf2 |
|
MD5 | f232bdeaba3f1374445949a3f90c1bf5 |
|
BLAKE2b-256 | ed3593901955e2933f9fcce59ef27c89a667bf0a601432eaa538501dd016b453 |
File details
Details for the file ciclops-0.0.4-py3-none-any.whl
.
File metadata
- Download URL: ciclops-0.0.4-py3-none-any.whl
- Upload date:
- Size: 13.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 94e878a490baf0e0000d22e63a16d1f5bd969ea826ea70f72af934b693059fcb |
|
MD5 | 84ad9208df86601224bb7f7ed506b625 |
|
BLAKE2b-256 | 4a97c9bb381967ea2d275ef4f3409411f212ae20ee779879a9e8cd8aa734f795 |