A machine-learning framework for predicting outcomes from time-series history.
Reason this release was yanked:
old version
Project description
Timesias
Forcast outcomes from time-series history. This is the top-performing algorithm for DII National Data Science Challenge.
Installation
Clone this program to your local directory:
git clone https://github.com/GuanLab/sepsis.git
Dependency
For visualization:
Input data format
The example data in the data/ are randomly generated data for the demonstration of the algorithm.
Two types of data is requied for model training and prediction:
gs.file
:.txt
file two columns. The first column is file name index. The second column is the gold standard (0/1), representing the final outbreak of sepsis
0.psv,1
1.psv,1
2.psv,0
3.psv,1
4.psv,0
5.psv,1
6.psv,1
*.psv
:.psv
table files separated by|
, which is the time-series feature records. The header of psv file are the feature names. To note, the first column is the time index.
HR|feature_1|feature_2|...|feature_n-1|feature_n
0.0|1|0.0|...|1.3|0.0
1.0|NaN|0.0|...|0.0|0.0
3.5|NaN|2.3|...|0.0|0.0
Model training and cross validation
timesias -g [GS_FILE_PATH] -t [LAST_N_RECORDS] -f [EXTRA_FEATURES] --shap
GS_FILE_PATH
: the path to the gold-standard file; for example,/data/gs.file
;LAST_N_RECORDS
: last n records to use for prediction. default: 16;EXTRA_FEATURES
: addtional features used for prediction. default: ['norm', 'std', 'missing_portion', 'baseline'], which are all features we used in DII Data challenge.
also use
timesias --help
to get instructions on the usage of our program.
This will generate models, which will be saved under a new directory ./models
.
Evaluation results during five-fold cross validation will be stored in eva.tsv
.
Top feature evaluation
if --shap
is indicated, SHAP analysis will be carried out to show top contributing measurements and last nth time points. This will generate an html report (top_feature_report.html
) like the following:
The corresponding shap values will be stored in shap_group_by_measurment.csv
and shap_group_by_timeslot.csv
.
Other applications of this method
This method can be generalized to be used on other hospitalization data. One application of this method is the COVID-19 DREAM Challenge, where this method also achieves top performance.
Reference
- For citation, please refer to our latest iScience paper: Assessment of the timeliness and robustness for predicting adult sepsis.
- For protocol(TBD)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
File details
Details for the file timesias-0.0.1.tar.gz
.
File metadata
- Download URL: timesias-0.0.1.tar.gz
- Upload date:
- Size: 3.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
8502e49a5b778b1e85de27a83aedc5cff9256fddb2e07b0e7750bbdb7b4f132a
|
|
MD5 |
1d693f50af2acfdaab84081926b3753a
|
|
BLAKE2b-256 |
956c5c390682f41682ff7e742479669b0cd8f0201955925e4e207a09cb597a71
|
File details
Details for the file timesias-0.0.1-py3.7.egg
.
File metadata
- Download URL: timesias-0.0.1-py3.7.egg
- Upload date:
- Size: 2.9 kB
- Tags: Egg
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
dfffa394763fc1968485b7510dccb2f5c60b73554c867bea2cfc40360482d496
|
|
MD5 |
0cea4b4033878a53a0483c56c82657e4
|
|
BLAKE2b-256 |
d1eb21adcbd24a3af9787aa6f995a230299c8a84b3aaa918a0452ff7b7f9bac8
|
File details
Details for the file timesias-0.0.1-py3-none-any.whl
.
File metadata
- Download URL: timesias-0.0.1-py3-none-any.whl
- Upload date:
- Size: 3.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
8d4e987c7146988294686a97bc382e9a2ec854efed88deb1df7f102eb832d55d
|
|
MD5 |
887a310ab7f80c145217e62a00477c2a
|
|
BLAKE2b-256 |
98a0bf24d5a9768870c6d427ab93c91b0dd43934956bd22e7abea9b1f24c7203
|