A machine-learning framework for predicting outcomes from time-series history.

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Timesias

Forcast outcomes from time-series history. This is the top-performing algorithm for DII National Data Science Challenge.

Installation

Install this package via pip:

pip install timesias

or clone this program to your local directory:

git clone https://github.com/GuanLab/timesias.git

Dependency

For visualization:

boken (2.3.0)

Input data format

The example data in the data/ are randomly generated data for the demonstration of the algorithm.

Two types of data is requied for model training and prediction:

gs.file: gold standard file with two columns. The first column is paths for time-series records. The second column is the gold standard (0/1), representing the final outbreak of sepsis

./data/0.psv,1
./data/1.psv,1
./data/2.psv,0
./data/3.psv,1
./data/4.psv,0
./data/5.psv,1
./data/6.psv,1

*.psv: time series record files. .psv table files separated by |, which are the time-series records. The header of psv file are the feature names. To note, the first column is the time index.

HR|feature_1|feature_2|...|feature_n-1|feature_n
0.0|1|0.0|...|1.3|0.0 
1.0|NaN|0.0|...|0.0|0.0
3.5|NaN|2.3|...|0.0|0.0

Model training and cross validation

timesias -g [GS_FILE_PATH] -t [LAST_N_RECORDS] -f [EXTRA_FEATURES] -e [EVA_METRICS] --shap

GS_FILE_PATH: the path to the gold-standard file; for example, /data/gs.file;
LAST_N_RECORDS: last n records to use for prediction. default: 16;
EXTRA_FEATURES: addtional features used for prediction. default: ['norm', 'std', 'missing_portion', 'baseline'], which are all features we used in DII Data challenge.
EVA_METRICS: evaluation metrics to use. Available choices: auroc auprc cindex pearsonr spearmanr. For binary classification, AUROC and AUPRC are recommended; for regression, we recommend: C-index, Pearsonr and Spearmanr. default: AUROC AUPRC

also use:

 timesias --help

to get instructions on the usage of our program.

The above one-line command will yield the following results automatically:

./models.: where all hyperparameters of trained models will be saved.
./results: where all results mentioned below will be stored:
1. eva.tsv: Evaluation results during five-fold cross validation.
2. all results from top feature evaluations if --shap is used. the details will be mentioned in the next section.

Top feature evaluation

if --shap is indicated, SHAP analysis will be carried out to show top contributing measurements and last nth time points. This will generate an html report (./results/top_feature_report.html) like the following:

The corresponding shap values will be stored in ./results/shap_group_by_measurment.csv and ./results/shap_group_by_timeslot.csv.

Other applications of this method

This method can be generalized to be used on other hospitalization data. One application of this method is the COVID-19 DREAM Challenge, where this method also achieves top performance.

Reference

For citation, please refer to our latest iScience paper: Assessment of the timeliness and robustness for predicting adult sepsis.
For protocol(TBD)

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.0.4

May 13, 2021

0.0.1 yanked

Mar 24, 2021

Reason this release was yanked:

old version

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

timesias-0.0.4.tar.gz (4.4 kB view details)

Uploaded May 13, 2021 Source

File details

Details for the file timesias-0.0.4.tar.gz.

File metadata

Download URL: timesias-0.0.4.tar.gz
Upload date: May 13, 2021
Size: 4.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.5.0.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.7.4

File hashes

Hashes for timesias-0.0.4.tar.gz
Algorithm	Hash digest
SHA256	`57d4a58d3af997b2aeebca3b2f6197564f44e1fdc0488b2769b0aab144cf86d2`
MD5	`08dbb360eefb52353d584bfa115a3f63`
BLAKE2b-256	`993912379cdbd56410c94d2de94b3c32e197d74689e6a8e8ceabdcde1e5eebd4`

See more details on using hashes here.

timesias 0.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Timesias

Installation

Dependency

Input data format

Model training and cross validation

Top feature evaluation

Other applications of this method

Reference

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes