Skip to main content

A machine-learning framework for predicting outcomes from time-series history.

Reason this release was yanked:

old version

Project description

Timesias

Forcast outcomes from time-series history. This is the top-performing algorithm for DII National Data Science Challenge.

Installation

Clone this program to your local directory:

git clone https://github.com/GuanLab/sepsis.git

Dependency

For visualization:

Input data format

The example data in the data/ are randomly generated data for the demonstration of the algorithm.

Two types of data is requied for model training and prediction:

  • gs.file: .txt file two columns. The first column is file name index. The second column is the gold standard (0/1), representing the final outbreak of sepsis
0.psv,1
1.psv,1
2.psv,0
3.psv,1
4.psv,0
5.psv,1
6.psv,1
  • *.psv: .psv table files separated by |, which is the time-series feature records. The header of psv file are the feature names. To note, the first column is the time index.
HR|feature_1|feature_2|...|feature_n-1|feature_n
0.0|1|0.0|...|1.3|0.0 
1.0|NaN|0.0|...|0.0|0.0
3.5|NaN|2.3|...|0.0|0.0

Model training and cross validation

timesias -g [GS_FILE_PATH] -t [LAST_N_RECORDS] -f [EXTRA_FEATURES] --shap
  • GS_FILE_PATH: the path to the gold-standard file; for example, /data/gs.file;
  • LAST_N_RECORDS: last n records to use for prediction. default: 16;
  • EXTRA_FEATURES: addtional features used for prediction. default: ['norm', 'std', 'missing_portion', 'baseline'], which are all features we used in DII Data challenge.

also use

 timesias --help

to get instructions on the usage of our program.

This will generate models, which will be saved under a new directory ./models.

Evaluation results during five-fold cross validation will be stored in eva.tsv.

Top feature evaluation

if --shap is indicated, SHAP analysis will be carried out to show top contributing measurements and last nth time points. This will generate an html report (top_feature_report.html) like the following:

The corresponding shap values will be stored in shap_group_by_measurment.csv and shap_group_by_timeslot.csv.

Other applications of this method

This method can be generalized to be used on other hospitalization data. One application of this method is the COVID-19 DREAM Challenge, where this method also achieves top performance.

Reference

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

timesias-0.0.1.tar.gz (3.3 kB view details)

Uploaded Source

Built Distributions

timesias-0.0.1-py3.7.egg (2.9 kB view details)

Uploaded Egg

timesias-0.0.1-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file timesias-0.0.1.tar.gz.

File metadata

  • Download URL: timesias-0.0.1.tar.gz
  • Upload date:
  • Size: 3.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for timesias-0.0.1.tar.gz
Algorithm Hash digest
SHA256 8502e49a5b778b1e85de27a83aedc5cff9256fddb2e07b0e7750bbdb7b4f132a
MD5 1d693f50af2acfdaab84081926b3753a
BLAKE2b-256 956c5c390682f41682ff7e742479669b0cd8f0201955925e4e207a09cb597a71

See more details on using hashes here.

File details

Details for the file timesias-0.0.1-py3.7.egg.

File metadata

  • Download URL: timesias-0.0.1-py3.7.egg
  • Upload date:
  • Size: 2.9 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for timesias-0.0.1-py3.7.egg
Algorithm Hash digest
SHA256 dfffa394763fc1968485b7510dccb2f5c60b73554c867bea2cfc40360482d496
MD5 0cea4b4033878a53a0483c56c82657e4
BLAKE2b-256 d1eb21adcbd24a3af9787aa6f995a230299c8a84b3aaa918a0452ff7b7f9bac8

See more details on using hashes here.

File details

Details for the file timesias-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: timesias-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.5.0.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for timesias-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8d4e987c7146988294686a97bc382e9a2ec854efed88deb1df7f102eb832d55d
MD5 887a310ab7f80c145217e62a00477c2a
BLAKE2b-256 98a0bf24d5a9768870c6d427ab93c91b0dd43934956bd22e7abea9b1f24c7203

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page