The challenge is to create a model that uses data from the first 24 hours of intensive care to predict patient survival. (Kaggle Proj) https://www.kaggle.com/c/widsdatathon2020/overview
Project description
wids_datathon_2020
The challenge is to create a model that uses data from the first 24 hours of intensive care to predict patient survival. (Kaggle Proj) https://www.kaggle.com/c/widsdatathon2020/overview
How-to Perform Inference
This project provides a publicly-accessible and straight forward way to perform batch or realtime inference based on WiDS Datathon 2020 data.
$ pip install wids-datathon-2020
$ mkdir -p data/external data/raw data/interim data/processed data/predictions models/
$ wget -O data/external/widsdatathon2020.zip https://github.com/iainwo/kaggle/blob/master/wids-datathon-2020/data/external/widsdatathon2020.zip
$ python -m wids_datathon_2020.data.unzip_dataset
$ python -m wids_datathon_2020.learn data/raw/training_v2.csv
$ python -m wids_datathon_2020.inference data/raw/unlabeled.csv
$ head data/predictions/unlabeled.csv
How-to Develop
$ echo 'setup development environment'
$ git clone https://github.com/iainwo/kaggle.git
$ cd wids-datathon-2020/
$ make create_environment
$ conda activate wids_datathon_2020
$ make requirements
$ echo 'make some changes to the wids-datathon-2020 python module'
$ vim my-file.py
$ echo 'use the module'
$ make data
$ make model
$ make predictions
Other Commands
(wids_datathon_2020) talisman-2:wids-datathon-2020 iainwong$ make
Available rules:
clean Delete all compiled Python files
create_environment Set up python interpreter environment
data Make Dataset
data_final Make Dataset for Kaggle Submission
eda Generate visuals for feature EDA
lint Lint using flake8
model Make Model
predictions Make Predictions
requirements Install Python Dependencies
requirements_dev Install Development Deps
sync_data_from_s3 Download Data from S3
sync_data_to_s3 Upload Data to S3
test Run unit tests
test_environment Test python environment is setup correctly
Project Organization
├── LICENSE
├── Makefile <- Makefile with commands like `make data` or `make train`
├── README.md <- The top-level README for developers using this project.
├── data
│ ├── external <- Data from third party sources.
│ ├── interim <- Intermediate data that has been transformed.
│ ├── processed <- The final, canonical data sets for modeling.
│ └── raw <- The original, immutable data dump.
│
├── docs <- A default Sphinx project; see sphinx-doc.org for details
│
├── models <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks <- Jupyter notebooks. Naming convention is a number (for ordering),
│ the creator's initials, and a short `-` delimited description, e.g.
│ `1.0-jqp-initial-data-exploration`.
│
├── references <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports <- Generated analysis as HTML, PDF, LaTeX, etc.
│ └── figures <- Generated graphics and figures to be used in reporting
│
├── requirements.txt <- The requirements file for reproducing the analysis environment, e.g.
│ generated with `pip freeze > requirements.txt`
│
├── setup.py <- makes project pip installable (pip install -e .) so src can be imported
├── src <- Source code for use in this project.
│ ├── __init__.py <- Makes src a Python module
│ │
│ ├── data <- Scripts to download or generate data
│ │ └── make_dataset.py
│ │
│ ├── features <- Scripts to turn raw data into features for modeling
│ │ └── build_features.py
│ │
│ ├── models <- Scripts to train models and then use trained models to make
│ │ │ predictions
│ │ ├── predict_model.py
│ │ └── train_model.py
│ │
│ └── visualization <- Scripts to create exploratory and results oriented visualizations
│ └── visualize.py
│
└── tox.ini <- tox file with settings for running tox; see tox.testrun.org
Project based on the kaggle-data-science project template.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
wids-datathon-2020-0.1.2.tar.gz
(22.7 kB
view hashes)
Built Distribution
Close
Hashes for wids_datathon_2020-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a00e3be7ffef61632274a0459d3b6c01ad3e546f63f85f79eb573e9fbcd4dee5 |
|
MD5 | ade1a5f6b4a030bc14dd1366a8ff33cc |
|
BLAKE2b-256 | 616a51ec419f12cb12afd8ea1afec658aafdd099450b36ec2df9f2188d1ae95d |