Data loader for common datasets in Survival Analysis.
Project description
survival-datasets
A simple data loader for the most common datasets in Survival Analysis. Currently the following are included:
- Veterans Lung Cancer (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_veterans_lung_cancer.html)
- German Breast Cancer Study Group (GBSG2) (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_gbsg2.html)
- AIDS dataset (https://scikit-survival.readthedocs.io/en/stable/api/generated/sksurv.datasets.load_aids.html)
- NHANES (https://shap.readthedocs.io/en/latest/generated/shap.datasets.nhanesi.html)
- SUPPORT Study to Understand Prognoses Preferences Outcomes and Risks of Treatment (from DeepSurv paper, https://arxiv.org/abs/1606.00931)
- METABRIC The Molecular Taxonomy of Breast Cancer International Consortium (from DeepSurv paper, https://arxiv.org/abs/1606.00931)
- WHAS500 Worcester Heart Attack Study (https://scikit-survival.readthedocs.io/en/stable/api/datasets.html)
- FLCHAIN (https://scikit-survival.readthedocs.io/en/stable/api/datasets.html)
- SEER (from Kaggle, https://www.kaggle.com/code/jnegrini/breast-cancer-dataset)
Requirements
- Python 3.8 or later
- scikit-survival 0.17.2 or later
- pandas 1.4.3 or later
- numpy 1.22.4 or later
- shap 0.41 or later
- pyarrow 11.0 or later
Installation
Simply install via pip:
pip install survival-datasets
Examples
Import the datasets module from the package and load your dataset of choice:
from survdata import datasets
if __name__ == "__main__":
X, y = datasets.load_seer_dataset()
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
survival-datasets-0.1.5.tar.gz
(263.1 kB
view hashes)
Built Distribution
Close
Hashes for survival_datasets-0.1.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 87f5720816eb2e385b06f6815ff1d1bd8f27efd63b059d3ad51c222467b4ff68 |
|
MD5 | 93b75113836e3096fa90e72655b7f862 |
|
BLAKE2b-256 | eb3b7f7fbfbedca2c991fe122eac753dee27450a2c54a8c9bcf06f94a9992bf2 |