Dual-extraction method for phenotypic prediction and functional gene mining
Project description
DEM
Dual-extraction modeling: A multi-modal deep-learning architecture for phenotypic prediction and functional gene mining of complex traits
Latest news
v0.9.1 is released with a lot of improvements!
Please checkout the tutorials and documentations at cma2015.github.io/DEM.
- The DEM is implemented in the Python package
biodem
, which comprises 4 modules: data preprocessing, dual-extraction modeling, phenotypic prediction, and functional gene mining. - For more details, please check out our publication. 🖱️Click to copy citation
Installation
System requirements
- Python 3.10 / 3.11 / 3.12.
- Optional: Hardware accelerator supporting PyTorch.
Recommended: NVIDIA graphics card with 12GB memory or larger.
Install biodem
-
Create a conda environment:
mamba create -n dem python=3.11 mamba activate dem # Install PyTorch with CUDA support mamba install pytorch torchvision torchaudio pytorch-cuda=12.4 -c pytorch -c nvidia
-
Install biodem package from PyPI
pip install biodem
Usage
Please checkout the documentations at cma2015.github.io/DEM.
biodem
comprises 4 functional modules:
1. Data preprocessing
Nested cross-validation is recommended for data preprocessing.
- Steps:
- Split data into nested cross-validation sets.
- Imputation & standardization.
- Feature selection using the variance threshold filter and Random Forests.
- SNP2Gene transformation.
2. Dual-extraction modeling
- It takes preprocessed multi-omics data and phenotypic data as inputs. DEM is capable of performing both classification and regression tasks.
3. Phenotypic prediction
- It loads the trained DEM model checkpoint and performs phenotypic prediction.
4. Functional gene mining
- It performs functional gene mining based on the trained DEM model through feature ranking by permutation.
Citation
Please cite our paper if you use this package:
@article{renDualextractionModelingMultimodal2024a,
title = {Dual-Extraction Modeling: {{A}} Multi-Modal Deep-Learning Architecture for Phenotypic Prediction and Functional Gene Mining of Complex Traits},
shorttitle = {Dual-Extraction Modeling},
author = {Ren, Yanlin and Wu, Chenhua and Zhou, He and Hu, Xiaona and Miao, Zhenyan},
year = {2024},
month = sep,
journal = {Plant Communications},
volume = {5},
number = {9},
pages = {101002},
issn = {25903462},
doi = {10.1016/j.xplc.2024.101002},
langid = {english}
}
Asking for help
If you have any questions, please contact us via GitHub issues or email us.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file biodem-0.9.1.tar.gz
.
File metadata
- Download URL: biodem-0.9.1.tar.gz
- Upload date:
- Size: 93.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8a9f32d28d45ee7afb57a5bc4a99ba313f22570881fb6bac6ed7412f266bd8c6 |
|
MD5 | 7a0395fa88646a46ff08f2252d60ee61 |
|
BLAKE2b-256 | 7e0d341b0c9c75e7e128b18ebe0c4dbde8f3d8897be6f5d3fda799ef469f6fb1 |
File details
Details for the file biodem-0.9.1-py3-none-any.whl
.
File metadata
- Download URL: biodem-0.9.1-py3-none-any.whl
- Upload date:
- Size: 2.4 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1312362b3427d4e4e30fde2cf1abfcf5928c85e5d9d8837e86b9b37a65cb66d3 |
|
MD5 | 6eae68434258552d217996aa706a9992 |
|
BLAKE2b-256 | b433f7482df107b02ecd3b01d2e046d3b7fc090fd70eb665dd3bcddc419db976 |