Skip to main content

Dual-extraction method for phenotypic prediction and functional gene mining

Project description

DEM

Dual-extraction modeling: A multi-modal deep-learning architecture for phenotypic prediction and functional gene mining of complex traits

pypi-badge pypi-badge License docs-badge

Latest news

v0.9.1 is released with a lot of improvements!

Please checkout the tutorials and documentations at cma2015.github.io/DEM.

  • The DEM is implemented in the Python package biodem, which comprises 4 modules: data preprocessing, dual-extraction modeling, phenotypic prediction, and functional gene mining.
  • For more details, please check out our publication. 🖱️Click to copy citation
DEM architecture Modules of biodem

Installation

System requirements

  • Python 3.10 / 3.11 / 3.12.
  • Optional: Hardware accelerator supporting PyTorch.

Recommended: NVIDIA graphics card with 12GB memory or larger.

Install biodem

Conda / Mamba is recommended for installation.

  1. Create a conda environment:

    mamba create -n dem python=3.11
    mamba activate dem
    
    # Install PyTorch with CUDA support
    mamba install pytorch torchvision torchaudio pytorch-cuda=12.4 -c pytorch -c nvidia
    
  2. Install biodem package from PyPI

    pip install biodem
    

Usage

Please checkout the documentations at cma2015.github.io/DEM.



biodem comprises 4 functional modules:

1. Data preprocessing

Nested cross-validation is recommended for data preprocessing.

  • Steps:
    1. Split data into nested cross-validation sets.
    2. Imputation & standardization.
    3. Feature selection using the variance threshold filter and Random Forests.
    4. SNP2Gene transformation.

2. Dual-extraction modeling

  • It takes preprocessed multi-omics data and phenotypic data as inputs. DEM is capable of performing both classification and regression tasks.

3. Phenotypic prediction

  • It loads the trained DEM model checkpoint and performs phenotypic prediction.

4. Functional gene mining

  • It performs functional gene mining based on the trained DEM model through feature ranking by permutation.



Citation

Please cite our paper if you use this package:

@article{renDualextractionModelingMultimodal2024a,
  title = {Dual-Extraction Modeling: {{A}} Multi-Modal Deep-Learning Architecture for Phenotypic Prediction and Functional Gene Mining of Complex Traits},
  shorttitle = {Dual-Extraction Modeling},
  author = {Ren, Yanlin and Wu, Chenhua and Zhou, He and Hu, Xiaona and Miao, Zhenyan},
  year = {2024},
  month = sep,
  journal = {Plant Communications},
  volume = {5},
  number = {9},
  pages = {101002},
  issn = {25903462},
  doi = {10.1016/j.xplc.2024.101002},
  langid = {english}
}

Asking for help

If you have any questions, please contact us via GitHub issues or email us.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biodem-0.9.1.tar.gz (93.7 MB view details)

Uploaded Source

Built Distribution

biodem-0.9.1-py3-none-any.whl (2.4 MB view details)

Uploaded Python 3

File details

Details for the file biodem-0.9.1.tar.gz.

File metadata

  • Download URL: biodem-0.9.1.tar.gz
  • Upload date:
  • Size: 93.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.0

File hashes

Hashes for biodem-0.9.1.tar.gz
Algorithm Hash digest
SHA256 8a9f32d28d45ee7afb57a5bc4a99ba313f22570881fb6bac6ed7412f266bd8c6
MD5 7a0395fa88646a46ff08f2252d60ee61
BLAKE2b-256 7e0d341b0c9c75e7e128b18ebe0c4dbde8f3d8897be6f5d3fda799ef469f6fb1

See more details on using hashes here.

File details

Details for the file biodem-0.9.1-py3-none-any.whl.

File metadata

  • Download URL: biodem-0.9.1-py3-none-any.whl
  • Upload date:
  • Size: 2.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.0

File hashes

Hashes for biodem-0.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1312362b3427d4e4e30fde2cf1abfcf5928c85e5d9d8837e86b9b37a65cb66d3
MD5 6eae68434258552d217996aa706a9992
BLAKE2b-256 b433f7482df107b02ecd3b01d2e046d3b7fc090fd70eb665dd3bcddc419db976

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page