EIR-auto-GP

EIR-auto-GP: Automated genomic prediction (GP) using deep learning models with EIR.

WARNING: This project is in the alpha phase. Expect backwards-incompatible changes, including API changes.

Overview

EIR-auto-GP is a comprehensive framework for genomic prediction (GP) tasks, built on top of the EIR deep learning framework. It streamlines data preparation, model training, and model evaluation on genomic data, automating much of the path from raw input files to results analysis. Key features include:

  • Support for .bed/.bim/.fam PLINK files as input data.
  • Automated data processing and train/test splitting.
  • SNP feature selection using GWAS, deep learning-based attributions, or a combination of both.
  • Ensemble prediction from multiple training runs.
  • Analysis and visualization of results.
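
As a rough illustration of the expected input, a PLINK 1 binary fileset consists of three files sharing one prefix: .bed (genotypes), .bim (one line per variant) and .fam (one line per sample). The sketch below checks that all three exist and counts samples and variants. The helper names and the `data/genotypes` prefix are hypothetical and are not part of the EIR-auto-GP API.

```python
from pathlib import Path

# The three files that make up a PLINK 1 binary fileset.
PLINK_SUFFIXES = (".bed", ".bim", ".fam")


def missing_plink_files(prefix):
    """Return the fileset members that do not exist for the given prefix."""
    candidates = [Path(f"{prefix}{suffix}") for suffix in PLINK_SUFFIXES]
    return [path for path in candidates if not path.is_file()]


def count_records(path):
    """Count non-empty lines: one per variant in .bim, one per sample in .fam."""
    with open(path, "r", encoding="utf-8") as handle:
        return sum(1 for line in handle if line.strip())


def summarize_fileset(prefix):
    """Return (n_samples, n_variants), or raise if the fileset is incomplete."""
    missing = missing_plink_files(prefix)
    if missing:
        names = ", ".join(str(path) for path in missing)
        raise FileNotFoundError(f"Incomplete PLINK fileset, missing: {names}")
    return count_records(f"{prefix}.fam"), count_records(f"{prefix}.bim")
```

For example, `summarize_fileset("data/genotypes")` would read `data/genotypes.fam` and `data/genotypes.bim` after confirming that `data/genotypes.bed` is also present.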

Installation

First, ensure that plink2 is installed and available in your PATH.

Then, install EIR-auto-GP using pip:

pip install eir-auto-gp
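
Before running anything, it can help to confirm that the plink2 executable is actually discoverable. The following is a minimal sketch using only the Python standard library; the function name `find_plink2` is hypothetical and not part of EIR-auto-GP.

```python
import shutil
from typing import Optional


def find_plink2(executable: str = "plink2") -> Optional[str]:
    """Return the full path of the executable if it is on PATH, else None."""
    return shutil.which(executable)


if __name__ == "__main__":
    path = find_plink2()
    if path is None:
        print("plink2 was not found on PATH; install it before continuing.")
    else:
        print(f"plink2 found at: {path}")
```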

Usage

Please refer to the Documentation for examples and information.

Workflow

  1. Data processing: EIR-auto-GP processes the input .bed/.bim/.fam PLINK files and .csv label file, preparing the data for model training and evaluation.
  2. Train/test split: The processed data is automatically split into training and testing sets, with the option of manually specifying splits.
  3. Training: A configurable number of training runs is set up and executed using EIR's deep learning models.
  4. SNP feature selection: GWAS-based feature selection, deep learning-based feature selection with Bayesian optimization, and mixed strategies are supported.
  5. Test set prediction: Predictions are made on the test set using all training run folds.
  6. Ensemble prediction: An ensemble prediction is created from the individual predictions.
  7. Results analysis: Performance metrics, visualizations, and analysis are generated to assess the model's performance.
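
To make steps 2 and 6 above more concrete, here is a conceptual sketch in plain Python. It assumes a seeded random split of sample IDs and a simple per-sample mean across runs as the ensemble. The source does not state how EIR-auto-GP splits the data or combines predictions, so both choices, along with the function names `split_ids` and `ensemble_mean`, are illustrative assumptions rather than the package's implementation.

```python
import random
from statistics import fmean


def split_ids(sample_ids, test_fraction=0.1, seed=0):
    """Randomly split sample IDs into (train, test) lists, reproducibly."""
    shuffled = list(sample_ids)
    random.Random(seed).shuffle(shuffled)
    # Keep at least one test sample (assumed convention for this sketch).
    n_test = max(1, round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def ensemble_mean(predictions_per_run):
    """Average predictions per sample across runs (assumed ensembling rule).

    predictions_per_run: one list of per-sample predictions for each run,
    all in the same sample order.
    """
    if not predictions_per_run:
        raise ValueError("At least one run is required.")
    n_samples = len(predictions_per_run[0])
    if any(len(run) != n_samples for run in predictions_per_run):
        raise ValueError("All runs must predict the same number of samples.")
    return [fmean(per_sample) for per_sample in zip(*predictions_per_run)]
```

Calling `split_ids` twice with the same seed yields the same split, which keeps the test set fixed across training runs. `ensemble_mean` then reduces the per-run test predictions to one prediction per sample.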

Citation

If you use EIR-auto-GP in a scientific publication, we would appreciate it if you used the following citation:

@article{sigurdsson2021deep,
  title={Deep integrative models for large-scale human genomics},
  author={Sigurdsson, Arnor Ingi and Westergaard, David and Winther, Ole and Lund, Ole and Brunak, S{\o}ren and Vilhjalmsson, Bjarni J and Rasmussen, Simon},
  journal={bioRxiv},
  year={2021},
  publisher={Cold Spring Harbor Laboratory}
}
