Skip to main content

GWAS software that combines traditional statistical methods with the power of Artificial Intelligence

Project description

About GWAStic

GWAStic is a software for Genome-Wide Association Study (GWAS) that combines traditional statistical methods with the power of Artificial Intelligence (AI) for comprehensive genetic analysis. It supports genomic prediction and offers diverse statistical outputs for phenotypic and genotypic data, making it an essential tool for researchers aiming to uncover complex genetic interactions and predict phenotypic outcomes efficiently.

Installation

ALT TEXT

ALT TEXT

ALT TEXT

Table of Contents

Key Features:

  • Cross Platform

  • Comprehensive Genetic Analysis: GWAStic offers a wide range of methods to analyze your genomic data, allowing you to explore the associations between genetic variants and traits of interest comprehensively.

  • AI-Enhanced Data Analysis: Harness the capabilities of machine learning and AI to uncover subtle patterns, interactions, and associations that may be missed by conventional statistical methods.

  • Genomic Prediction: Take your research to the next level by using GWAStic's advanced AI models for genomic prediction. Predict future health outcomes, disease risks, or phenotypic traits based on your genetic data and environmental factors.

  • User-Friendly Interface: GWAStic's intuitive interface makes it accessible to both novice and experienced researchers. Seamlessly navigate through your data, perform analyses, and visualize results with ease.

  • Customizable Workflows: Tailor your analysis to your specific research goals with customizable workflows. Define your parameters, select the appropriate statistical models, and integrate AI components as needed for a personalized analysis experience.

  • Collaborative Research: Collaborate seamlessly with colleagues and share your findings securely within the platform.

  • Frequent Updates: Stay at the forefront of genetic research with regular software updates. GWAStic incorporates the latest advancements in GWAS and AI methodologies to keep your analyses up-to-date.

myfile

1. Installation

GWAStic software was build and successfully tested on Windows operating system (Windows 7 and 10).

[!TIP] Video demonstration https://www.youtube.com/embed/vd4KqPqJvEo

Windows OS:

[!TIP] We recommend to install Anaconda and for managing dependencies, it is often recommended to create a new environment for your project:

Install Anaconda from https://www.anaconda.com/distribution/

Open the Anaconda Prompt

conda create --name gwastic_env python=3.9
conda activate gwastic_env

[!IMPORTANT] Install GWAStic via pip:

pip install gwastic_desktop

[!IMPORTANT] Run GWAStic:

Type gwastic in the Anaconda command line to start the software.

Linux OS:

[!TIP] We recommend to install Anaconda and for managing dependencies, it is often recommended to create a new environment for your project:

wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh
chmod +x Miniconda3-latest-Linux-x86_64.sh
./Miniconda3-latest-Linux-x86_64.sh
cd /home/username/miniconda3
source ~/miniconda3/bin/activate
conda create --name gwastic_env python=3.9
conda activate gwastic_env

[!IMPORTANT] Install GWAStic via pip:

pip install gwastic_desktop

[!IMPORTANT] Run GWAStic:

Type gwastic in the command line to start the software.

Mac OS:

[!TIP] We recommend to install Anaconda and for managing dependencies, it is often recommended to create a new environment for your project:

Install Anaconda (https://www.anaconda.com/distribution/)

Open the downloaded .pkg file to launch the installer and follow the on-screen instructions.

Open Terminal. You can do this by pressing Cmd + Space to open Spotlight Search, typing "Terminal", and pressing Enter.

conda create --name gwastic_env python=3.9
conda activate gwastic_env

[!IMPORTANT] Install GWAStic via pip:

pip install gwastic_desktop

For MacOS it's important to update matplotlib via conda:

conda install matplotlib

[!IMPORTANT] Run GWAStic:

Type gwastic in the command line to start the software.

2. Example datasets

[!NOTE] VCF file format (including vcf.gz) and Plink BED (binary) format are supported for all GWAS methods. In case of vcf, you first must convert the genotype data to bed file format. VCF example file

[!NOTE] Phenotypic data must be three columns (Family ID; Within-family ID; Value) text file delimited by space. Phenotype example file

[!TIP] We provide to two datasets to test GWASTic and validate the software:

Dataset 1 (Barley with row-type phenotype):

We have used a subset of data from a recent study (Milner et al.2019) focusing on the genetic basis of barley traits. The genotypic data was filtered by applying a genotyping rate cutoff of 0.02 and a minor allele frequency (MAF) threshold of 0.05. This resulted in a curated dataset comprising 949,174 SNPs. A random subset of 147 accessions from the Core 200 collection in the same study with available row-type phenotype data was selected. This phenotype describes the arrangement of kernels on the spike of the barley plant, specifically distinguishing between two-rowed and six-rowed barley - a crucial morphological and agricultural trait. The four distinct methods - XGB, RF, LR, and LMM - were employed to validate the peaks of two rowtype associated barley genes previously identified in (Milner et al. 2019).

Download the zip file containing the datasets from https://zenodo.org/records/11183758
Unpack the zip file
Start GWAStic
Choose the file barley_set\WGS300_005_0020.bed as genotypic file
Choose the file barley_set\bridge_row_type_GWAS.txt as phenotypic file
Select method and press Run GWAS

Dataset 2 (Arabidopsis thaliana with with Pseudomonas syringe):

For a quick testing and short run time, we provide a second dataset is on a hypersensitive response phenotype observed in 58 Arabidopsis thaliana host lines (∼900000 SNPs) when infected with Pseudomonas syringe expressing the avrRpm1 gene. Description of the original experiment can be found at https://arapheno.1001genomes.org/phenotype/17/.

Download the zip file containing the datasets from https://zenodo.org/records/11183758
Unpack the zip file
Start GWAStic
Choose the file small_set\example.bed as genotypic file
Choose the file small_set\pheno_gwas.csv as phenotypic file
Select method and press Run GWAS

3. References

Atwell S, Huang YS, Vilhjálmsson BJ, Willems G, Horton M, Li Y, Meng D, Platt A, Tarone AM, Hu TT, Jiang R, Muliyati NW, Zhang X, Amer MA, Baxter I, Brachi B, Chory J, Dean C, Debieu M, de Meaux J, Ecker JR, Faure N, Kniskern JM, Jones JD, Michael T, Nemri A, Roux F, Salt DE, Tang C, Todesco M, Traw MB, Weigel D, Marjoram P, Borevitz JO, Bergelson J, Nordborg M Nature. 2010 465(7298): 627-31. doi: 10.1038/nature08800

Lippert, C., Listgarten, J., Liu, Y. et al. FaST linear mixed models for genome-wide association studies. Nat Methods 8, 833–835 (2011). https://doi.org/10.1038/nmeth.1681

Milner,S. et al. (2019) Genebank genomics highlights the diversity of a global barley collection. Nature Genetics, 51(2):319-26. doi: 10.1038/s41588-018-0266-x.

Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, Maller J, de Bakker PIW: Daly MJ & Sham PC (in press) PLINK: a toolset for whole-genome association and population-based linkage analysis. American Journal of Human Genetics.

4. Acknowledgment

Gwastic has incorporated the FaST-LMM library (fastlmm.github.io), to enhance its Linear Mixed Models (LMM) feature. We thank Carl Kadie and David Heckerman for not only creating this exceptional tool but also providing outstanding support and discussions.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gwastic_desktop-0.5.5.tar.gz (25.5 MB view details)

Uploaded Source

Built Distribution

gwastic_desktop-0.5.5-py3-none-any.whl (25.7 MB view details)

Uploaded Python 3

File details

Details for the file gwastic_desktop-0.5.5.tar.gz.

File metadata

  • Download URL: gwastic_desktop-0.5.5.tar.gz
  • Upload date:
  • Size: 25.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.9.18 Windows/10

File hashes

Hashes for gwastic_desktop-0.5.5.tar.gz
Algorithm Hash digest
SHA256 a9453895ecdb4d5bde3691dd515bfdf53b17b5c9b5b2d7020297c76fa3ae9357
MD5 f94a7b07d313bb738b1e02ecb0731259
BLAKE2b-256 b5b4db77a07146c70dff13af0b87edff5f18e68602720efff449503f79b1cfc2

See more details on using hashes here.

File details

Details for the file gwastic_desktop-0.5.5-py3-none-any.whl.

File metadata

File hashes

Hashes for gwastic_desktop-0.5.5-py3-none-any.whl
Algorithm Hash digest
SHA256 9ac181ee4098bbf9d6931a359e02d5dc8871dc9bdbd5aa3e47c42c7732d0c123
MD5 0301a3a9644a2bbec985d1854b975c5c
BLAKE2b-256 33f8acbce1c9e906eb4e954771d365c1e5c390b02a950c253752ce4e247b5f02

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page