
A library for prioritizing candidate genes by incorporating information from knowledge-based gene sets, effects of variants, GWAS and TWAS

Project description

POCKET

PriOritizing the Candidate genes by incorporating information of Knowledge-based gene sets, Effects of variants, GWAS and TWAS

Multi-omics datasets were used to prioritize candidate genes in GWAS QTL regions, and a comprehensive scoring system was established. First, based on variant effects annotated with SnpEff (Cingolani et al., 2012) and the GWAS P values of the variants, we used a Min-Max scaled score to evaluate the effects of variation in each gene region. Second, based on TWAS P values and cis-eQTL results, we evaluated the gene expression effect in the candidate region. Third, we used the variants in the gene region and in the upstream region of the gene to group each gene into haplotypes, and calculated a haplotype-based association score. Fourth, we used an SVM to predict the probability that each gene is related to the phenotype, based on 8,283 features collected from the following datasets: (i) GO categories, (ii) InterPro protein classifications, (iii) gene expression datasets from Lu et al. (2018), (iv) DEGs of known seed oil content (SOC) related mutant or overexpression (OE) lines, and (v) ICA components identified from population transcriptomes at 20 and 40 DAF. Lastly, we combined the scores from the four steps to determine which genes were more likely to affect the phenotype.
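As a rough illustration of the first and last steps described above, the sketch below Min-Max scales a per-gene variant score and then combines it with another component. It is not the POCKET implementation: the function names (min_max_scale, variant_score, combine_scores), the effect weights, the rule of multiplying the effect weight by -log10(P), and the use of a plain sum to combine components are all assumptions made for illustration, and the data are toy values.

```python
import math

# Hypothetical weights for SnpEff-style impact classes (assumed, not POCKET's values).
EFFECT_WEIGHT = {"HIGH": 1.0, "MODERATE": 0.6, "LOW": 0.3, "MODIFIER": 0.1}


def min_max_scale(values):
    """Rescale a list of numbers to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def variant_score(variants):
    """Largest (effect weight * -log10 P) among the variants of one gene.

    `variants` is a list of (impact, gwas_p) tuples. The combination rule
    is an assumption for this sketch.
    """
    return max(EFFECT_WEIGHT[impact] * -math.log10(p) for impact, p in variants)


def combine_scores(*components):
    """Sum per-gene scores across components (dicts mapping gene -> score)."""
    genes = set().union(*components)
    return {g: sum(c.get(g, 0.0) for c in components) for g in genes}


# Toy input: gene -> list of (impact class, GWAS P value).
gene_variants = {
    "geneA": [("HIGH", 1e-8), ("LOW", 1e-3)],
    "geneB": [("MODERATE", 1e-5)],
    "geneC": [("MODIFIER", 1e-2)],
}

genes = sorted(gene_variants)
raw = [variant_score(gene_variants[g]) for g in genes]
variant_component = dict(zip(genes, min_max_scale(raw)))

# Placeholder for a second component (e.g. an expression-based score in [0, 1]).
expression_component = {"geneA": 0.2, "geneB": 1.0, "geneC": 0.0}

total = combine_scores(variant_component, expression_component)
ranking = sorted(total, key=total.get, reverse=True)
print(ranking)
```

Scaling each component to the same [0, 1] range before combining keeps one evidence type from dominating the ranking only because of its units.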

Note: we tested POCKET in rapeseed and rice, and we expect that it can also be used for other species.

Requirements

We tested the code on Linux. The requirements are:

GCC C and C++ compiler (gcc, g++)

Python >= 3.6

pip for installing python packages

NumPy

SciPy

scikit-learn

limix

pandas

Joblib

plink software
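Before installing, the requirements above can be checked with a short script such as the following sketch. The import names (for example sklearn for scikit-learn) and the executable names gcc, g++ and plink are assumptions; in particular, your PLINK binary may be installed under a different name.

```python
import importlib.util
import shutil
import sys

# Import names assumed for the Python packages listed above.
PY_MODULES = ["numpy", "scipy", "sklearn", "limix", "pandas", "joblib"]
# Executable names assumed to be on PATH; adjust if your PLINK binary is named differently.
EXECUTABLES = ["gcc", "g++", "plink"]


def missing_requirements(modules=PY_MODULES, executables=EXECUTABLES):
    """Return a list of requirements that could not be found."""
    missing = []
    if sys.version_info < (3, 6):
        missing.append("python>=3.6")
    # find_spec returns None when a top-level module is not installed.
    missing += [m for m in modules if importlib.util.find_spec(m) is None]
    # shutil.which returns None when an executable is not on PATH.
    missing += [e for e in executables if shutil.which(e) is None]
    return missing


if __name__ == "__main__":
    problems = missing_requirements()
    if problems:
        print("Missing requirements: " + ", ".join(problems))
    else:
        print("All requirements found.")
```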

Installation

If all the requirements are met, you can install the POCKET library with the command:

python3 setup.py build install --user

or with pip:

pip install Gene_POCKET

Examples

You can follow the example to use POCKET.

Cite

Tang et al., Genome- and transcriptome-wide association studies provide insights into the genetic basis of natural variation of seed oil content in Brassica napus. Molecular Plant, in review.
