Skip to main content

Fine-map transcriptome-wide association studies

Project description

FOCUS

FOCUS (Fine-mapping Of CaUsal gene Sets) is software to fine-map transcriptome-wide association study statistics at genomic risk regions. The software takes as input summary GWAS data along with eQTL weights and outputs a credible set of genes to explain observed genomic risk.

MA-FOCUS (Multi-Ancestry Fine-mapping Of CaUsal gene Sets) is an extension of FOCUS that leverages summary GWAS data with eQTL weights from multiple ancestries to increase the precision of credible sets for causal genes.

FOCUS are described in:

Probabilistic fine-mapping of transcriptome-wide association studies. Nicholas Mancuso, Malika K. Freund, Ruth Johnson, Huwenbo Shi, Gleb Kichaev, Alexander Gusev, and Bogdan Pasaniuc. Nature Genetics 51, 675-682 (2019).

MA-FOCUS are described in:

Multi-ancestry fine-mapping improves precision to identify causal genes in transcriptome-wide association studies. Zeyun Lu*, Shyamalika Gopalan*, Dong Yuan, David V. Conti, Bogdan Pasaniuc, Alexander Gusev, Nicholas Mancuso.

* indicates equal contribution

Installing

The easiest way to install is with pip:

pip install pyfocus --user

Check that FOCUS was installed by typing

focus --help

If that did not work, and pip install pyfocus --user was specified, please check that your local user path is included in $PATH environment variable. --user location and can be appended to $PATH by executing

export PATH=`python -m site --user-base`/bin/:$PATH

which can be saved in ~/.bashrc or ~/.bash_profile. To reload the environment type source ~/.bashrc or ~/source .bash_profile depending where you entered it.

Alternatively you can download the latest repo and then use setuptools:

git clone https://github.com/mancusolab/focus.git
cd focus
python setup.py install

We currently only support Python3.6+.

Example

Here is an example of how to perform LDL fine-mapping while prioritizing predictive models from adipose tissues:

focus finemap LDL_2010.clean.sumstats.gz 1000G.EUR.QC.1 gtex_v7.db --chr 1 --tissue adipose --locations 37:EUR --out LDL_2010.chr1

This command will scan LDL_2010.clean.sumstats.gz for risk regions 37:EUR generated by LDetect on GRCh37 for European ancestry and then perform TWAS+fine-mapping using LD estimated from plink-formatted 1000G.EUR.QC.1 and eQTL weights from gtex_v7.db GRCh37.

Here is an example of how to perform multi-ancestry fine-mapping from European and African ancestry (just use : to concatenate):

focus finemap LDL.EUR.sumstats.gz:LDL.AFR.sumstats.gz 1000G.EUR.QC.1:1000G.AFR.QC.1 genoa_EUR.db:genoa_AFR.db --chr 1 --tissue LCL --locations 37:EUR-AFR --out LDL_mafocus_chr1

This command will scan GWAS summary data LDL.EUR.sumstats.gz and LDL.AFR.sumstats.gz for risk regions 37:EUR-AFR generated by modified LDetect on GRCh37 for European and African ancestry and then perform TWAS+fine-mapping using LD estimated from plink-formatted 1000G.EUR.QC.1 and 1000G.AFR.QC.1 and eQTL weights from genoa_EUR.db and genoa_AFR.db.

Note: --locations is a new required parameter compared to versions prior to v0.8. This parameter specifies the genomic regions to be fine-mapped. We recommend to use independent regions for single ancestry or multiple ancestries. Please see the wiki for more instructions.

Please see the wiki for more details on how to use focus, ma-focus and links to database files.

Notes

Version 0.802: Fix the bug that .gitignore includes *.tsv so that gencode files couldn't be pushed to github.

Version 0.801: Added gencode_map_v38 and multiple LD block files in GRCh38. Fixed prior_prob bugs.

Version 0.8: Added MA-FOCUS support. Added GWAS imputation using imp-G. Added additional choice for prior probability for causal genes (number of genes in the risk regions).

Version 0.6.10: Fixed bug where weight database allele mismatch with GWAS broke inference.

Version 0.6.5: Fixed bug in newer versions of matplotlib not accepting string for colormaps. Fixed legend bug in plot. Fixed bug that mismatched string and category when supplying custom locations.

Version 0.6: Fixed bug where only one of the two alleles was reversed complemented breaking alignment. For now these instances are dropped. Added option --use-ens-id for FUSION import to indicate the main model label is an Ensembl ID rather than HGNC symbol.

Version 0.5: Plotting sorts genes based on tx start. Various bug fixes that limited the number of queried SNPs and plotting when using newer matplotlib.

Version 0.4: Added FUSION import support.

Version 0.3: Initial release. More to come soon.

Software and support

If you have any questions or comments please contact nicholas.mancuso@med.usc.edu and zeyunlu@usc.edu

For performing various inferences using summary data from large-scale GWASs please find the following useful software:

  1. Association between predicted expression and complex trait/disease FUSION and PrediXcan
  2. Estimating local heritability or genetic correlation HESS
  3. Estimating genome-wide heritability or genetic correlation UNITY
  4. Fine-mapping using summary-data PAINTOR
  5. Imputing summary statistics using LD FIZI
  6. TWAS simulator (https://github.com/mancusolab/twas_sim)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyfocus-0.802.tar.gz (2.4 MB view details)

Uploaded Source

Built Distributions

pyfocus-0.802-py3.7.egg (3.4 MB view details)

Uploaded Egg

pyfocus-0.802-py3-none-any.whl (2.4 MB view details)

Uploaded Python 3

File details

Details for the file pyfocus-0.802.tar.gz.

File metadata

  • Download URL: pyfocus-0.802.tar.gz
  • Upload date:
  • Size: 2.4 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.7.9

File hashes

Hashes for pyfocus-0.802.tar.gz
Algorithm Hash digest
SHA256 9ce00078d8a587577561d811ad8ba58630f4b65ce7d4415826542f8884511929
MD5 c3ec8c42f52af6589c58db5b111bf059
BLAKE2b-256 ea8a1889af31bc7dc7057249ae21440a8c68e767ada2cec58f89b189ea075d01

See more details on using hashes here.

File details

Details for the file pyfocus-0.802-py3.7.egg.

File metadata

  • Download URL: pyfocus-0.802-py3.7.egg
  • Upload date:
  • Size: 3.4 MB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.7.9

File hashes

Hashes for pyfocus-0.802-py3.7.egg
Algorithm Hash digest
SHA256 ffb126952665fed2e9a6b10a1de8b11cea0c3613f8f53ba1d0042d5ae0dd8454
MD5 6dc40ebd0da242a9dd7e78aec79efac5
BLAKE2b-256 294faa7c72d858f6d03a5279e79f033e853db4ed04fdc79c24652ac57b0787a2

See more details on using hashes here.

File details

Details for the file pyfocus-0.802-py3-none-any.whl.

File metadata

  • Download URL: pyfocus-0.802-py3-none-any.whl
  • Upload date:
  • Size: 2.4 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.7.9

File hashes

Hashes for pyfocus-0.802-py3-none-any.whl
Algorithm Hash digest
SHA256 be88e7eeab609c1c41238c9e2b791e0dae88fd3850311770366f467e864203b2
MD5 6322a93307bc54981670607583d10071
BLAKE2b-256 ca4408f3950057768da5dd9ddd371c70b82100a5f46c5f2e5cc9b5bad542158d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page