continuous integration of association summary statistics for network analysis
Project description
cimr
cimr is not yet released for public use
continuous integration and analysis of complex traits
YoSon Park
Useful links: Source repository | Issues & Ideas | Documentation | cimr-d
cimr (continuously integrated meta-resource) is a convenience tool for continuous analyses of variant-based association results from GWAS (genome-wide association studies), eQTL (expression-quantitative trait loci mapping) or other association studies. cimr aims to streamline the pre-analysis processing steps, provide standardized input files and automate scripting for standard downstream analyses.
Installation
Installing python
cimr requires python :math: ge 3.6. Installation of data analysis bundles such as miniconda or anaconda are recommended and will install all python packages cimr depends on. However, all required python packages can be downloaded and installed with setup.py or requirements.txt provided here.
Installing git lfs
cimr-d and some functionalities in cimr may use git large file storage (LFS) . See how to install git .
To install git-lfs on Ubuntu, run:
curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash sudo apt-get install -y git git-lfs git-lfs install
Alternatively, you can install git-lfs through conda:
conda install -c conda-forge git-lfs && git lfs install
Installing cimr
You can use pip to install the latest stable release of cimr:
pip3 install cimr
If you want to try out the nightly build of cimr at your own risk, clone the repository from git:
git clone https://github.com/greenelab/cimr.git cd cimr pip3 install -r requirements.txt python3 setup.py build python3 setup.py install
Analysis examples
Quality assurance and processing of association summary statistics files
You can use cimr to standardize public datasets using a yaml file, e.g.:
# example.yaml data_file: description: >- Global Lipid Genetics Consortium GWAS results for high-density cholesterol levels location: url: https://zenodo.org/record/3338180/files/HDL_Cholesterol.txt.gz md5: 2b28816a0a363db1a09ad9a6ba1a6620 columns: variant_id: panel_variant_id variant_chrom: chromosome variant_pos: position rsnum: variant_id data_info: citation: 10.1038/ng.2797 data_source: http://lipidgenetics.org/ data_type: gwas context: hdl cholesterol build: b38 sample_size: 187167 n_cases: na can_be_public: true method: name: linear regression tool: PLINK;SNPTEST;EMMAX;Merlin;GENABEL;MMAP website: >- http://zzz.bwh.harvard.edu/plink/download.shtml; https://mathgen.stats.ox.ac.uk/genetics_software/snptest/snptest.html; https://genome.sph.umich.edu/wiki/EMMAX; https://csg.sph.umich.edu/abecasis/Merlin/tour/assoc.html; http://www.genabel.org/sites/default/files/html_for_import/GenABEL_tutorial_html/GenABEL-tutorial.html; https://mmap.github.io/ contributor: name: Contributor Name github: contributorgithub email: contributoremail@emaildomain.emailextension
Details can be found in the cimr-d contributions.md.
Once the yaml file is prepared, you can run cimr locally:
cimr processor -process -yaml-file example.yaml
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file cimr-0.1.7.tar.gz
.
File metadata
- Download URL: cimr-0.1.7.tar.gz
- Upload date:
- Size: 28.5 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ed83369a3aa4e9b74746a3223bbfc3d2ea68bae61af9c69198555df7b9a9b282 |
|
MD5 | d84eb9877e12b9c6851d92750d1e6f81 |
|
BLAKE2b-256 | 40e9544161104bb9c2404f77fa1b620a982cbf563994d204b44c8270abfe82c7 |