A useful module for any CompBio
Project description
# JKBio
A set of awesome functions & tools for Computational Genomists
![long genome](documentation/genome.jpg) ## Content
utils: where a bunch of helper functions and usefull general scripts are stoed - plots: a set of plotting tools based on [matplotlib]() and [bokeh]() to make volcano plots / CNV maps etc.. - helper: and additional helper functions to save data, do merging of dataframes…
terra: contains a set of functions that uses [dalmatian]() to interact with the [GCP]() powered genomics HPC platform: [Terra]().
sequencing: contains a set of function to works with bed/bam/fastqs…
rna: contains function to work with RNAseq (and related) data. - pyDESeq2: it is a python integration of [deseq2]() (the differential expression analyser) with [rpy2]()
mutations: a set of functions to work with maf files, vcf files etc..
google: functions and packages linked to google’s apis - google_sheet: function to upload a df as a google sheet - gcp: sets of functions to interact with google storage (relies on gsutil)
epigenetics: where we have things related to epigenomics - rose: where an updated version of the rose algorithm is stored (as a git submodule) - chipseq: has functions to read, merge, denoise, ChIP seq data, it contains a lot of functions required for the AML paper.
taigr: a version of taiga that do not requires RCurl (and can save you when you have a faulty RCurl-Curl link)
data: should not contain anything when pulled but is used by any of the functions in the folder, to save some required data files
cell_line_mapping: a set of functions to map cell line ids to other cell line ids based on an up to date google spreadsheet.
## Install
### with pip (WIP)
pip install JKBio ### dev mode (better for now)
`bash git clone git://github.com/jkobject/JKBio.git cd JKBio git submodule update --init `
then you can import files in python with e.g: `python from JKBio import TerraFunction as terra `
if JKBio is not in your path, first do:
`python import sys sys.path.append(RELATIVE_PATH_TO_JKBio) `
now you can install the necessary python packages:
`bash pip install requirements.txt pip install rpy2-bioconductor-extensions gseapy macs2 deeptools `
or if not using the requirements.txt (computation results might change):
`bash pip install numpy pandas `
`bash pip install bokeh dalmatian firecloud_dalmatian google_api_python_client gsheets gspread ipdb ipython matplotlib Pillow pybedtools pyBigWig pysam pytest requests rpy2 scikit_learn scipy seaborn setuptools taigapy taigapy typing venn rpy2-bioconductor-extensions gseapy macs2 deeptools `
then install the following tools: - [htslib/samtools](http://www.htslib.org/) - [bwa](https://github.com/lh3/bwa) just used once: - [bowtie2](http://bowtie-bio.sourceforge.net/bowtie2/index.shtml)
finaly you can install R packages (GSEABase, erccdashboard, GSVA, DESeq2):
`bash R -e 'if(!requireNamespace("BiocManager", quietly = TRUE)){install.packages("BiocManager")};BiocManager::install(c("GSEABase", "erccdashboard", "GSVA", "DESeq2"));' ` ## About
As I am working in different domains of computational genomics, I need to have a set of reusable function that will help me during my work. It can be functions to work with different tools that I have to use. Functions to do some plots. etc..
I will be trying to keep each of these functions functional and documented. Feel free to pull and start use anything that might be useful to you. If you see anything suspicious or not working. A pull request would definitely get reviewed within a day.
I hope to be able to give back to the community and maybe save a couple of hours to couple of researchers.
Best.
Apache license 2.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file JKBio-1.1.tar.gz
.
File metadata
- Download URL: JKBio-1.1.tar.gz
- Upload date:
- Size: 28.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/45.0.0 requests-toolbelt/0.9.1 tqdm/4.26.0 CPython/3.6.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6dd5b6933fb2cd9016253ebe4492fd380e2d0764a6f5ba91e170d632a576d641 |
|
MD5 | 95c5f1fc65b7ac2ec28eb87e2508fb86 |
|
BLAKE2b-256 | 722543430c82dcb7b52598670347ff5a9de786b1775556a2305c8b225df31904 |